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Sorghum 


Sorghum includes many widely cultivated grasses 
having a variety of names. Sorghum is known as 
guinea corn in West Africa, Kafir corn in South Africa, 
mtama in East Africa, and durra in the Sudan. In 
India sorghums are called juar, jowar, or cholam; in 
China, kaoliang; and in the United States, milo. Culti- 
vated sorghums in the United States are classified as a 
single species, Sorghum bicolor, although there are 
many varieties and hybrids. The two major types of 
sorghum are the grain, or nonsaccharine, type, culti- 
vated for grain production and to a lesser extent for 
forage, and the sweet, or saccharine, type, used for 
forage production and for making syrup and sugar. 


Grain Sorghum 


Grain sorghum is grown in the United States chiefly 
in the Southwest and the Great Plains. It is a warm- 
season crop which withstands heat and moisture 
stress better than most other crops, but extremely 
high temperatures and extended drought may re- 
duce yields. Sorghum responds well to optimum 
growing conditions, fertility, and management to 
produce large grain yields. It is extensively grown 
in Texas, Kansas, Nebraska, Oklahoma, Missouri, Col- 
orado, and South Dakota. This grain production is fed 
to cattle, poultry, swine, and sheep primarily, with 
less than 2% going into nonagricultural markets such 
as starch, dextrins, flour, and industrial preparations. 
Sorghum is considered nearly equal to corn in feed 
value. 

Origins and description. Sorghums originated in the 
northeastern quadrant of Africa. They have been 
grown in Africa and Asia for more than 2000 years. 
Introduction of a sorghum called chicken corn was 
made on the southern Atlantic coast in Colonial 
American times, but it was not successfully culti- 
vated. The variety escaped and became a weed. Prac- 
tically all grain sorghums of importance, until re- 
cent years, introduced into the United States were 
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tall, late maturing, and generally unadapted. Since 
its introduction into the United States, the crop has 
been altered in many ways, these changes coming 
as a result of naturally occurring genetic mutations 
combined with hybridization and selection work of 
plant breeders. The rapid expansion in acreage came 
with the development of widely adapted varieties 
and later higher-yielding hybrids. The fact that hy- 
brid grain sorghums with high yield potential could 
be produced with stems that are short enough for 
harvesting mechanically (Fig. 1) made the crop ap- 
pealing to many farmers. 

Grain sorghum is difficult to distinguish from corn 
in its early growth stages, but at later stages it 
becomes strikingly different. Sorghum plants may 
tiller (put out new shoots), producing several head- 
bearing culms from the basal nodes. Secondary 
culms may also develop from nodal buds along the 
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Fig. 1. Dwarf grain sorghum hybrids are grown throughout the sorghum belt of the United 
States because their short stems make them adaptable to mechanical harvesting. 
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Fig. 2. Heads of grain sorghum. (a) Typical variety. (b) Typical hybrid. 


main stem. The inflorescence (head) varies from a 
dense to a lax panicle, and the spikelets produce per- 
fect flowers that are subject to both self- and cross- 
fertilization. The amount of natural cross-pollination 
ranges from 25 to 0% but averages about 5%. Mature 
grain in different varieties varies in size and color 
from white to cream, red, and brown. Color pig- 
ments are located in the pericarp (outer covering) of 
the grain or in a layer of cells beneath the pericarp 
called the testa. In some varieties the testa is absent; 
when the testa is present, however, the seed color is 
brown or some variation of brown. The endosperm 
(starch portion of the seed) is either white or yellow. 
There are endosperm modifications which cause the 
starch to be sugary or waxy, or to process a higher 
lysine content. The texture of the endosperm may 
vary from completely corneous to full floury. Most 
sorghums are intermediate in endosperm texture. 
Grain sorghums are classified into types desig- 
nated as milo, kafir, feterita, hegari, durra, shallu, 
kaoliang, and zerazera. This classification is based on 
morphological rather than cytological differences, 
since all types (including forage sorghums, broom- 


corn, and Sudangrass) have 10 pairs of chromosomes 
and freely cross. 

Varieties and hybrids. A combine-height grain 
sorghum was developed before World War I but was 
not accepted by farmers or agriculturalists. Accep- 
tance of combine grain sorghums was stimulated 
by the drought of the 1930s and by a farm labor 
shortage during World War I. A number of widely 
adapted productive varieties were developed dur- 
ing this period. Nonetheless, varieties disappeared 
rapidly when hybrids were introduced in the mid- 
1950s (Fig. 2). 

The commercial production of seed of sorghum 
hybrids was made possible by the discovery of cyto- 
plasmic male sterility in the early 1950s. This type of 
sterility, as used in corn and a few other crops, pre- 
vents the development of normal pollen grains and 
makes possible the formation of hybrid seed by cross- 
pollination. Because the flowers of sorghum are per- 
fect, containing both staminate (male) and pistillate 
(female) parts, the production of commercial quanti- 
ties of hybrid seed was not possible without a work- 
able male-sterility mechanism. The first hybrid seed 
in quantity was sold to farmers in 1957, and within a 
period of less than 5 years hybrids had replaced most 
of the varieties previously grown. Sorghum hybrids 
yield at least 20% more than varieties, and concur- 
rent improvements in cultural practices have com- 
bined to boost per-acre yield over 50% since their 
development. The emphasis on research in fertiliza- 
tion, irrigation, insect and disease control, and other 
areas has provided information to help account for 
the remarkable yield and production increases over 
the years. 

Through a process of conversion many of the best 
varieties from around the world are being changed 
from tall, late, unadapted types to short, early, very 
useful cultivars. From this program plant breeders 
are expanding germ-plasm utilization and are devel- 
oping parents of hybrids for the entire sorghum- 
producing world. See BREEDING (PLANT). 

Planting. Grain sorghum seeds are small and 
should not be planted too deep since sorghum lacks 
the soil-penetrating ability of corn. A seeding depth 
of 1 in. (2.5 cm) is acceptable in moist and friable 
soil, but 2 in. (6 cm) may be necessary under dry 
soil conditions. The seeds are planted either in rows 
wide enough for tractor cultivation or in narrower 
rows if cultivation is not intended. Row planters for 
corn, cotton, field beans, and sugarbeets may be used 
when equipped with the proper seed plates. When 
wheat farming is practiced, much grain sorghum is 
planted with a grain drill with alternate or various 
feeder holes plugged to provide the desired row 
spacing. Soil temperature largely determines when 
the seed should be planted, assuming soil moisture 
conditions are adequate. Being tropical in origin, 
sorghum should not be planted in the spring until 
soil temperature is 65-70°F (18-22°C) at the plant- 
ing depth and until there is little chance of subse- 
quent lower temperatures. Dry-land grain sorghum is 
planted at a rate of 3-5 Ib/acre 3.3-5.5 kg/hectare), 
and the rate is increased up to 10 Ib/acre (11 kg/ha) 


when planted under more favorable moisture condi- 
tions and irrigation. 

Cultivation. Good seedbed preparation is essential 
for full stands and for weed control. Tilling fields im- 
proves soil structure in most cases and often aids 
in warming the soil. A rotary hoe is effective in con- 
trolling weeds when the plants are small. Subsequent 
cultivations are made as needed with the same equip- 
ment used for cultivating corn. Minimum tillage is 
practiced in many areas where control of weeds with 
chemicals is a part of the technology of sorghum pro- 
duction. When used correctly, herbicides are a boon 
to sorghum culture, but when used carelessly, dis- 
appointment may result. There are several chemical 
herbicides registered and approved for weed control 
in sorghum. 

Harvesting. Nearly all grain sorghum is harvested 
standing in the field with a combine (Fig. 1). Harvest 
begins in southern Texas in early June and slowly 
proceeds northward. In the central and northern 
Great Plains, the crop is usually harvested after frost. 
The grain threshes freely from the head when the 
seed moisture content is 20-25% or lower. The grain 
should not contain more than 12% moisture to en- 
sure safe storage after harvest. Grain dryers are often 
used when the grain at harvest is not dry enough for 
optimum storage. Proper storage must be maintained 
until the grain can be marketed. The industry’s eco- 
nomic health lies in the ability to provide grain of the 
right quality in the right quantity at the proper time 
and place. Frederick R. Miller 


Sweet Sorghum 


Commonly known as sorgo, sweet sorghum was in- 
troduced into North America from China in 1850, 





Fig. 3. Sweet sorghum in Oklahoma. The scale indicates 
feet. 1 ft = 0.3 m. (USDA) 


although its ancestry traces back to Egypt. It is an 
annual, rather drought-resistant crop. The culms are 
from 2 to 15 ft (0.6 to 4.6 m) tall, and the hard cortical 
layer, or shell, encloses a sweet, juicy pith that is in- 
terspersed with vascular bundles. At each node both 
a leaf and a lateral bud alternate on opposite sides; 
the internodes are alternately grooved on one side. 
Leaves are smooth with glossy or waxy surfaces and 
have margins with small, sharp, curved teeth. The 
leaves fold and roll up during drought. The inflores- 
cence is a panicle of varying size having many pri- 
mary branches with paired ellipsoidal spikelets con- 
taining two florets in each fertile sessile spikelet. The 
plant is self-pollinated. See CORTEX (PLANT); PITH. 
Seed is planted in cultivated rows and fertilized 
similarly to corn. Maturity varies between 90 and 
125 days. The juice contains about 12% sugar. The 
main sorghum-syrup-producing area is in the south- 
central and southeastern United States (Fig. 3). 
Leonard D. Baver 


Diseases 


Sorghums are plagued by a variety of diseases that 
vary in importance from year to year and among loca- 
tions because of the environment, plant genotypes, 
cultural practices, variations in pathogens, or the in- 
teraction of any of these factors. These diseases may 
be classified into five general categories: those that 
rot the seed or injure seedling roots; those that attack 
the leaves, making the plants less productive; those 
that attack or destroy the grain in the heads; those 
that cause root and stalk rots; and those caused by 
viruses or viruslike organisms. 

Fungi causing seed rotting and seedling diseases 
may be seed-borne or soil-inhabiting and are most 
destructive after planting, when the soil is cold and 
wet. Species of Fusarium, Pythium, Helminthospo- 
rium, and Penicillium are the most important fungi 
involved. Damage may be considerably reduced by 
planting sound hybrid seed treated with an approved 
fungicide in soil warm enough to ensure prompt ger- 
mination. 

Leaf diseases are caused by three species of bacte- 
ria and at least eight species of fungi. Many of these 
pathogens are favored by high temperatures and hu- 
mid conditions, but a few are favored by cool, humid 
conditions. Disease lesions occurring as discolored 
spots or streaks may coalesce to involve the entire 
leaf. Rotation, seed treatment, and the use of resis- 
tant varieties are recommended control measures. 

Three of the four known smuts of sorghum are 
found in the United States; covered kernel, loose ker- 
nel, and head smut. Kernel smuts, while historically 
important, are now controlled by routine seed treat- 
ments. Head smut (Fig. 4) destroys the entire head 
and continues to cause major crop losses. Resistant 
hybrids are providing control, although new strains 
of the fungus pathogen require the periodic devel- 
opment of new resistant hybrids. 

Sorghum downy mildew (Fig. 5) has spread 
throughout the southern and central sorghum- 
growing regions. Losses result when the disease 
systemically invades the plant. Diseased plants are 
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barren. Fortunately, excellent resistance has been de- 
veloped and used. 

Maize dwarf mosaic, caused by an aphid- 
transmitted virus, spread throughout the sorghum- 
growing regions during the 1970s. Host tolerance re- 
duces losses caused by this prevalent disease. Yellow 
sorghum stunt, caused by a mycoplasmalike organ- 
ism and transmitted by leafhoppers, rarely reaches 
economically significant proportions. 

Several diseases of the roots and stalks are of 
primary importance. Periconia root and crown 
rot, which caused extensive damage to milo and 
darso sorghums, is controlled by resistant vari- 
eties. Pytbium graminicola causes a major root 
rot during periods of frequent rainfall in dryland 
sorghums. Charcoal rot, most evident as the plant ap- 
proaches maturity under extreme conditions of heat 
or drought, causes shredding of the stalks and ex- 
tensive lodging. Anthracnose, or red rot, develops in 
susceptible hybrids during wet years; plants lodge at 
the base of the peduncle. Development of resistant 
or tolerant hybrids appears to be the only effective 
method of control of the stalk rots if irrigation is not 
possible. 

Grain mold is a disease of mature grain caused by 
species of Fusarium and Curvularia. These fungi 





Fig. 4. Sorghum head smut, a disease that completely destroys the normal head and 
replaces it with masses of smut spores. 





Fig. 5. Sorghum downy mildew, a serious fungus disease 
that systematically invades sorghum plants, causing 
stripped leaves and barren stalks. 


infect at the flowering stage and rot the seed as it 
matures, particularly during wet weather at harvest 
time. See PLANT PATHOLOGY. Richard A. Frederiksen 

Bibliography. W. E Bennett et al., Modern Grain 
Sorghum Production, 1990; H. Doggett, Sorghum, 
1970; J. R. Quinby, Sorghum Improvement and the 
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Williams, R. A. Frederiksen, and G. D. Bengston, 
Proceedings of the International Sorghum Dis- 
ease Workshop, International Crops Research Insti- 
tute for the Semi-Arid Tropics, Hyderabad, 1980; 
R. J. Williams, R. A. Frederiksen, and J. C. Girard, 
Sorghum and Pearl Millet Disease Identification 
Handbook, International Crops Research Institute 
for the Semi-Arid Tropics, Inform. Bull. 2, Hyderabad, 
1978. 


Sound 


The mechanical excitation of an elastic medium. 
Originally, sound was considered to be only that 
which is heard. This admitted questions such as 
whether or not sound was generated by trees falling 
where no one could hear. A more mechanistic ap- 
proach avoids these questions and also allows acous- 
tic disturbances too high in frequency (ultrasonic) 
to be heard or too low (infrasonic) to be classed as 
extensions of those events that can be heard. 

A source of sound undergoes rapid changes of 
shape, size, or position that disturb adjacent ele- 
ments of the surrounding medium, causing them 
to move about their equilibrium positions. These 
disturbances in turn are transmitted elastically to 


neighboring elements. This chain of events propa- 
gates to larger and larger distances, constituting a 
wave traveling through the medium. If the wave 
contains the appropriate range of frequencies and 
impinges on the ear, it generates the nerve im- 
pulses that are perceived as hearing. See HEARING 
(HUMAN). 


Acoustic Pressure 


A sound wave compresses and dilates the mate- 
rial elements it passes through, generating associ- 
ated pressure fluctuations. An appropriate sensor 
(a microphone, for example) placed in the sound 
field will record a time-varying deviation from the 
equilibrium pressure found at that point within the 
fluid. The changing total pressure P measured will 
vary about the equilibrium pressure Po by a small 
amount called the acoustic pressure, p = P — Po. 
The SI unit of pressure is the pascal (Pa), equal 
to 1 newton per square meter (N/m?). Standard at- 
mospheric pressure (14.7 Ib/in.”) is approximately 
1 bar = 10° dyne/cm? = 10° Pa. For a typical sound 
in air, the amplitude of the acoustic pressure may 
be about 0.1 Pa (one-millionth of an atmosphere); 
most sounds cause relatively slight perturbations of 
the total pressure. See MICROPHONE; PRESSURE; PRES- 
SURE MEASUREMENT; PRESSURE TRANSDUCER; SOUND 
PRESSURE. 


Plane Waves 


One of the more basic sound waves is the travel- 
ing plane wave. This is a pressure wave progressing 
through the medium in one direction, say the +x di- 
rection, with infinite extent in the y and z directions. 
A two-dimensional analog is ocean surf advancing to- 
ward a very long, straight, and even beach. The surf 
looks like a long, corrugated surface advancing uni- 
formly toward the shore but extending transversely 
in a series of parallel peaks and troughs. A plane 
wave has acoustic pressure controlled by the one- 
dimensional wave equation in cartesian coordinates, 
Eq. (1). An appropriate solution to this equation is 
Eq. (2), with fany function differentiable twice with 


ap _ 10°p 

ie ae D 
x 

p=f(t- =) D 


respect to x or t. This solution has the property that 
the acoustic pressure p has a single value for all pairs 
of x and ¢ such that the phase (¢ — x/c) is constant. 
At some point Xp and time tọ, the acoustic pressure p 
has the value po = fto — Xo/c). As t increases from to 
to to + Af, the value po will move from xo to a new 
position xo + Ax, where Ax and At are related by 
Eq. (3). Solving for Ax in terms of Af gives Ax/At=c, 


A 
Xo + Ax) =h- Xo 6) 
c C 


(to + At) — 
so that the specific value po is translated through 
space with a speed of propagation c. Thus, c is the 
speed of sound of the wave. See WAVE (PHYSICS); 
WAVE EQUATION; WAVE MOTION. 


Harmonic waves. A most important plane wave, 
both conceptually and mathematically, is the 
smoothly oscillating monofrequency plane wave de- 
scribed by Eq. (4). The amplitude of this wave is P. 


p= P cos [2ar (- J (4) 


The phase (argument of the cosine) increases with 
time, and at a point in space the cosine will pass 
through one full cycle for each increase in phase of 
27. The period T required for each cycle must there- 
fore be such that 27fT = 27, or T= 1/f, so that f= 1/T 
can be identified as the frequency of oscillation of the 
pressure wave. During this period 7, each portion 
of the waveform has advanced through a distance 
à = cT, and this distance i must be the wavelength. 
This gives the fundamental relation (5) between 
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the frequency, wavelength, and speed of sound in 
any medium. For example, in air at room temperature 
the speed of sound is 343 m/s (1125 ft/s). A sound of 
frequency 1 kHz (1000 cycles per second) will have 
a wavelength of à = c/f = 343/1000 m = 0.34 m 
(1.1 ft). Lower frequencies will have longer wave- 
lengths: a sound of 100 Hz in air has a wavelength 
of 3.4 m (11 ft). For comparison, in fresh water at 
room temperature the speed of sound is 1480 m/s 
(4856 ft/s), and the wavelength of 1-kHz sound is 
nearly 1.5 m 5 ft), almost five times greater than the 
wavelength for the same frequency in air. 

Because of many close analogies between sound 
and electricity, it is often convenient in practice to 
define the effective pressure amplitude P, = Ph/ 2in 
a wave such as that of Eq. (4). Similarly, the frequency 
is often represented by the angular frequency 
w = 2x f and the wavelength expressed reciprocally 
by the wave number k = 2x7/À. With these defini- 
tions, Eq. (4) could be written as Eq. (6), and Eq. (5) 
as Eq. (7). 


p=~2P, cos (wt — kx) © 
= 7 
aes 7) 


See ALTERNATING-CURRENT CIRCUIT THEORY. 
Transient and continuous waves. Monofrequency 
waves are the building blocks for more complicated 
waves that are either continuous or transient in time. 
For example, a sawtooth continuous wave has acous- 
tic pressure which, during each cycle, begins at 
a positive value Pmax, decreases uniformly (linearly 
with time) to a negative value —Pmax at the end of 
the period T, and then jumps instantaneously back 
to the positive value Pyax, repeating this cycle indefi- 
nitely. It is a direct consequence of the wave equation 
that this waveform can be described equivalently 
as a Fourier superposition, or summation, of waves 
~Pit+p2+p3+--:-, each of the form of Eq. (8), where 


Pr= Z Tens sin [2x (¢- al (8) 
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the fundamental frequency f= 1/T gives the repeti- 
tion rate of the waveform and 7 has integer values 1, 
2, 3, .... The waves p, constitute the overtones of 
the waveform. In this case the frequencies nf of the 
overtones are integer multiples of the fundamental 
frequency f, and the overtones are termed harmon- 
ics. Any signal that is nonrepeating or is nonzero only 
with some limited duration of time can be written as 
a summation of waves that are not harmonically re- 
lated or, in the extreme, an integration of waveforms 
of all frequencies. As an illustration, a very sharp pos- 
itive pulse of pressure traveling in the x direction and 
lasting for an infinitesimally short duration of time is 
represented by the Dirac delta function é(@¢ — x/c). 
This function, which has unbounded value where 
t = x/c and is zero elsewhere, can be expressed as an 
integral, as in Eq. 9). These considerations show that 


a(r-ž)=2 f cos[zar(r-*)| df 9) 


a study of monofrequency sound waves is sufficient 
to deal with all sound waves, and that the fundamen- 
tal concepts of frequency and wavelength permeate 
all aspects of sound. See FOURIER SERIES AND TRANS- 
FORMS; HARMONIC (PERIODIC PHENOMENA); NONSI- 
NUSOIDAL WAVEFORM; WAVEFORM. 

Standing waves. In many situations, sound is gen- 
erated in an enclosed space that traps the sound 
within or between boundaries. For example, if there 
is a boundary that causes the pressure wave given 
by Eq. (4) to be completely reflected back on itself, 
then there is an additional wave given by Eq. (10), 


p' =Pcos [227 G + I (10) 


which represents a monofrequency plane wave trav- 
eling in the —x direction. This wave combines with 
the incident wave, resulting in a standing wave witha 
pressure dependence py given by Eq. (11). This kind 


i 21x 
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of wave would be found within a sounding organ 
pipe and is analogous to a vibrating guitar string. The 
pressure waveform pr; is zero at positions x = 4/4, 
34/4, 5/4, .... These nodes (pressure nulls) occur 
every half-wavelength in space. Midway between 
them are antinodes at which the pressure waveform 
oscillates in time between its maximum and min- 
imum values of +2P. More complicated standing 
waves can be formed from waves traveling in a num- 
ber of directions. For example, vibrating panels or 
drum heads support standing waves in two dimen- 
sions, and steady tones in rooms can excite three- 
dimensional standing waves. See VIBRATION. 


Particle Speed and Displacement 


As sound passes through a fluid, the small fluid ele- 
ments are displaced from their equilibrium rest posi- 
tions by the fluctuating pressure gradients. The mo- 
tion of a fluid element is described by the particle 
velocity “ with which it moves about its equilibrium 


position. This is related to the pressure by Newton’s 
second law. With the neglect of some small non- 
linear and viscous terms, this law can be written 
as Eq. (12), where po is the equilibrium density of 


Ou 
po — = -Vp (12) 


ot 
the fluid and V is the gradient operator. See CALCU- 
LUS OF VECTORS; FLUID-FLOW PRINCIPLES; NEWTON’S 
LAWS OF MOTION. 

For a one-dimensional plane wave moving in the 
+x direction, the acoustic pressure p and particle 
speed u are proportional and related by p/u = poc. 
The product oc is a basic measure of the elastic 
properties of the fluid and is called the characteris- 
tic impedance. This is an index of the acoustic “hard- 
ness” or “softness” of a fluid or solid. (The term char- 
acteristic impedance is restricted to the plane-wave 
value of oc; more generally, the term specific acous- 
tic impedance is used.) Some representative values 
of the speed of sound, the density, and the specific 
acoustic impedance are given in Table 1. Significant 
differences among these quantities for gases, liquids, 
and solids are evident. See ACOUSTIC IMPEDANCE. 

Because fluids cannot support shear (except for 
small effects related to viscosity), the particle veloc- 
ity of the fluid elements is parallel to the direction 
of propagation of the sound wave and the motion is 
longitudinal. In contrast, solids can transmit shear- 
ing or bending motion—reeds, strings, drum heads, 
tuning forks, and chimes can vibrate transversely. 


Description of Sound 


The characterization of a sound is based primarily on 
human psychological responses to it. Because of the 
nature of human perceptions, the correlations be- 
tween basically subjective evaluations such as loud- 
ness, pitch, and timbre and more physical qualities 
such as energy, frequency, and frequency spectrum 
are subtle and not necessarily universal. 

Intensity, loudness, and the decibel. The strength 
of a sound wave is described by its intensity J. 
From basic physical principles, the instantaneous 
rate at which energy is transmitted by a sound wave 
through unit area is given by the product of acous- 
tic pressure and the component of particle velocity 
perpendicular to the area. The time average of this 
quantity is the acoustic intensity, as in Eq. (13). Fora 
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plane monofrequency traveling wave, this is given by 
Eq. (14) in the direction of propagation. If all quanti- 
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ties are expressed in SI units (pressure amplitude or 
effective pressure amplitude in Pa, speed of sound 
in m/s, and density in kg/m’), then the intensity will 
be in watts per square meter (W/m?). 

Because of the way the strength of a sound is per- 
ceived, it has become conventional to specify the 


























TABLE 1. Density, speed of sound, and specific acoustic impedance in selected materials* 
Gases Density (0), kg/m® Speed of sound (c), m/s 
Air e2 343 
Oxygen (0°C; 32°F) 1.43 317 
Hydrogen (0°C; 32°F) 0.09 1270 
Liquids Density (po), kg/mê Speed of sound (c), m/s 
Water 998 1,481 
Seawater (13°C; 55°F) 1,026 1,500 
Ethyl alcohol 790 1,150 
Mercury 13,600 1,450 
Glycerin 1,260 1,980 
Specific acoustic impedance 
Speed of sound (c), m/s (poc), N- s/m® 
Density (p0), o o 
Solids kg/m? bar bulk bar bulk 
Aluminum 2,700 57150 6,300 13.9 17.0 
Brass 8,500 3,500 4,700 29.8 40.0 
Lead 11,300 1,200 2,050 13.6 23.2 
Steel 7,700 5,050 6,100 39.0 47.0 
Glass 2,300 5,200 5,600 12.0 12.9 
Lucite 1,200 1,800 2,650 22 3.2 
Concrete 2,600 — 3,100 — 8.0 
“Temperature = 20°C = 68°F unless otherwise indicated. Pressure = 1 atm = 101.3 kPa = 14.7 Ibf/in.2 41 kg/m = 0.0624 Ibm/ft3; 1 m/s = 
3.281 ft/s; 1 N - s/m = 6.366 x 1073 Ibf - s/ft3. 
SOURCE: After L. E. Kinsler et al., Fundamentals of Acoustics, 3d ed., Wiley, 1982. 








intensity of sound in terms of a logarithmic scale 
with the (dimensionless) unit of the decibel (dB). 
An individual with unimpaired hearing has a thresh- 
old of perception near 10~!? W/m? between about 
2 and 4 kHz, the frequency range of greatest sensi- 
tivity. As the intensity of a sound of fixed frequency 
is increased, the subjective evaluation of loudness 
also increases, but not proportionally. Rather, the lis- 
tener tends to judge that every successive doubling 
of the acoustic intensity corresponds to the same in- 
crease in loudness. This is conveniently expressed 
by Eq. (15), where the logarithm is to base 10, J 


Lı = 10 log (=) (15) 


ref 


is the intensity of the sound field in W/m?, Iver is 
10-'? W/m?, and Z; is the intensity level in dB re 
10-!* W/m2. On this scale, the weakest sounds that 
can be perceived have an intensity level of 0 dB, 
normal conversational levels are around 60 dB, and 
hearing can be damaged if exposed even for short 
times to levels above about 120 dB. Every doubling 
of the intensity increases the intensity level by 3 
dB. For sounds between about 500 Hz and 4 kHz, 
the loudness of the sound is doubled if the intensity 
level increases about 9 dB. This doubling of loudness 
corresponds to about an eightfold increase in inten- 
sity. For sounds lying higher than 4 kHz or lower 
than 500 Hz, the sensitivity of the ear is appreciably 
lessened. Sounds at these frequency extremes must 
have higher threshold intensity levels before they can 
be perceived, and doubling of the loudness requires 
smaller changes in the intensity with the result that at 
higher levels sounds of equal intensities tend to have 
more similar loudnesses. It is because of this charac- 


teristic that reducing the volume of recorded music 
causes it to sound thin or tinny, lacking both highs 
and lows of frequency. See DECIBEL; LOUDNESS. 
Since most sound-measuring equipment detects 
acoustic pressure rather than intensity, it is conve- 
nient to define an equivalent scale in terms of the 
sound pressure level. Under the assumption that 
Eq. (14) is valid for most commonly encountered 
sound fields, a reference effective pressure ampli- 
tude Pef = 20 micropascals (j1Pa) generates the refer- 
ence intensity of 10~!? W/m? in air (at standard tem- 
perature and pressure) and a sound pressure level 
(SPL) can be defined by Eq. (16), where P, is the 





Pe 
SPL = 20 log ( ) (16) 
Pret 


effective pressure amplitude in Pa. The intensity 
level and sound-pressure level are usually taken as 
identical, but this is not always true (these levels may 
not be equivalent for standing waves, for example). 
For underwater sounds, the sound pressure level is 
also expressed by Eq. (16), but the reference effec- 
tive pressure is defined as 1 Pa. 

Frequency and pitch. How “high” sound of a par- 
ticular frequency appears to be is described by the 
sense of pitch. A few minutes with a frequency gen- 
erator and a loudspeaker show that pitch is closely 
related to the frequency. Higher pitch corresponds 
to higher frequency, with small influences depend- 
ing on loudness, duration, and the complexity of 
the waveform. For the pure tones (monofrequency 
sounds) encountered mainly in the laboratory, pitch 
and frequency are not found to be proportional. Dou- 
bling the frequency less than doubles the pitch. For 
the more complex waveforms usually encountered, 
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however, the presence of harmonics favors a propor- 
tional relationship between pitch and frequency. See 
PITCH. 

Consonance and dissonance. Two tones generated 
together cannot be distinguished from each other 
if their frequencies are the same. If their frequen- 
cies fı and (slightly higher) fz are nearly but not ex- 
actly identical, the ear will perceive a slow beating, 
hearing a single tone of slowly and regularly vary- 
ing amplitude. The combination yields an equiva- 
lent signal given by Eq. (17), which is heard as a 


cos (2af t) + cos (2afat) 


= 2 cos (2474 ) cos (2084 ) a7 


single tone having a frequency that is the average of 
the frequencies of the two individual tones and an 
amplitude that varies slowly according to the differ- 
ence of the two frequencies. As f) increases more, 
the beating will quicken until it first becomes coarse 
and unpleasant (dissonant) and then resolves into 
two separate tones of different pitches. With still fur- 
ther increase, a sense of beating and dissonance will 
reappear, leading into a blending or consonance that 
then breaks again into beats and dissonance, and the 
whole cycle of events repeats. These islands of con- 
sonance surrounded by beating and dissonance are 
attained whenever the ratio of the two frequencies 
becomes that of small integers, f/f; = 1/1, 2/1, 3/2, 
4/3, .... The larger the integers in the ratio, the more 
subtle the effects become. See BEAT. 

Frequency spectrum and timbre. Sounds can be 
characterized by many subjective terms such as 
clean, nasal, edgy, brassy, or hollow. Each term at- 
tempts to describe the nature of a complex wave- 
form that may be of very short or long duration and 
that consists of a superposition or combination of 
a number of pure tones. The sound of a person’s 
whistling or of a flute played softly often has a pure, 
clean, but somewhat dull quality. These sounds con- 
sist mainly of a pure tone with few or no harmonics. 
As described above, complex repetitive waveforms 
are made up of a fundamental tone and a number 
of harmonics whose frequencies are integer multi- 
ples of the fundamental frequency. Blown or bowed 
instruments such as flute, bowed violin, oboe, and 
trumpet provide good examples. Other sounds are 
transient, or nonrepetitive, and usually consist of a 
fundamental plus a number of overtones whose fre- 
quencies are not integer multiples of the lowest. Pi- 
ano, tympani, cymbals, and plucked violin generate 
these kinds of sounds. 

An important factor is the way in which a sound 
commences. When a chime is struck, there is a clear 
sharp onset much like a hammer hitting an anvil; the 
higher overtones are quite short in duration, how- 
ever, and quickly die out, leaving only a few nearly 
harmonic lower overtones that give a clear sensa- 
tion of pitch. Plucking a guitar string near the bridge 
with a fingernail yields a similar effect. A gong gives 
a very complex impression because there are many 


nonconsonant overtones present, dying away at dif- 
ferent rates, so that the sound seems to shift in pitch 
and timbre, and may appear nonmusical or merely 
noise to some. It is the abundance of harmonics and 
overtones, the distribution of intensity among them, 
and how they preferentially die away in time that 
provide the subjective evaluation of the nature or 
timbre of the sound. See MUSICAL ACOUSTICS. 


Propagation of Sound 


Plane waves are a considerable simplification of an 
actual sound field. The sound radiated from a source 
(such as a loudspeaker, a hand clap, or a voice) must 
spread outward much like the widening circles from 
a pebble thrown into a lake. 

Spherical waves. A simple model of this more real- 
istic case is a spherical source vibrating uniformly in 
all directions with a single frequency of motion. The 
sound field must be spherically symmetric with an 
amplitude that decreases with increasing distance r 
from the source, and the fluid elements must have 
particle velocities that are directed radially. A solu- 
tion of the wave equation with spherical symme- 
try describing this kind of motion for an outgoing 
monofrequency traveling wave is given by Eq. (18), 


p= = cos (wt — kr) (d8) 
F 


where w = 2xf is the angular frequency and k = 
27/X the wave number. The pressure amplitude is 
A/r and diminishes inversely with distance from the 
source. If the spatially dependent pressure ampli- 
tude is defined by Eq. (19), then the intensity is still 


A 
PY) = F a9) 


given by Eq. (14), but with P and P, interpreted as 
A/r and A/NN2, respectively. Thus, the intensity 
falls off with distance as 1/7?. This is consistent with 
conservation of energy. The acoustic power sent 
through a sphere of radius r surrounding the source 
is the intensity of the wave multiplied by the surface 
area through which it passes. This yields 42777 = 
41 A?/(2poC), which is independent of r. 

While the particle velocity for this wave is a rela- 
tively complicated function of r, at distances r << À 
the ratio p/u approaches poc, the same as for a trav- 
eling plane wave. Further, in this limit the surfaces 
of constant phase, for which (¢ — r/c) has constant 
value, become more and more planar. Thus, at suf- 
ficient distances from the source a spherical wave 
becomes indistinguishable from a plane wave when 
viewed over regions of the space whose dimensions 
are small with respect to r. This asymptotic behavior 
allows use of the simple plane-wave relationships for 
many situations. 

Directional waves. Not all sources radiate their 
sound uniformly in all directions. When someone 
is speaking in an unconfined space, for example an 
open field, a listener circling the speaker hears the 
voice most well defined when the speaker is fac- 
ing the listener. The voice loses definition when the 


speaker is facing away from the listener. Higher fre- 
quencies tend to be more pronounced in front of the 
speaker, whereas lower frequencies are perceived 
more or less uniformly around the speaker. 

Dipole source. The fields radiated by sources of more 
complicated shape and size can be calculated by con- 
sidering the complicated source as being made up of 
a collection of small spherical sources, each radiat- 
ing a pressure wave like that given by Eq. (18), and 
then adding the pressure fields together. A simple ex- 
ample is the dipole source, consisting of two small 
sources spaced very closely together and radiating 
180° out of phase, so that one is shrinking as the other 
is expanding, and vice versa. The two sources will 
nearly cancel because as one is generating a positive 
pressure, the other is generating a negative pressure. 
Because the two sources are slightly separated, how- 
ever, the fields will not exactly cancel. If @ is defined 
as the angle included between the desired direction 
in space and the line joining the centers of the two 
(out-of-phase) sources, and d is the distance between 
the two sources, then at large distances r away from 
the pair (Fig. 1a) the total acoustic pressure field 


(b) 


Fig. 1. Pressure field of a dipole source. (a) Coordinates 0 
and r used to describe the field. The distance between the 
component sources is d. (b) Nodal surfaces of the field 
when the distance d is sufficiently large with respect to the 
wavelength A. 


is given approximately by Eq. (20). This equation is 
Akd . 
p= —(cos@) sin (wt — kr) (20) 
r 


valid when the wavelength of sound is much larger 
than the distance separating the sources (kd < 1) 
and r is much larger than d. The amplitude of the 
pressure P(r)is given by (Akd/r)|cos 6]. In any fixed 
direction, the amplitude of the pressure falls off as 
1/r, but at a fixed distance the pressure amplitude 
is modulated in direction by |cos 6]. In the direction 
0 = 0 (or 0 = x radians), the two sources lie one 
behind the other and the cancellation is least com- 
plete. If 0 = 7/2, then the two sources lie side by 
side and the cancellation is total. There is a nodal 
surface, defined by the plane 0 = 7/2, exactly mid- 
way between the two sources and perpendicular to 
the line joining them. 

If the distance d is not significantly less than the 
wavelength à, then Eq. (20) is not accurate, and 
a more complicated expression, Eq. (21), must be 


A 1 
p=2- sin (5 kd cos a) sin (wt — kr) (21) 
r 


used. There is still a nodal surface at 0 = 2/2, but if 
kd is large enough there may be additional nodal sur- 
faces, each a cone concentric with the line joining 
the sources, in directions given by angles 0, satisfy- 
ing kd cos 0, = 2nz with n = 1, 2,... (Fig. 1b). 
Generalized source. Generalizing from the above 
shows that the pressure field radiated by a source of 
arbitrary shape will have an amplitude that at large 
distance can be written as the product of a func- 
tion of r and a function of direction, as in Eq. (22). 


PCr, 0, $) = Px MHO, p) (22) 


Here, P,() falls off as 1/r, and H(@,¢) is a function 
only of direction and has a maximum value of 1. At 
fixed distance r, the pressure amplitude has maxi- 
mum value given by P,,(r), termed the axial pres- 
sure amplitude. The direction for which H(@,@) has 
its maximum magnitude of 1 defines the acoustic axis 
of the source. The acoustic axis may be a plane or 
a series of surfaces of constant 6 or ¢, but often the 
acoustic axis is a single line. The ratio of the intensity 
found at distance r in some arbitrary direction given 
by 6 and ¢ to the value found at the same distance 
on the acoustic axis is simply H?(6,é), and the ex- 
pression of this as an intensity level, called the beam 
pattern b@,¢), is given by Eq. (23). The decibel value 


b@, $) = 20108 HO, p) (23) 


of the beam pattern is the reduction in intensity level 
in that direction compared to the value on the acous- 
tic axis (at the same distance from the source). 
Baffled piston source. A loudspeaker or a hole in a par- 
tition can be represented as a baffled piston. This is 
a flat, circular source of radius a mounted on a large, 
flat, rigid surface, commonly called a baffle. All por- 
tions of the source move together with uniform par- 
ticle velocity normal to the baffle. It is convenient to 
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Fig. 2. Coordinates r and 0 used to describe the pressure 
field of a baffled piston. The radius of the piston is a. 


measure the distance r from the center of the piston 
and the direction 6 with respect to the perpendicu- 
lar to the piston and baffle (Fig. 2). The acoustic axis 
is the line 0 = 0, and the principal sound radiated by 
the piston propagates along this axis, falling offas 1/r 
at larger distances and spreading within a conical 
nodal surface surrounding the axis. This spreading 
beam of sound is the major lobe of the radiated pres- 
sure field. The angle 6, defining this nodal surface is 
given by ka sin 6; = 3.83. This surface will exist only 





if the frequency is high enough and the wavelength 
consequently small enough that ka is greater than 
3.83. For sufficiently high frequencies, a number of 
nodal surfaces may exist, defined by angles 0, satisfy- 
ing Eq. (24). [Here, the quantities /;,, are the values of 


ka sin@, = fin = 3.83, 7.02, 10.17,... Q 


the argument of the first-order Bessel function J, that 
reduce it to zero: JiG1,) = 0.] Between these nodal 
surfaces will be minor lobes of lower-intensity levels. 
Each of these minor lobes will be funnel- or bowl- 
shaped, surrounding the major lobe as a figure of 
revolution about the 6 = 0 acoustic axis. Some exam- 
ples for a piston with different values of ka are shown 
in Fig. 3. The first side lobe, when it exists, has an 
intensity level about 18 dB less than the major lobe. 
Subjectively, sound heard by someone standing in 
the direction of the first side lobe is about one-fourth 
as loud as that heard in the direction of the major 
lobe at the same distance. For a given piston, it is ap- 
parent that higher frequencies (shorter wavelengths) 
will yield pressure fields that have narrower major 
lobes and more nodal surfaces and minor lobes. 
This is one reason why higher-frequency sounds are 
often radiated by smaller loudspeakers (tweeters) in 
high-fidelity systems. The smaller loudspeakers radi- 
ate the shorter wavelength sounds more uniformly 
over larger angles, helping to distribute the energy 
throughout the space. These small speakers are not 
useful for lower frequencies, however, because they 
cannot move enough air to generate sufficiently in- 
tense low-frequency sound. See BESSEL FUNCTIONS; 
LOUDSPEAKER. 

In addition to the geometrically simple far field 
found at large distances from a source, there will also 
often be a rather complicated near field in the imme- 
diate vicinity of the source. For the piston source, the 
transition between near and far fields occurs around 




















Fig. 3. Directivity patterns for a rigid circular piston mounted on an infinite baffle for different values of ka, where a is the 
radius of the piston and k is the wave number. (a) ka = 3. (b) ka = 5. (c) ka = 10. 


a distance 7, estimated by r;/a ~ a/i. See DIRECTIV- 
ITY. 

Diffraction. It is possible to hear but not see 
around the corner ofa tall building. However, higher- 
frequency sound (with shorter wavelength) tends to 
bend or “spill” less around edges and corners than 
does sound of lower frequency. The ability of a wave 
to spread out after traveling through an opening and 
to bend around obstacles is termed diffraction. This 
is why it is often difficult to shield a listener from an 
undesired source of noise, like blocking aircraft or 
traffic noise from nearby residences. Simply erecting 
a brick or concrete wall between source and receiver 
is often an insufficient remedy, because the sounds 
may diffract around the top of the wall and reach the 
listeners with sufficient intensity to be distracting or 
bothersome. The effectiveness of a barrier in block- 
ing sound can be roughly estimated by calculation 
of an insertion loss. For a long solid wall (brick or 
concrete) of uniform height the insertion loss can 
be estimated from Eq. (25), where r is the direct- 


D-r 
AL; = 19 + 10 log e=) (25) 


path distance (the distance in a straight line from 
source to receiver), D is the new path length from 
the source just over the top of the wall to the receiver, 
and AZ, is the difference between the intensity level 
that would be experienced if there were no barrier 
and the intensity level with the barrier. The equa- 
tion is accurate when (D — r)/A < 1. See ACOUSTIC 
NOISE; DIFFRACTION. 

Speed of sound. In fluids, the acoustic pressure 
and the density variation are related by Eq. (26), 


p= Rs (26) 


where the condensation s is (9 — po)/po and R 
is a constant of proportionality. This is a three- 
dimensional Hooke’s law stating that stress and 
strain are proportional. The propagation of sound 
requires such rapid density changes that there is 
not enough time for thermal energy to flow from 
compressed (higher-temperature) elements to de- 
compressed (lower-temperature) elements. Because 
there is little heat flow, the process is nearly adia- 
batic and & is the adiabatic bulk modulus (rather 
than the more familiar isothermal modulus relevant 
to constant-temperature processes). A derivation of 
the wave equation for a fluid reveals that the speed 
of sound is given by Eq. (27). Equations (26) and (27) 
can be combined to give Eq. (28). 


R 

C= z (27) 
0 

P= poc’s (28) 


See ADIABATIC PROCESS; HOOKE’S LAW. 

In most fluids, the equation of state is not known, 
and the speed of sound must be measured directly 
or inferred from experimental determinations of the 
bulk modulus. In distilled water the measured speed 
of sound in meters per second can be represented 


by a simplified equation (29), where T is the temper- 


c = 1403 + 4.9T — 0.0577 +0.16R (29) 


ature in degrees Celsius and % is the equilibrium 
pressure in bars. For most gases at reasonable pres- 
sures and temperatures, however, the equation of 
state is known to be very close to the perfect gas 
law, Eq. G0), where R is the specific gas constant 


FY = PoRTx 60) 


and Tx the absolute temperature. (Absolute temper- 
ature Tx in kelvin and standard temperature T in de- 
grees Celsius are related by 7x = T + 273.) Further, 
for sound in a gas, the instantaneous pressure and 
density are related by Eq. (1), where y is the ratio 


P p ) 
ae eee GD 
H ( Po 
of specific heats. For air, y = 1.4. For small pres- 
sure fluctuations, Eq. (31) can be approximated by 
Eq. G2). Comparing this with Eq. (28), and using the 
P 


A =1+ys 62) 


perfect gas law, Eq. (50), gives a theoretical expres- 
sion for the speed of sound in an ideal gas, Eq. (33). 


c= JYRTx G3) 


Thus, the speed of sound in an ideal gas depends 
on temperature, increasing as the square root of the 
absolute temperature. Because many applications of 
Eq. (33) are for gases around room temperature, it 
is convenient to reexpress the formula in terms of 
degrees Celsius and the speed of sound co at T = 0°C 
G2°F, as in Eq. 34). This equation can be used to 


| T 
cC = Co 1+ 33 G34) 


calculate the speed of sound for a perfect gas at any 
temperature in terms of the speed of sound in that 
gas at 0°C (32°F). In air, for example, cp = 331 m/s 
(1086 ft/s), and at 20°C (68°F) the speed of sound is 
calculated to be 343 m/s (1125 ft/s). See GAS; HEAT 
CAPACITY; UNDERWATER SOUND. 

In solids, the sound speed depends on the trans- 
verse extent of the solid with respect to the wave. If 
the solid is rodlike, with transverse dimensions less 
than the acoustic wavelength, the solid can swell and 
contract transversely with relative ease, which tends 
to slow the wave down. For this case, the speed of 
sound is given by Eq. (35) where </ is Young’s mod- 


a 
c= J2 65) 
Po 


ulus. If, in the other extreme, the solid has transverse 
dimensions much larger than the acoustic wave- 
length, then a longitudinal wave will travel with a 
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greater speed of sound given by Eq. (36), where <¥ is 


B Gi 
= ` 3+ 4 > (36) 
Po 


the shear modulus. These two different propagation 
speeds are often called bar and bulk sound speeds, 
respectively. See ELASTICITY; YOUNG’S MODULUS. 

Ray and refraction. Since the speed of sound varies 
with the local temperature (and pressure, in other 
than perfect gases), the speed of a sound wave can be 
a function of position. Different portions of a sound 
wave may travel with different speeds of sound. 

Ifa highly directional piston projects a sound wave 
horizontally into the air, and if the value of ka is 
large enough, the major lobe of the sound radiated 
by the piston is confined to directions within a very 
small angle 0, about the (horizontal) acoustic axis. 
The beam of sound has surfaces of constant phase, 
each resembling a disk, perpendicular to and prop- 
agating out along the acoustic axis with the speed 
of sound of the air. If, however, the air temperature 
decreases gradually with height above the ground, 
then the upper portions of the beam travel in air 
with a lesser sound speed than the lower portions. 
The top of a surface of constant phase gradually lags 
behind the bottom, and the disk tilts a little so that 
its normal points slightly upward. Since the direction 
of propagation is normal to the surface of constant 
phase, this surface gradually propagates upward, ris- 
ing above the horizontal as it travels outward away 
from the piston. Thus, as a consequence of the speed 
of sound decreasing with altitude, the beam of sound 
is refracted upward in an arc with one end horizon- 
tal at the piston and the other tilting progressively 
upward as distance from the piston increases. 

Each small element of a surface of constant phase 
traces a line in space, defining a ray along which 
acoustic energy travels. The sound beam can then be 
viewed as a ray bundle, like a sheaf of wheat, with 
the rays distributed over the cross-sectional area of 
the surface of constant phase (Fig. 4). As the major 
lobe spreads with distance, this area increases and 
the rays are less densely concentrated. The number 
of rays per unit area transverse to the propagation 
path measures the energy density of the sound at 
that point. 











Fig. 4. Representation of an upwardly refracted sound 
beam as a ray bundle. 


It is possible to use the concept of rays to study 
the propagation of a sound field. The ray paths define 
the trajectories over which acoustic energy is trans- 
ported by the traveling wave, and the flux density of 
the rays measures the intensity to be found at each 
point in space. This approach, an alternative way to 
study the propagation of sound, is approximate in 
nature but has the advantage of being very easy to 
visualize. 

An equation describing refraction can be obtained 
from the three-dimensional wave equation (37), with 
c now a function of space, by trying a solution of the 
form of Eq. (38). The pressure amplitude P(%,y,z) 


1 3? 
Vp= a £ (37) 
p=P (xyz) cos E (« — raza) ] 68) 
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depends on location, and co is a convenient con- 
stant reference value of sound speed. The quantity 
y has the dimensions of a length, and constant wy 
determines a surface of constant phase. The local 
direction of propagation is perpendicular to this sur- 
face and is given by Vy, the gradient of y. For the 
plane traveling wave of Eq. (4), the surfaces, given by 
x = constant, are planes perpendicular to the x axis. 
The gradient of x points in the +x direction, the di- 
rection of propagation of the phase surface. For the 
spherical wave of Eq. (6), the surfaces are given by 
r = constant, and each is a sphere concentric with 
the source at r = 0. The gradient of r is direct radially 
outward at each point. Substitution of Eq. (58) into 
the wave equation and some approximation results 
in the Eikonal equation (39) for the direction of prop- 


iwyr=(2)° G9) 


agation. The Eikonal equation is valid if the fractional 
changes in P and c in any direction over distances of 
a wavelength are very small. In general, solving this 
equation requires integral and differential calculus 
and results in rather difficult formulas. 

For the case of a one-dimensional speed of sound 
profile c = c(z), where z represents altitude, the so- 
lution can be simply stated: If direction is defined 
by an angle 0 of elevation (0 = 0 points horizon- 
tally and 6 = 2/2 points vertically), then the local 
direction of propagation of a ray must obey Snell’s 
law, Eq. (40). If the elevation angle of the ray has 

cos@ cos Oo 


= (40) 
c Co 








value 6o at an altitude where the speed of sound is 
Co, then as the ray propagates to a new altitude where 
the speed of sound is c2), its directional angle will 
change to a new value 6(2) according to Eq. (40). 
This allows the ray to be traced point by point as it 
propagates through the fluid, each new directional 
angle at each new point being obtained from Snell’s 
law. If the speed of sound varies linearly with altitude, 
that is, c(z) = co — gz, where the gradient g = dc/dz 
of the profile is a constant, then the ray path will be 


the arc of a circle whose center lies at the altitude 
where the speed of sound would vanish. [This would 
never happen since c(z) will stop decreasing linearly 
long before this altitude is reached. It is, however, a 
useful artifice for drawing the trajectory of the ray.] 
Figure 5 provides an example of such a ray path. See 
ATMOSPHERIC ACOUSTICS; REFRACTION OF WAVES. 

Reflection and transmission. If a sound wave travel- 
ing in one fluid strikes a boundary between the first 
fluid and a second, then there may be reflection and 
transmission of sound. For most cases, it is sufficient 
to consider the waves to be planar. The first fluid con- 
tains the incident wave of intensity J; and reflected 
wave of intensity J,, the second fluid, from which the 
sound is reflected, contains the transmitted wave of 
intensity J,. The directions of the incident, reflected, 
and transmitted plane sound waves may be specified 
by the grazing angles 6;, 0,, and 6; (measured be- 
tween the respective directions of propagation and 
the plane of the reflecting surface). 

The intensity reflection coefficient J,/J; and the in- 
tensity transmission coefficient I,/I; are found by im- 
posing boundary conditions that require continuity 
of pressure and continuity of the normal component 
of the particle velocity at the interface between the 
two fluids. These boundary conditions then require 
that the grazing angle of the incident plane wave 
equals that of the reflected plane wave, Eq. (41). For 


0; => 6,. (41) 


the transmitted wave, the grazing angle 6; must be 
given by Eq. (42), where c; and c3 are the values of the 


cos 0; = (2) cos 6; (42) 


Cy 


speed of sound in the first and second fluids, respec- 
tively. Both of these results are consistent with Snell’s 
law. There are two possible situations depending on 
whether the speeds of sound c; and cz and the angle 
of incidence are such that cos 6; < 1, which presents 
no conceptual problem, or cos 6; << 1, which does. 

If cos 6, < 1, then the transmitted wave is a plane 
wave sent into the second fluid. The beam of sound 
is refracted according to Snell’s law, being inclined 
toward the normal to the boundary if cp is less than 
c, and being deflected away from the normal if c, 
exceeds cı. The reflected and transmitted intensities 
are given by Eqs. (43) and (44), where pı and p2 are 


p1ıC2 sin 0; 2 














L, P2C2 sin 6; 
L z pPıCı sin 0, (43) 
(2C2 sin 0; 
(2 ) 
IL pıCı 
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z 2 
( pic, sin *) 

p2C2 ~—-sin dd; 
the densities of the first and second fluids, respec- 


tively. Clearly, the intensities depend strongly on the 
angle of incidence. If there is a large mismatch in the 
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Fig. 5. Ray path for the refraction of sound in a speed-of-sound profile. (a) Speed of 
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sound c as a function of altitude z. (b) Path of the ray. r = horizontal distance from source; 


6 = angle of elevation of the ray. 


characteristic acoustic resistances of the two fluids, 
regardless of which is the larger, the reflection coef- 
ficient approaches unity. If p2c2 < p1c,, the second 
medium appears rigid, and the pressure amplitude at 
the boundary (the amplitude of the sum of the inci- 
dent and reflected waves evaluated at the interface) 
is as great as possible. In the other extreme, p2c, < 
fic), the second medium offers virtually no opposi- 
tion to the motion of the first, so there is no reactive 
force and therefore no pressure at the interface. In 
this case the boundary is called pressure release. 

If cos 6; < 1, then the transmission angle has no 
physical meaning. In this case the intensity reflection 
coefficient has unit value, and the transmitted wave 
travels parallel to the boundary as if it were attached 
to it with an amplitude that decays exponentially 
with increasing distance from the boundary. This 
case is the acoustic analog of the situation in optics 
when there is complete internal reflection. See RE- 
FLECTION AND TRANSMISSION COEFFICIENTS; REFLEC- 
TION OF ELECTROMAGNETIC RADIATION; REFLECTION 
OF SOUND. 

Absorption. When sound propagates through a 
medium, there are a number of mechanisms by 
which the acoustic energy is converted to heat and 
the sound wave weakened until it is entirely dissi- 
pated. This absorption of acoustic energy is charac- 
terized by a spatial absorption coefficient a for trav- 
eling waves. For a monofrequency traveling plane 
wave, Eq. (4) must be modified to Eq. (45). [Simi- 


p=Pe “cos [22 G — =) (45) 


larly, a monofrequency traveling spherical wave like 
that of Eq. (18) would be multiplied by a factor 
exp(—ar).] The pressure amplitude diminishes ex- 
ponentially with increasing distance and the sound 
pressure level decreases linearly with x, according 
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TABLE 2. Values of a/f? for selected gases and liquids* 








Classical Observed 
thermoviscous value, 
value, a/f? x 10", a/f? X 10", 
Np - s?/m Np - s?/m 
Gases 
Argon 1.85 1.87 
Helium 0.53 0.54 
Nitrogen 1.35 1.64 
Oxygen 1.61 1.92 
Liquids 
Acetone TO 30.0 
Benzene 750 
Ethyl alcohol 50 
Glycerine 3000 3000 
Mercury 6.0 8.0 
Water (fresh) 8.1 25.0 





*Temperature = 20°C = 68°F. Pressure = 1 atm = 101.3 kPa = 14.7 
Ibf/in.2 1 Np - s?/m = 0.3048 Np - s?/ft. 

SOURCE: After L. E. Kinsler et al., Fundamentals of Acoustics, 3d 
ed., Wiley, 1982; and J. J. Markham, R. T. Beyer, and R. B. Lindsay, 
Absorption of sound in fluids, Rev. Mod. Phys., 23:353-411, 1951. 





to Eq. (46). The dimension of a is the inverse meter 


SPL(x) = SPL(O) — 8.7 ax (46) 


(m7). For historical reasons, and to avoid ambiguity 
with other quantities like the wave number k, ab- 
sorption is given in units of nepers per meter (Np/m), 
where the neper is a dimensionless label. Because of 
the use of sound pressure level in acoustics, spatial 
absorption can also be specified in terms of decibels 
per meter (dB/m). From Eq. (46), the absorption co- 
efficient in dB/m is 8.7a. For standing waves, the 
sound field dies away with time according to a fac- 
tor exp(—fd. The temporal absorption coefficient 
B, given in nepers per second (Np/s), is related to a 
by $ = ac. Some values of spatial absorption coeffi- 
cients are given in Table 2. See NEPER. 

Classical mechanisms. One mechanism of absorption 
is shear viscosity, the analog of friction for fluids. 
Molecules traveling between fluid elements of dif- 
fering speeds will transfer momentum between the 
elements through intermolecular collisions, diffus- 
ing the differences in collective motion into random 
(thermal) motion of the individual molecules. This 
loss of acoustic energy into heat is described by 
the absorption coefficient for viscosity av. In flu- 
ids, this coefficient increases with the square of the 
frequency (up to about 10!° Hz). See VISCOSITY. 

Another mechanism arises from the thermal con- 
ductivity of the fluid. In a perfectly insulating fluid 
(zero thermal conductivity), cold regions could not 
conduct thermal energy away from hot regions. In 
real materials, small amounts of thermal energy can 
flow between these hotter and colder regions, tend- 
ing to equalize the temperatures a little and convert- 
ing acoustic energy into heat. This energy conver- 
sion is described by a thermal absorption coefficient 
a, Which also increases with the square of the fre- 
quency. The losses from shear viscosity and thermal 
conductivity add together to give the classical ther- 


moviscous absorption coefficient a, = av + a@;. See 
CONDUCTION (HEAT). 

Structural relaxation. In certain liquids, including 
water, acetone, and alcohols, the observed absorp- 
tion is significantly larger than that predicted by the 
classical absorption coefficient but has the same fre- 
quency dependence. A postulated mechanism is a 
structural relaxation, involving transitions between 
two preferred clusterings of nearest neighbors, one 
favored at lower density and the other at higher. 
When the sound wave travels through the fluid, the 
pressure and temperature fluctuations cause transi- 
tions between these two structural states. The net 
effect is to introduce a bulk viscosity coefficient ad- 
ditive to the shear viscosity. In water, the effect of 
bulk viscosity results in a measured absorption coef- 
ficient about three times the classical value. 

Thermal molecular relaxation. In some gases with poly- 
atomic molecules, like air, thermal molecular relaxa- 
tion is an important mechanism of absorption. When 
two gas molecules collide, in addition to simply re- 
bounding from each other like two billiard balls, one 
or the other of the molecules may be excited into in- 
ternal vibrations, thus removing energy from the col- 
lective acoustic motion. Once excited, the molecule 
may release this energy back into kinetic energy of 
motion during another collision, but this release is 
not coordinated with the acoustic wave and so ap- 
pears at random, contributing to the thermal motion 
of the molecules. It takes a characteristic amount of 
time, called the relaxation time tT, to accomplish the 
conversion of energy into and from these internal 
vibrations. If t is sufficiently small compared to the 
period T of the acoustic wave, then the mechanism is 
fully activated and the resulting absorption increases 
with the square of the frequency. At higher frequen- 
cies such that T < t, the absorption from this effect 
becomes a constant independent of the frequency. 
Thus, while at lower frequencies molecular thermal 
relaxation can be an important source of absorption 
in certain gases, at higher frequencies the effect is 
completely dominated by the classical thermovis- 
cous absorption. In air, collisions involving carbon 
dioxide with either nitrogen or oxygen, and oxygen 
with water vapor, are important in exciting internal 
energy states. The total absorption coefficient is a 
strong function of humidity and frequency through- 
out the audible range of frequencies. The effects 
are strong enough that the reverberation character- 
istics of a poorly ventilated concert hall can change 
during the course of a concert as the moisture ex- 
uded by the audience alters the humidity. Figure 6 
shows curves for the absorption coefficient in deci- 
bels per meter as a function of frequency for three 
different humidities and the classical thermoviscous 
value. 

Chemical relaxations. In certain liquids, there can be 
chemical relaxations involving the concentrations 
of various chemical species and how they change 
with temperature. A sound wave passing through the 
liquid causes local temperature changes, and these 
in turn can cause changes in the ionization of dis- 
solved salts. The basic mechanisms cause losses quite 
similar in effect to the molecular thermal relaxation 
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Fig. 6. Absorption of sound in air at 20°C (68°F) as a 
function of frequency for three humidities and the classical 
thermoviscous value. 1 dB/m = 0.3 dB/ft. (After H. E. Bass, 
H.-J. Bauer, and L. B. Evans, Atmospheric absorption of 
sound: Analytical expressions, J. Acous. Soc. Amer., 
52:821-825, 1972) 


described above. In seawater, chemical relaxations 
contribute appreciable additional absorption at fre- 
quencies below about 100 kHz for magnesium sul- 
fate (MgSO, and below about 1 kHz for boric acid. 
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Fig. 7. Absorption of sound in fresh water and in seawater 
of 35 parts per thousand salinity at atmospheric pressure 
and 5 C (41°F). 1 dB/m = 0.3 dB/ft. (After F. H. Fisher and 
V. P. Simmons, Sound absorption in sea water, J. Acous. 
Soc. Amer., 62:558-664, 1977) 
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Figure 7 gives a representative curve of the fre- 
quency dependence of the absorption coefficient 
in decibels per meter for seawater. See SEAWATER. 
Alan B. Coppens 
Bibliography. L. L. Beranek, Acoustics, 1986; R. E. 
Berg and D. G. Stork, The Physics of Sound, 2d 
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áth ed., 2000; P M. Morse and K. U. Ingard, Theoret- 
ical Acoustics, 1986; M. Rossi, Acoustics and Elec- 
troacoustics, 1988; T. D. Rossing and N. H. Fletcher, 
Principles of Vibration and Sound, 1994. 
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The process by which the intensity of sound is dimin- 
ished by the conversion of the energy of the sound 
wave into heat. The absorption of sound is an im- 
portant case of sound attenuation. This article will 
address absorption mechanisms in varying media but 
will concentrate on absorption in ideal gases (such 
as air), since these effects are the most commonly 
observed. 

Attenuation. Regardless of the material through 
which sound passes, its intensity, measured by the 
average flow of energy in the wave per unit time per 
unit area perpendicular to the direction of propaga- 
tion, decreases with distance from the source. This 
decrease is called attenuation. In the simple case 
of a point source of sound radiating into an ideal 
medium (having no boundaries, turbulent fluctua- 
tions, and the like), the intensity decreases inversely 
as the square of the distance from the source. This re- 
lationship exists because the spherical area through 
which the energy propagates per unit time increases 
as the square of the propagation distance. This atten- 
uation or loss may be called geometrical attenuation. 

In addition to this attenuation due to spreading, 
there is effective attenuation caused by scattering 
within the medium. Sound can be reflected and re- 
fracted when incident on media of different physi- 
cal properties, and can be diffracted and scattered 
as it bends around obstacles. These processes lead 
to effective attenuation, for example, in a turbulent 
atmosphere; this is easily observed in practice and 
can be measured, but is difficult to calculate theoret- 
ically with precision. See DIFFRACTION; REFLECTION 
OF SOUND; REFRACTION OF WAVES. 

In actual material media, geometrical attenuation 
and effective attenuation are supplemented by ab- 
sorption due to the interaction between the sound 
wave and the physical properties of the propagation 
medium itself. This interaction dissipates the sound 
energy by transforming it into heat and hence de- 
creases the intensity of the wave. In all practical 
cases, the attenuation due to such absorption is ex- 
ponential in character. Thus, if J denotes the inten- 
sity of the sound at unit distance from the source, 
then J/, the intensity at distance r in the same units, 
has the form of Eq. (1), where e represents the base 


I= he ad) 


15 


16 


Sound absorption 


of napierian logarithms, 2.71828 ... and a is called 
the intensity absorption coefficient. The a can be 
expressed by Eq. (2). Hence a has the dimensions 


a il > 2 
ee Io 2) 


of reciprocal length or distance. However, it is cus- 
tomary to express the logarithm, which is a pure 
dimensionless number, in nepers, and hence g is in 
nepers per unit length. Since 10 logio Q/Io) is the 
number of decibels between the intensity J and Jo, 
the absorption coefficient œ can also be expressed 
in decibels per unit length or distance. See DECIBEL; 
NEPER; SOUND INTENSITY. 

Classical sources. The four classical mechanisms 
of sound absorption in material media are shear vis- 
cosity, heat conduction, heat radiation, and diffu- 
sion. These attenuation mechanisms are generally 
grouped together and referred to as classical atten- 
uation or thermoviscous attenuation. They will be 
discussed first as they apply to fluids, namely, liquids 
and gases. 

Viscosity. In the flow of a fluid such as air or water, 
viscosity is the tendency for the fluid flowing in a 
layer parallel to the direction of motion to retard the 
motion in adjoining layers. The viscosity of a liquid 
Gndicated qualitatively by the stickiness or sluggish- 
ness of its flow) decreases as the temperature rises, 
whereas that of gases increases with the tempera- 
ture. See VISCOSITY. 

Appropriate modifications of the standard wave 
equation for plane sound waves to take account of 
the effect of viscosity of a fluid lead to Eq. @) for 
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the absorption coefficient as defined in Eq. (1), but 
specifically due to viscosity, where u is the viscos- 
ity of the fluid, f is the frequency of the sound wave 
in hertz, c is the wave’s velocity, and po is the equi- 
librium density of the fluid. The expression for a, 
holds only for frequencies that are not too high. For 
example, in the case of air, it fails at frequencies on 
the order of 10° Hz, which involve ultrasonics. See 
ULTRASONICS. 

Heat conduction. The role of heat conduction can be 
qualitatively understood as follows. The compres- 
sion associated with the high-pressure portion of a 
propagating sound wave produces a local increase in 
temperature, which sets up a temperature gradient 
through which heat will flow by conduction before 
the succeeding rarefaction cools the medium. This is 
an irreversible phenomenon, leading to energy dis- 
sipation into heat. The value of the sound absorp- 
tion coefficient due to heat conduction is given by 
Eq. (4), where y = C,/C,, the ratio of the molar spe- 


27 f7k —1 
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cific heat at constant pressure Cp to that at constant 
volume C,, po is the equilibrium density of the fluid, 
and « is the thermal conductivity. Equation (4) is 


again valid only for frequencies that are not too high. 
See CONDUCTION (HEAT); HEAT CAPACITY. 

Substitution of the appropriate numerical values 
for the physical parameters shows that for gases the 
value of a, is somewhat less than «œ, for the same fre- 
quency. For liquids, except for liquid metals, a, > a, 
for the same frequency. It is usually considered ap- 
propriate to add qa, and a, to get the total absorption 
due to these two processes. 

Heat radiation. Thermal radiation results whenever 
one portion of a fluid acquires a temperature differ- 
ent from that of an adjacent portion. This effect is 
an electromagnetic-wave phenomenon different in 
character from heat transfer by conduction. Calcula- 
tion shows that thermal radiation plays a very small 
role in sound absorption except possibly at very high 
frequencies in very low pressure gases. See HEAT 
RADIATION. 

Diffusion. Diffusion operates mainly in the case of 
fluid mixtures, for example, air that is a mixture 
mainly of nitrogen and oxygen, in which the two 
gases have a slight tendency to diffuse into each 
other. For ordinary gases in which the components 
differ little in molecular weight, the effect of dif- 
fusion on absorption is negligible compared with 
that of viscosity and heat conduction. However, in 
gaseous mixtures in which the components differ 
markedly in molecular weight—for example, a mix- 
ture of the rare gases helium and krypton—the ef- 
fect of diffusion can be much greater than that due 
to viscosity and heat conduction combined. See DIF- 
FUSION. 

Measurement of absorption in fluids. Sound absorp- 
tion in fluids can be measured in a variety of ways, re- 
ferred to as mechanical, optical, electrical, and ther- 
mal methods. All these methods reduce essentially 
to a measurement of sound intensity as a function of 
distance from the source. 

Mechanical. The mechanical method, confined to 
the estimate of absorption in liquids, uses radia- 
tion pressure, proportional to the sound intensity, to 
“weigh” the sound in a vertical beam. See ACOUSTIC 
RADIATION PRESSURE. 

Optical. The optical method, which is also mainly 
used for liquids, is based on the diffraction of a beam 
of light by the sound beam, the latter being treated 
essentially as a diffraction grating. The greater the 
intensity of the sound, the more diffraction bands 
are produced. See DIFFRACTION. 

Electrical. The electrical methods, the most gener- 
ally used for both gases and liquids, employ an acous- 
tic transducer like a piezoelectric crystal to receive 
the sound beam as it traverses the medium whose 
absorption coefficient is to be evaluated, and to mea- 
sure the variation in the sound intensity as the dis- 
tance from the source is varied. An alternative is the 
use of an acoustic interferometer, which functions 
essentially in the same way as an optical interfer- 
ometer. Still another scheme is based on the use of 
the reverberation time technique. In this method the 
medium is confined in a closed spherical vessel with 
highly reflective walls. The reverberation time, that 
is, the time taken by a sound pulse initially produced 


at the center of the vessel by a transducer (which can 
also act as a sound receiver) to decay to one-millionth 
of its original intensity provides a measure of the in- 
tensity loss and leads to an evaluation of the absorp- 
tion coefficient. This method has been successfully 
applied to liquids and gases. See INTERFEROMETRY; 
MICROPHONE; REVERBERATION. 

Thermal. The thermal method undertakes to esti- 
mate the loss in intensity of a sound wave by direct 
measurement of the heat produced in its passage. 
This demands very careful experimental technique, 
and the difficulties associated with it have provoked 
controversy about the usefulness of the method. 

Experimental values for gases. Measurements carried 
out by these methods show that the experimen- 
tal values for the sound absorption coefficient (as 
a function of frequency) of a monatomic gas like he- 
lium agree rather well with the values calculated on 
the basis of viscosity and heat conduction, except 
at very low pressures and very high frequencies. On 
the other hand, for more complicated gases, like air 
(though in general a varies roughly directly as the 
square of the frequency, as predicted by classical 
theory), the actual numerical values are greater (and 
in many cases very much greater) than the theoret- 
ically predicted ones. For example, for air at 20°C 
(68°F) and normal atmospheric pressure, the theo- 
retical value of (@, + a,)/f? is 1.27 x 10~-™ neper 
s?/cm, whereas the experimental value of a/f for 
air of low humidity under the same conditions is ap- 
proximately 8 x 107!? in the same units. 

Experimental values for liquids. The situation with re- 
spect to liquids is about the same. Mercury and the 
liquefied gases like argon, oxygen, and nitrogen have 
absorption coefficients in moderately good agree- 
ment (that is, within 10-15%) with the classically 
calculated values. But ordinary liquids like water and 
aqueous salt solutions show values much in excess 
of those classically calculated. The observed excess 
in the case of organic liquids like carbon disulfide, 
for example, is very large indeed, of the order of 
several hundred times. A satisfactory explanation for 
this excess absorption has been found in molecular 
and related relaxation processes. 

Molecular relaxation in gases. Although sound 
propagation in a gaseous medium is phenomenolog- 
ically a wave process, it actually takes place through 
the collisions of the molecules composing the gas. 
According to the kinetic theory, the molecules of 
a gas are in ceaseless random motion. When the 
gas is locally compacted by the compression por- 
tion of a propagating sound wave, the average trans- 
lational kinetic energy of the molecules is locally 
increased. Energy has been introduced into the 
group of molecules. This energy increase is con- 
veyed through collisions to the adjoining group of 
molecules, which in turn passes it on through the 
gas, and so on, and this continuous transfer repre- 
sents the sound-wave energy. 

A sound wave thus involves the momentary local 
compression of the medium, which by increasing the 
local density, according to kinetic theory, also leads 
to an increase in the average translational kinetic en- 
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ergy and hence to an increase in local pressure. In 
the case of a gas made up of polyatomic molecules 
such as oxygen and nitrogen, the increase in pres- 
sure cannot take place instantaneously, since some 
of the extra translational kinetic energy is transferred 
to the internal rotational and vibrational molecular- 
energy states. If one imagines the diatomic oxygen 
(O2) molecule as a dumbbell-shaped weight, with a 
springy connecter between the weights, some of the 
collision energy acts to spin the molecule and some 
acts to vibrate the weights toward and away from 
each other. The dissipation of these energy states is 
referred to as rotational relaxation and vibrational 
relaxation, respectively. 

After the local density has begun to decrease due 
to the rarefaction portion of the propagating wave, 
the pressure continues to build up as energy is trans- 
ferred back to the translation form from the inter- 
nal rotational and vibration states. The excess den- 
sity and excess pressure thus get out of phase, as in 
the standard hysteresis phenomenon, which leads to 
an effective transformation of acoustic-wave energy 
into heat, corresponding to sound absorption. This 
process is termed a relaxation process, and the time 
taken for the density at any point to return to within 
the fraction 1 — (1/e) [approximately 63%] of its equi- 
librium value is termed the relaxation time, denoted 
by t. See HYSTERESIS; MOLECULAR STRUCTURE AND 
SPECTRA. 

Calculation of the effect. The results of the theoretical 
calculation of the effect of molecular relaxation will 
be discussed for the special case in which there are 
only two energy states of each molecule: (1) that in 
which it possesses translational kinetic energy only, 
and (2) that in which it possesses translational kinetic 
energy plus a single internal energy level. Here the 
relaxation time is given by Eq. (5), where x9) is the av- 


: 6) 

Ki + ky 
erage number of transitions per second from energy 
state 1 to energy state 2, and «3, is the average num- 
ber of transitions per second from state 2 to state 1. 
The superscripts mean that the transition rates refer 
to the equilibrium condition of the gas as unaffected 
by the sound wave. Analysis then gives the sound 
absorption coefficient œ due to this type of relaxa- 
tion process in the form of Eq. (6), where / is the 
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frequency and K is a constant containing the molar 
gas constant R, the sound velocity, and the specific 
heats of the gas. If 2xfr < 1, œ varies as the square 
of the frequency, as in the case of viscous and heat 
conduction absorption. However, if 27ft >> 1, the 
attenuation becomes a constant with respect to fre- 
quency and the effect is said to be frozen out. In the 
more general case, if a/2m/f is plotted as a function 
of frequency, it has a maximum for f = 1/27rT, the 
plot taking the form of an approximate bell-shaped 
curve. Such a plot, carried out experimentally, 
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provides a method for evaluating the relaxation 
time T. 

The theoretical evaluation of t from Eq. (5) in- 
volves essentially a calculation in quantum statis- 
tics. The agreement of theory with experiment for 
many gases showing excess absorption over that at- 
tributable to viscosity and heat conduction is satisfac- 
tory. The relaxation process described here is usually 
termed thermal relaxation, since the specific heats 
of the gas enter prominently into the process. 

Examples. Examples of gases for which experimen- 
tal values are in reasonable agreement with the- 
ory include hydrogen, for which the excess ab- 
sorption over classical values arises primarily from 
rotational relaxation, with a relaxation time on the 
order of 1078 s, and oxygen, for which the excess 
low-frequency absorption depends primarily on vi- 
brational relaxation, with a relaxation time on the 
order of 3 x 107? s at room temperature. In more 
complicated polyatomic gases, it is rare for a single 
relaxation process to be solely effective, and rather 
elaborate analysis is required to take account of mul- 
tiple relaxations. 

Accompanying relaxation absorption in gases is 
sound dispersion, which is the variation of sound 
velocity with frequency. The dispersion leads to the 
distortion of waveforms and features prominently in 
areas such as sonic boom propagation. See NONLIN- 
EAR ACOUSTICS; SONIC BOOM. 

Relaxation processes in liquids. The excess absorp- 
tion of sound in liquids above the classically pre- 
dicted values has also been attributed to relaxation 
processes. The assumption of thermal relaxation has 
been successful for certain nonpolar liquids such as 
benzene. It has not worked, however, for polar liq- 
uids such as water. For the latter, so-called structural 
relaxation has been successfully invoked. This as- 
sumes that liquid water can exist in two different 
states of mutual molecular orientation and that the 
passage of sound affects the average rate of energy 
transfer back and forth between the two states. The 
state of lower energy is assumed to be similar to the 
structure of ice, in which each water molecule is 
surrounded on the average (in spite of the random 
motion of the molecules) by its four nearest neigh- 
bors arranged at the corners of a regular tetrahedron 
with the given molecule in the center. The higher- 
energy state corresponds to a more closely packed 
configuration of the molecules, with an arrangement 
like that of a face-centered cubic crystal. In water in 
equilibrium, most of the molecules are in the lower- 
energy state, but the passage of sound causes a trans- 
fer of some molecules into the higher state and up- 
sets the equilibrium. There is the usual lag in the 
return of the molecules to the lower state, leading to 
a relaxation time, and this produces absorption simi- 
lar to that in gases. This theory has been successful in 
accounting for the excess sound absorption in water 
as a function of temperature in the range from 10 
to 80°C (50 to 176°F). In this range, the appropriate 
relaxation time has been found to be of the order 
of 107"? s. 
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Fig. 1. Absorption of sound in air at 20°C (68°F) and various 
values of relative humidity. (From L. E. Kinsler et al., 


Fundamentals of Acoustics, 3d ed., John Wiley, 1982; used 
by permission of John Wiley & Sons, Inc.) 


Excess absorption in aqueous solutions has been 
treated successfully by chemical relaxation. A chem- 
ical reaction in the liquid such as a dissociation- 
association process can provide two energy states, 
and the transfer of energy between these states will 
be affected by the compressive stress involved in the 
passage of a sound wave. The dissociation reaction of 
magnesium sulfate has been used to explain excess 
sound absorption in seawater. 

Inthe atmosphere. The amount of sound that air ab- 
sorbs increases with audio frequency and decreases 
with air density, but also depends on temperature 
and humidity. Sound absorption in air depends heav- 
ily on relative humidity, as seen in Fig. 1, which dis- 
plays the predicted classical thermoviscous absorp- 
tion as well as the actual absorption in air for varying 
relative humidity. 

The reason for the strong dependence on relative 
humidity is molecular relaxation. One can note the 
presence of two transition regimes in most of the 
actual absorption curves, representing the relaxation 
effects of Nz and Oz, the dominant constituents of the 
atmosphere. See ATMOSPHERIC ACOUSTICS. 

Inseawater. Sound absorption in water is generally 
much less than in air. It also rises with frequency, and 
it strongly depends on the amount of dissolved mate- 
rials Gin particular, salts in seawater), due to chemical 
relaxation (Table 1). See UNDERWATER SOUND. 

Insolids. The theory of sound attenuation in solids 
is complicated because of the presence of many 
mechanisms responsible for it. These include heat 
conductivity, scattering due to anisotropic mate- 
rial properties, scattering due to grain boundaries, 
magnetic domain losses in ferromagnetic materials, 





TABLE 1. Sound absorption versus frequency for fresh 
water and seawater* 





Absorption coefficient, dB/m* 





10 kHz 100 kHz 1 MHz 
Fresh water 0.0001 0.006 0.2 
Seawater 0.0008 0.04 0.3 





*Reprinted with permission from L. Beranek, Acoustical Measure- 
ments, rev. ed., American Physical Society, 1988; © Acoustical 
Society of America. 

t1 dB/m = 0.3 dB/ft. 











interstitial diffusion of atoms, and dislocation relaxa- 
tion processes in metals. In addition, in metals at 
very low temperature the interaction between the 
lattice vibrations (phonons) due to sound propaga- 
tion and the valence of electrons plays an important 
role, particularly in the superconducting domain. See 
CRYSTAL DEFECTS; DIFFUSION; FERROMAGNETISM; SU- 
PERCONDUCTIVITY. 

At boundaries. When sound waves passing 
through air or water encounter another material or 
strike a surface, part of the incident wave energy is 
reflected, part is absorbed within the material, and 
still another part is transmitted. The general prop- 
erty of interest in acoustical design is the absorption 
coefficient, a, of the surface: the ratio of the energy 
that is absorbed and transmitted, to the energy 
incident on that boundary. For a surface in a room, 
this coefficient measures the fraction of the energy 
that is not returned to the room, independent of 
the mechanism of extraction. For example, an open 
window has a = 1, since none of the sound comes 
back into the room, even though none of it may be 
immediately absorbed by the air. On the other hand, 
the intensity absorption coefficient a, discussed 
previously, refers specifically to energy that has 
been transformed from acoustic energy to thermal 
energy within the medium during propagation. 

There is no simple way of relating a, the sound 
absorption by a boundary, to a, the absorption of 
sound on passing through a continuous solid, liquid, 
or gas. The values of a used in acoustical design are 
typically empirically determined, being derived from 
laboratory or field tests of specimens of the particular 
material of interest. 

Practical absorbing materials for airborne and 
water-borne sound are based on the mechanisms of 
viscosity and heat conduction. Most useful sound ab- 
sorbers bring these mechanisms into play by being 
porous, perforated, or slotted yet having a significant 
resistance to the flow of air or water through their 
openings. The sound-wave movement through such 
openings dissipates energy by viscosity and thermal 
conduction with the walls. In the case of airborne 
sound, examples are mineral wool, fiberglass batts, 
acoustical ceiling tiles, and domestic drapes, all of 
which are very effective above 1 kHz. See ARCHITEC- 
TURAL ACOUSTICS; MUFFLER. 

Material ratings. Full-sized material samples are typi- 
cally tested in a laboratory reverberation room. The 


Sound absorption 


random-incidence sound absorption (absorption of 
sound waves coming from all directions) of acous- 
tical materials is measured at frequencies between 
125 and 4000 Hz, which corresponds to a signifi- 
cant fraction of the normal audio range encountered 
in speech and music. The average of the absorption 
at 250, 500, 1000, and 2000 Hz is called the noise 
reduction coefficient (NRC). Small material samples 
are tested in a device known as an impedance tube, 
which relates the incident sound wave to the re- 
flected sound wave from the test specimen. In this 
case, the normal-incidence sound absorption (ab- 
sorption of sound impinging along a line perpendic- 
ular to the absorber) is measured. 

Applications to room design. Room walls, floors, and 
ceiling surfaces reflect and absorb sound differently 
in accordance with the acoustic properties of their 
construction material, that is, depending on whether 
they are concrete, glass, brick, or carpet, for exam- 
ple. This is depicted in Fig. 2, which shows how 
sound can arrive at a receiver from many paths with 
varying degrees of absorption along the route. 

Sound generation in the average nonindustrial 
room occurs via speech, music, equipment or ap- 
pliance noise, and occupant movements. The subse- 
quent sound-level buildup or reverberation may be 
undesirably high or low depending on the intended 
use of the room. If the reverberation is too great, the 
room is typically referred to as a live room; if it is too 
low, the room is referred to as a flat room. Reverbera- 
tion level can be modified by treatment of the room’s 
interior surfaces by adding materials with greater or 
lesser sound absorptive qualities. The acoustic de- 
sign of a room depends on its intended use, whether 
as a conference room, music room, lecture room, or 
open-plan office. For example, if the intended use 
is as a church, a greater reverberation may be de- 
sired for the typical music played. If the intended use 
is as an office space, less reverberation may be de- 
sired for speech communication. Conventional rugs, 
hanging drapes, stuffed furniture, and carpeting are 
typical sound absorption materials used in many 
applications. However, additional treatment is 
often necessary. See ACOUSTIC NOISE; SOUND FIELD 
ENHANCEMENT. 

Once the appropriate materials are selected, 
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Fig. 2. Depiction of direct and reflected sound in a room 
with varying wall properties. 
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TABLE 2. Absorption coefficients of typical construction 
materials* 





Frequency, Hz 





Material 125 250 500 1000 2000 4000 





Acoustic paneling 0.15 0.3 0.75 0.85 0.75 0.4 

Brick 0.05 0.04 0.02 0.04 0.05 0.05 
Brick, painted 0.01 0.01 0.02 0.08 0.02 0.03 
Concrete block 0.36 0.44 0.31 0.29 0.39 0.25 


Curtains (Oy Wh Onl her Oey/  (o}eta) 
Floor, concrete 0.02 0.02 0.02 0.04 0.05 0.05 
Floor, wood Oak 02 01 0.1 0.1 Ont 


Floor, carpeted 0.1 015 025 03 03 0s 
Glass fiber 

insulation, 2in. 0.17 055 08 09 085 0.8 
Glass, window 0.35 0.25 0.18 0.12 0.07 0.04 
Gypsum board 

on studs 0.29 0.10 0.05 0.04 0.07 0.09 
Plaster 0.03 0.03 0.02 0.03 0.04 0.05 
Plywood paneling 0.28 0.22 0.17 0.09 0.10 0.11 
Water surface 0.008 0.008 0.013 0.015 0.020 0.025 
One person 0.18 04 046 046 0.51 0.46 





*From C.E. Wilson, Noise Control, Krieger, 1994. 











usually with the help of a professional acoustical 
consultant, the area of treatment, A, is calculated by 
means of the Sabine formula [Eqs. (7)] to determine 


T= 0.049 (English) (7a) 
aA 
V 

T = 0.161 — (metric) (7b) 
aA 


the reverberation time T required for the intended 
use. In the Sabine formula, V is the room volume in 
cubic feet or cubic meters, a is the absorption co- 
efficient of the acoustical material, and A is the area 
in square feet or square meters of the room. If the 
existing reverberation time for a room is unsatisfac- 
tory, acoustical materials with varying values of a 
can be used to replace portions of the room surface 
A to modify the reverberation time. The value of a for 
common acoustical materials depends on frequency; 
typical values are indicated in Table 2. 

Industrial applications. In industrial settings it is the 
overall sound level rather than the quality and intel- 
ligibility of the sound which is typically the driving 
concern. Industrial noise is often at levels sufficient 
to cause damage to hearing if it is not reduced or 
if hearing protection (such as headphones, which 
absorb sound before it enters the ear) is not used. 
Noise reduction is often achieved by adding sound- 
absorbing materials and enclosures around the noise 
source. Qualitatively, the treatment of a noisy indus- 
trial setting is quite similar to the treatment of a noisy 
office space, but special care must be paid to material 
selection due to the harsher operating environment 
of a typical factory. 

Henry E. Bass; Angelo J. Campanella; 
James P. Chambers; R. Bruce Lindsay 
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A system for enhancing the acoustical properties of 
both indoor and outdoor spaces, particularly for un- 
amplified speech, song, and music. Systems are sub- 
divided into those that primarily change the natural 
reverberation in the room, increasing its level and 
decay time (reverberation enhancement systems); 
and those that essentially replace the natural rever- 
beration (sound field synthesis systems). Both sys- 
tems may use amplifiers, electroacoustic elements, 
and signal processing to add sound field compo- 
nents to change the natural acoustics. Sound field 
enhancement is used to produce variable acoustics, 
to produce a particular acoustics which is not attain- 
able by passive means, or because a venue has one 
or all of the following deficiencies: (1) unsuitable 
ratio between direct, early reflected, and reverberant 
sound; (2) unsatisfactory early reflection pattern; and 
Q) short reverberation times. See AMPLIFIER. 

The purpose of sound field enhancement sys- 
tems is not to provide higher speech intelligibil- 
ity or clarity—in contrast to sound reinforcement 
systems—but to adjust venue characteristics to best 
suit the program material and venue, and in such 
a way optimize the subjectively perceived sound 
quality and enjoyment. See SOUND-REINFORCEMENT 
SYSTEM. 

The use of sound field enhancement ranges from 
systems for homes (surround sound systems and 
home theater sound systems) to systems for rooms 
seating thousands of listeners (concert halls, per- 
forming arts centers, opera houses, and churches). 
Outdoor spaces may have elaborate designs to ap- 
proach the sound quality of indoor spaces (advanced 
concert pavilions, outdoor concert venues). Systems 
may be used to improve conditions for both perform- 
ers and listeners. See ARCHITECTURAL ACOUSTICS; RE- 
VERBERATION. 

Basic elements. A sound field enhancement sys- 
tem may consist of one or more microphones, 
systems for amplification and electronic signal pro- 
cessing, and one or more loudspeakers. The signal 
processing equipment may include level controls, 
equalizers, delays and reverberators, systems for pre- 
vention of howl and ringing, as well as frequency di- 
viding networks and crossover networks. See EQUAL- 
IZER; LOUDSPEAKER; MICROPHONE. 

Prerequisites. Many indoor venues have acoustical 
characteristics unsuitable for communication such 
as speech, song, or music. Outdoor venues seldom 
have satisfactory acoustical conditions for unampli- 
fied acoustic communication. Satisfactory acoustic 
conditions include for speech, good speech intel- 
ligibility and sound quality; and for music, appro- 
priate loudness, clarity, reverberation envelopment, 
spaciousness, and other sound quality characteristics 
for the music to be performed and enjoyed. Rooms 
with insufficient reverberation are often perceived 


Sound field enhancement 








sound field enhancement 























sound field 
synthesis 
direct envelopmental reverberant 
field field field 





























non- 


heren 
cacen coherent 

































































reverberation 
enhancement 
wide narrow 
band band 
non- 
coherent 
coherent 

















Fig. 1. Classification scheme for sound field enhancement systems. 


to be acoustically “dry” or “dead.” Insufficient rever- 
beration usually is due to lack of room volume for the 
existing sound absorption areas in the room. The re- 
verberation time is essentially proportional to room 
volume and inversely proportional to sound absorp- 
tion, as indicated by Sabine’s formula. In these cases, 
sound field enhancement systems may be used to im- 
prove acoustical conditions, by increasing the sound 
level and reverberation time. Absence of noise is also 
necessary for good acoustic conditions. See ACOUS- 
TIC NOISE; SOUND ABSORPTION. 

System design. A simple reverberation enhance- 
ment system uses a microphone to pick up the sound 
signal to be reverberated, amplified, and reradiated in 
the auditorium. In most cases, reverberation is added 
to the original signal by electronic digital signal pro- 
cessing. An advanced sound field enhancement sys- 
tem uses several, possibly directional microphones 
to pick up sound selectively from the stage area. Sig- 
nal processing and many loudspeakers are used to 
obtain the desired sound field for the audience and 
performers. Most performing art centers need vari- 
able acoustic conditions that can be changed instan- 
taneously using electronic controls. 

Categories. Sound field enhancement systems can 
be classified according to different criteria. One pos- 
sible scheme is shown in Fig. 1. Sound field synthesis 
systems are designed to provide the necessary sound 
field components, such as early reflections from side 
walls and ceiling (the latter are usually called envel- 
opmental sound), and reverberation by using digital 
filters, which can be designed to imitate a specific 
desired room response. Reverberation enhancement 
systems are designed to primarily increase the re- 
verberation time and sound level of the reverberant 
field, while having negligible influence on the direct 
sound and the early reflected sound. Reverberation 
enhancement systems can be further subdivided into 
regenerative systems, based on the acceptance of un- 
avoidable positive feedback between loudspeakers 
and microphones to increase the reverberation time; 
and nonregenerative systems, based on the use of 
electronic reverberators. Regenerative systems can 


be designed so as to work over a wide or narrow fre- 
quency range; nonregenerative systems are usually 
wide-frequency-range systems. See SOUND; REFLEC- 
TION OF SOUND. 

Routing. A sound field synthesis system can gen- 
erally be modeled as shown in Fig. 2. The system 
uses interconnected microphones and loudspeakers 
to form a more or less complete matrix of channels, 
where signal processing varies between channels. 
The signals from the microphones to the loudspeak- 
ers can be sent either directly between a specific mi- 
crophone and loudspeaker pair, or in some form of 
matrix system feeding the signal from a microphone 
to several loudspeaker channels. A matrixed system 
can be designed as a phased array loudspeaker sys- 
tem. Another possibility is for a matrixed system to 
act as multiple individual systems, with uncorrelated 
or correlated outputs. Digital signal processing may 
be used in the circuits for various reasons. 
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Fig. 2. A sound field enhancement system can generally be 
modeled as a matrix. The system uses microphones (M) 
and loudspeakers (L). 
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Direct sound and room reverberation. The concept 
of room impulse response is useful in understand- 
ing sound field enhancement. The room impulse re- 
sponse describes the sound pressure in a room as a 
result of a very short sound pulse. Hearing uses bin- 
aural analysis to separate various types of information 
such as direction of incidence, direct sound from re- 
verberation, and so on. The binaural room impulse 
responses for various source and receiver positions 
characterize the acoustics of the venue. See ACOUS- 
TICS; BINAURAL SOUND SYSTEM; SOUND PRESSURE. 

It is common to subdivide the (binaural) room im- 
pulse response in the range above 250 Hz into three 
temporal sections. For music in large venues, these 
sections are often direct (0-20 milliseconds), envel- 
opmental (20-150 ms), and reverberant sound (be- 
yond 150 ms), all times referred to after the arrival of 
the direct sound from the source to a listener’s loca- 
tion. The time ranges are approximate, somewhat in- 
dividual, and depend on the frequency range consid- 
ered. The desirable decibel levels of the three sound 
field components (direct, envelopmental, and rever- 
berant sound) depend on the use of the venue—for 
example, whether it is to be used for an organ recital, 
a symphony, an opera, or a speech. Conditions for op- 
timizing these levels can be attained instantaneously 
by electronic means using a sound field enhance- 
ment system, which can be used to change the re- 
flection patterns as well as the relative decibel level 
between the different sound field components. The 
spatial characteristics of the acoustics, such as spa- 
ciousness and envelopment, are mainly determined 
by the directional properties of the envelopmental 
sound and the reverberant sound. 

Direct sound. Direct sound is the combination of 
both the true direct sound (which has traveled di- 
rectly from source to listener) and the various early 
reflections which follow within the first 20 ms. Di- 
rect sound usually determines the perceived direc- 
tion of the sound source. In active systems, un- 
wanted shift of the apparent source direction may 
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Fig. 3. Basic signal paths of a sound field enhancement system showing an acoustic 
feedback channel from a loudspeaker back to a microphone. 






occur due to excessively strong additional sound 
components from unwanted directions. Halls for 
classical music may need the direct sound to be weak- 
ened relative to the envelopmental and reverberant 
sound, if the clarity is excessive. Amplification of di- 
rect sound is needed when (1) the sound source is 
weak, (2) sound is attenuated while passing over an 
audience area, (3) the venue is very large, and (4) the 
performance space is outdoors. 

Envelopmental sound. The envelopmental sound has 
been reflected against walls and ceilings relatively 
few times, particularly in typical halls seating 1000- 
3000 people. The envelopmental sound usually is a 
major part of the energy in the sound reaching the 
listener, and is therefore critical to the balance of the 
three sound field components. In many wide halls, 
lacking sufficiently reflecting side walls and ceiling, 
or in halls with absorptive balconies on the side 
walls, the envelopmental sound may be too weak, 
and it will then be necessary to add envelopmental 
sound. One of the major problems in enhancing en- 
velopmental sound is obtaining equal coverage over 
the audience area. 

Reverberant sound. The impression of reverberation 
on classical music is determined by the decay char- 
acteristics of the envelopmental sound and rever- 
berant sound, as well as by the degree of correla- 
tion between these sounds at the listener’s ears. The 
degree of interaural cross-correlation influences the 
perceived envelopment and spaciousness. Both the 
decay rate and relative level of the reverberant sound 
are Critical to the overall impression of reverberation. 
However, on “running” music it is usually only the en- 
velopmental sound and the early part of the reverber- 
ation of a room which is heard, because new music 
signals continue to arrive to be analyzed by hearing. 
Therefore, the perceived reverberation characteris- 
tics for running music and those for a final chord 
may be different, the latter reverberation being audi- 
ble longer. 

Feedback. In any real environment there will be 
unavoidable positive acoustic feedback from the 
loudspeaker(s) back to the microphone(s) (Fig. 3.) 
The feedback gives rise to regeneration, which may 
be controlled in various ways. Uncontrolled regener- 
ation, that is, system oscillation (usually called how)), 
is obtained at and above a critical channel gain set- 
ting. Other effects of regeneration are ringing (that 
is, temporal artifacts such as flutter echo, primarily 
in wide-bandwidth systems) and coloration (that is, 
spectral artifacts such as frequency response peaks). 
From a control viewpoint, the feedback is character- 
ized by its loop gain, which depends on the micro- 
phones, loudspeakers, amplification, signal process- 
ing, and the room itself. See GAIN; OSCILLATION. 

Stability and coloration. Because of the acuity of hear- 
ing, coloration is noticed at much lower loop gains 
than oscillation because sound at some frequencies 
is exaggerated. An example of frequency response 
curves at some different gain settings of a feedback 
system involving a room is shown in Fig. 4. 

The regeneration also prolongs the decay time 
of the signal because the sound energy from the 


Sound field enhancement 





relative level, dB 





-80 











| l 
1300 1310 1320 1330 1340 


1350 1360 1370 1380 1390 1400 


frequency, Hz 


Fig. 4. Example of frequency response curves for a room with a feedback system. The upper curve shows response for loop 
gain setting close to oscillation; the lower curves shows response for gain settings of —5, —10, —15, —20 dB relative to this 


level. 


loudspeakers compensates the sound absorbed by 
audience, chairs, and so on, leading to an increased 
reverberation time. In simple systems, regeneration 
effects are subjectively annoying since only sound at 
a few frequencies is subject to increase of reverber- 
ation time and level. In a system consisting of many 
channels, the gain can be adjusted so that each chan- 
nel has low gain and coloration, and therefore the 
combined effect of such a system will yield both 
prolonged reverberation, over the entire operating 
frequency range of the channels, and negligible col- 
oration. 

Regenerative systems use this feedback as a means 
of achieving the desired sound field, whereas nonre- 
generative systems try to achieve the desired con- 
ditions with as little feedback as possible in order 
to have maximum control. Sound field synthesis sys- 
tems are usually nonregenerative. To change acoustic 
conditions outdoors, only sound field synthesis sys- 
tems are used. 

Nonregenerative systems. Sound field synthesis can 
often be used to good advantage, provided that the 
environment is acoustically dry, and regeneration has 
been minimized by use of directional microphones 
and loudspeakers. It can also be used when the signal 
is obtained from a source other than microphones, 
for example in a surround sound system or a home 
theater system, in which case there is no possibility 
of regeneration. In these systems, both envelopmen- 
tal and reverberant sound is simulated using signal 
processing and is radiated via side and rear loud- 
speakers. 

Direct sound enhancement. Systems for amplification of 
direct sound may be configured as single-channel 


or multichannel systems. The sound level may be 
increased relative to that of the unamplified direct 
sound by using the masking effect of the direct 
sound. Any substantial positive acoustic feedback 
will give rise to coloration. Therefore, these systems 
require directive or close-mounted microphones, as 
well as highly directive loudspeakers radiating pri- 
marily at sound absorptive surfaces in the room, such 
as audience areas, to minimize feedback. Loudspeak- 
ers should be near the stage so that the audible direc- 
tional impression will agree with the visual impres- 
sion. 

Envelopmental sound enhancement. Improved envelop- 
mental sound can also be implemented along the 
same lines as direct sound amplification. The delayed 
microphone (or other) signals should arrive in a suit- 
able time pattern at all places in the audience area. 
Digital filters are used to provide single or multiple 
delays. Because of the limited number of channels 
and the need for wide sound coverage and substan- 
tial envelopmental sound levels, systems have to be 
protected from positive feedback, which could oth- 
erwise give rise to coloration. See DIGITAL FILTER. 

Envelopmental sound enhancement can be 
achieved by both coherent and noncoherent de- 
signs. In coherent designs, arrays of close-mounted 
loudspeakers are used. 

An example of a coherent system is the wave field 
synthesis approach, the most advanced of the non- 
regenerative systems suggested so far. One can con- 
sider it as a holographic system because it uses an 
array of microphones covering the source, and an 
extensive, dense array of loudspeakers surrounding 
the listening area. It is possible to approximately 
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simulate the reflected waves of a space much larger 
than the one in which the system is installed. For the 
system to work as intended, the transducers must be 
very close and individually steered from the matrix. 
See TRANSDUCER. 

It is also necessary to have some form of adap- 
tive control over the microphones so that the mi- 
crophone array can effectively track the sources (to 
avoid noise problems). In early implementation at- 
tempts, it was necessary to severely compromise 
these principles. A noncoherent approach is used 
by a beacon system, where integrated, small (wall- 
mounted) microphone-loudspeaker channels pick 
up sound and retransmit it with suitable delay and 
reverberation characteristics. Adaptive systems can 
be used to control and eliminate the internal feed- 
back in beacon systems. If a number of beacons are 
used in a moderately sound absorbing or sound dif- 
fusing surface, one can create an illusion of a more 
remotely positioned sound reflecting surface or ob- 
ject. See ACTIVE SOUND CONTROL. 

Reverberant sound enhancement. Reverberation is ob- 
tained along the same lines as the envelopmental 
sound enhancement systems except that many more 
and quite evenly spaced loudspeakers are necessary 
to achieve good coverage over the audience area. 
The signal is sent through long finite or infinite 
impulse response digital filters to be reverberated. 
Many techniques are available to design infinite im- 
pulse response filters with little coloration. In prac- 
tice, however, finite impulse response filters offer su- 
perior sound quality, and can be based on calculated 
or measured room impulse responses. Usually sev- 
eral filters, having insignificant correlation, will be 
used. 

In acoustically dry rooms, the reverberation re- 
sponses of the filters dominate the reverberation 
characteristics heard by the audience. In practice, 
the unavoidable scattered sound by various sound 
reflecting surfaces (for example, chair back rails) will 
fill in and smooth the reverberant sound field from 
the loudspeaker system. 

Regenerative systems. The primary reason for 
using regenerative systems is that some regenera- 
tion (positive feedback) will always be present in 
any sound field enhancement system. Thus, it might 
be better to try to turn the coloration due to regener- 
ation into something useful instead of trying to elim- 
inate the coloration by other, perhaps more audible 
methods. Regenerative systems are used only for en- 
hancement of reverberant sound. Microphones are 
thus placed far from both sound sources and system 
loudspeakers. 

Noncoherent systems. In noncoherent regenerative 
system, microphones and loudspeakers are not posi- 
tioned in regular arrays, as is often the case in sound 
field synthesis systems, but are spread out over the 
room (hung from ceilings or wall-mounted). 

In the simplest systems, each microphone-signal 
processor-loudspeaker channel in the system is al- 
lowed to act independently. The microphones and 
loudspeakers are interconnected to form a matrix of 


channels where only the diagonal contains nonzero 
coefficients. Each channel includes a level control, 
equalizer, and power amplifier (and possibly a system 
for minimizing regeneration). This is the approach 
of the classical reverberation enhancement systems, 
both narrow and wide frequency band. More com- 
plex systems may use microphones and loudspeak- 
ers to form a more or less complete matrix of chan- 
nels of pair combinations, having nonzero transfer 
functions in all channels. See POWER AMPLIFIER. 

Coherent systems. Coherent regenerative systems use 
the same basic approach as more complex noncoher- 
ent systems, but channels are interconnected using 
various signal processing to form many coherent 
channels, reminiscent of some sound field synthesis 
systems. 

Control of regeneration. There are multiple ways of 
controlling the loop gain in order to reduce the prob- 
lems of instability and coloration. The loop gains of 
the electroacoustic channels vary considerably with 
frequency and location. The expected value of the 
peak to mean ratio of the sound pressure level re- 
sponse of a room, as a function of frequency, is not 
very dependent on the room reverberation time for 
wide-frequency-range sounds. One can show that 
the instability condition in a system channel will be 
fulfilled approximately 10 dB before wideband in- 
stability. On the average, the frequency intervals be- 
tween peaks are inversely proportional to reverber- 
ation time. 

Directional transducers. The loop gain can be reduced, 
for example, by using highly directive microphone 
and loudspeaker systems. Because of sound reflec- 
tions from walls and so forth, it is usually not possible 
to achieve a large reduction in the loop gain in this 
way. Even if loudspeaker systems for envelopmental 
sound are designed to radiate primarily at the (sound- 
absorbing) audience areas, their side lobes and the re- 
flection of sound from room surfaces set a limit to the 
reduction of the loop gain. Microphone systems suf- 
fer the same limitations whether highly directive mi- 
crophones or adaptive steered arrays are used. In any 
case, the directive properties of the sound sources 
cause problems. Most music sound sources are quite 
directional, and even speech is so directive that it is 
probably necessary to use personal microphones in 
order to avoid detrimental effects due to directivity 
when performers are moving. 

Adaptive filters. It is impossible to predict the exact 
location in frequency of the loop gain peaks be- 
cause of changes in room temperature, movement by 
persons, and so on. Commercial adaptive electronic 
notch (band rejection) filters can be used which au- 
tomatically detect and remove unwanted loop gain 
peaks. This is necessary because after a peak has 
been controlled, once the channel gain is increased 
a new peak will quickly appear. 

Modulation by linear periodic time-variant filters. Under typi- 
cal conditions, the mean distance between adjacent 
sound pressure level maxima (in a narrow frequency 
band) is approximately a wavelength. Consequently, 
higher loop gain or less coloration can be obtained 


by moving the sound source or the loudspeakers, 
or by adding a small shift to the signal frequency. 
Moving loudspeakers and sources is impractical. In- 
stead, time variance is achieved by use of digital sig- 
nal processing to modulate the loop gain. Several 
modulation methods are possible such as amplitude, 
phase, delay, and frequency modulation. All time vari- 
ance introduces sidebands, that is, extra frequency 
content. See AMPLITUDE MODULATION; FREQUENCY 
MODULATION; MODULATION; PHASE MODULATION. 
The modulation methods vary as to audibility. Gen- 
erally, slow modulation (low modulation frequency) 
is heard as beats, whereas fast modulation (high 
modulation frequency) is heard as roughness. The 
sensitivity of hearing to modulation is highest for 
modulation frequencies around 2-4 Hz, which is un- 
fortunate since this frequency range is often desir- 
able from a technical viewpoint to average the room 
frequency response. To minimize unwanted audible 
effects, the modulation frequency should be kept as 
low as possible, preferably below 1 Hz. Low-pass, fil- 
tered random signals can be used for modulation. 
Most commercial reverberation enhancement sys- 
tems use some form of modulation. A challenge for 
future room acoustics research is to design new mod- 
ulation methods, inaudible yet effective at prevent- 
ing regeneration. See SOUND-REPRODUCING SYSTEMS. 
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Sound intensity 


A fundamental acoustic quantity which describes the 
rate of flow of acoustic energy through a unit of 
area perpendicular to the flow direction. The unit 
of sound intensity is watt per square meter. The in- 
tensity is calculated at a field point (x) as a prod- 
uct of acoustic pressure p and particle velocity u. 
Generally, both p and u are functions of time, and 
therefore an instantaneous intensity vector is defined 
by Eq. (1). The time-variable instantaneous intensity, 


L, D = px, Dux, D) a) 


I(x, t), which has the same direction as Ù(x, t), is 
a nonmoving static vector representing the instanta- 


neous power flow through a point (x). See POWER; 
SOUND PRESSURE. 

Frequency spectrum. Many acoustic sources are sta- 
ble at least over some time interval so that both 
the sound pressure and the particle velocity in the 
field of such a source can be represented in terms 
of their frequency spectra. The static sound inten- 
sity vector T(x, w) represents the time-independent 
sound power, which propagates through the point 
(x) in the direction of the particle velocity. The vector 
I, w) can be calculated at a frequency w (or fre- 
quency band Aw) from Eq. (2), where ¢ is the phase 


T(x, w) = pæ, w) - Ux, w) - cos (p) D 


shift between the real quantities p and u. It is prac- 
tical to define both p and u as complex quanti- 


ties so that the intensity vector J can be calculated 
from Eq. (3), where * denotes a complex conjugate. 


T(x, w) = '/rRe[px, w) - Ù“ œ, w)] 6) 


Because intensity represents power, Tis always a real 
quantity. See COMPLEX NUMBERS AND COMPLEX VARI- 
ABLES; PHASE (PERIODIC PHENOMENA). 

Measurement. The applications of sound inten- 
sity were fully developed after a reliable technique 
for intensity measurement was perfected. Sound in- 
tensity measurement requires measuring both the 
sound pressure and the particle velocity. Very pre- 
cise microphones for sound-pressure measurements 
are available. Because there are no microphones for 
a direct particle velocity measurement, an indirect 
technique based on Euler’s equation (4), which links 
1 3px, w) 
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U(x, w) = 
u with a pressure gradient, is used. Here, p is the 
density of the fluid medium, c is the sound speed, k 
is the wave number (k = w/c), andj = / —T1. See 
MICROPHONE. 

Measuring the sound pressure at close points with 
two precision microphones approximates the pres- 
sure gradient. To keep the measurement errors at an 
acceptable level, both microphones must have iden- 
tical sensitivity and be phase-matched to 0.05°. The 
calculation of sound intensity is usually obtained by 
feeding the microphone output into a dual-channel 
fast-Fourier-transform (FFT) analyzer. The intensity 
is proportional to the imaginary part of the cross- 
spectrum of the input voltages. This measurement 
arrangement measures the intensity component in 
the direction of an axis connecting the two mi- 
crophones. To determine the total intensity vector, 
three perpendicular components must be obtained 
by measuring along three mutually perpendicular 
axes. Microphone probes, consisting of six micro- 
phones, to implement such a measurement have 
been constructed. Both international and national 
standards define the requirements on the instrumen- 
tation to secure the needed measurement precision. 
See FOURIER SERIES AND TRANSFORMS. 
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Sound pressure 


The usefulness of the measurement technique 
based on two microphones goes beyond the mea- 
surement of sound intensity. The sound pressure can 
be obtained by interpolating the pressure measured 
by the two microphones. Similarly, both the poten- 
tial and kinetic energy densities as well as the particle 
velocity vector can be obtained. 

Applications. One essential application of the in- 
tensity technique is the measurement of sound 
power radiated from sources. Such a measurement 
can be done with a precision unobtainable from pres- 
sure measurements. The radiated sound power is 
obtained from intensity measurements over an area 
circumscribing the source. The knowledge of the ra- 
diated power makes it possible to classify, label, and 
compare the noise emissions from various pieces of 
equipment and products and to provide a reliable 
input into environmental design. The intensity tech- 
nique is perfect for transmission and insertion-loss 
measurements. Because the measurements can be 
done in the near field of acoustical radiators, a re- 
liable overall functional analysis and identification 
of sound radiating sources can be performed. Gen- 
eral sound-field studies and mapping are essential for 
many applications. See SOUND. Jiri Tichy 

Bibliography. E J. Fahy, Sound Intensity, 2d ed., 
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Sound pressure 


The incremental variation in the static pressure of a 
medium when a sound wave is propagated through 
it. Sound refers to small-amplitude, propagating pres- 
sure perturbations in a compressible medium. These 
pressure disturbances are related to the correspond- 
ing density perturbation via the material equation of 
state, and the manner in which these disturbances 
propagate is governed by a wave equation. Since a 
pressure variation with time is easily observed, the 
science of sound is concerned with small fluctuat- 
ing pressures and their spectral characteristics. The 
unit of pressure commonly used in acoustics is the 
micropascal (1 Pa = 1 N/m? = 107% dyne/cm? = 
10~> bar). One micropascal is approximately 10-™! 
times the normal atmospheric pressure. See PRES- 
SURE; PRESSURE MEASUREMENT; WAVE MOTION. 

The instantaneous sound pressure at a point can be 
harmonic, transient, or a random collection of waves. 
This pressure is usually measured with an instrument 
that is sensitive to a particular band of frequencies. 
If the pressure is measured in a specified frequency 
band over a short period of time, the mean value 
of the fluctuating pressure is found to be zero, but 
the mean value of the square of the pressure, (p°), 
is a positive quantity. The square root of (p°) is the 
root-mean-square pressure, Prms, and is proportional 
to the time average of the sound intensity, (/). The 
measured pressures will depend on the bandwidth 
and filter response of the measurement instrument. 
The peak sound pressure for any specified time in- 
terval is the maximum absolute value of the instan- 


taneous sound pressure in that interval and is equal 
toV/2 Prms for sinusoidal signals. Acoustic pressures 
are commonly studied as a function of frequency, 
where the frequency can range over the sonic band, 
which lies between 20 and 20,000 Hz and is bounded 
by the infrasonic and ultrasonic frequency bands. 
The Fourier transform and its digital implementa- 
tion, the fast Fourier transform (FFT), are widely used 
to characterize the spectrum of the (p°) versus fre- 
quency. Thus (p°) in a given band, Af, is often normal- 
ized by this bandwidth to yield the spectral density, 
(p?)/A f, with units of [uPa]?/Hz. See FOURIER SERIES 
AND TRANSFORMS; SOUND INTENSITY. 

A concept widely used in acoustics is “level,” 
which refers to the logarithm of the ratio of any two 
field quantities. When the ratio is proportional to a 
power ratio, the unit for measuring the logarithm 
of the ratio is called a bel, and the unit for measur- 
ing this logarithm multiplied by 10 is called a decibel 
(dB). The sound intensity, J, which describes the rate 
of flow of acoustic energy (acoustic power flow) per 
unit area, is given by the mean square pressure, (p°), 
divided by the acoustic impedance, defined as the 
product of the medium density, p, and compressional 
wave speed, C; that is, the intensity J = (p°)/pC. A 
ratio proportional to power can be formed if a ref- 
erence quantity is specified. The reference pressure 
for water is usually chosen to be 1 uPa with pC ~ 
1.545 x 10° Pa- s/m, and the reference pressure for 
ait is 20 wPa with pC © 413 Pa - s/m. The effective 
sound pressure level (SPL, symbolized Zp) is a use- 
ful quantity since the power transmitted by a sound 
wave can be related to (p°) for a specified medium; 
however, when comparing levels in seawater to 
those in air, the acoustic impedance difference be- 
tween the two media must be considered. With these 
considerations in mind, the SPL [dB re Pref] is given 
by Eq. (1). 


SPL [dB re Pret] = Lp = 10 logyo((Z) / (Tree) 


= 10log,y (0°) /(v%.)) a) 


See DECIBEL. 

When the sound pressure level is restricted to a fre- 
quency band, the term “band sound pressure level” 
(BSPL, symbolized L,,) is used. Air acoustics stan- 
dards use a weighted filter response such as an A or C 
filter with either fast (’, 125-ms) or slow (S, 1000-ms) 
exponential time averaging. In these instances the 
levels are designated by L4 [dB re 20 uPa] or Lç [dB 
re 20 Pa]. When the mean square pressure is nor- 
malized by the bandwidth, the quantity becomes a 
density and is referred to as a spectral density or a 
spectrum level with the sound pressure spectrum 
level (SPSL, symbolized L,,) defined by Eq. (2). 


SPSL [dB re( p*) rer/Hz] = Lys 
= 10 logiol(p*)/AND/Cp*)/M re] 2) 


See NOISE MEASUREMENT; SOUND. William M. Carey 


= 
Sound recording 


The technique of entering sound, especially music, 
on a storage medium for playback at a subsequent 
time. The storage medium most widely used is mag- 
netic tape. See MAGNETIC RECORDING. 

Monophonic sound recording. In the simplest form 
of sound recording, a single microphone picks up 
the composite sound of a musical ensemble, and the 
microphone’s output is recorded on a reel of 1/,-in. 
(6.35-mm) magnetic tape. This single-track, or mono- 
phonic, recording suffers from a lack of dimension, 
since in playback the entire ensemble will be heard 
from a single point source: the loudspeaker. 

Stereophonic sound recording. An improved re- 
cording, with left-to-right perspective and an illu- 
sion of depth, may be realized by using two micro- 
phones which are spaced or angled appropriately 
and whose outputs are routed to separate tracks on 
the tape recorder. These two tracks are played back 
over two loudspeakers, one each on the listener’s left 
and right. Under ideal conditions this stereophonic 
system will produce an impressive simulation of the 
actual ensemble, and the listener may identify the ap- 
parent location of each integral part of the ensemble. 
See STEREOPHONIC SOUND. 

As early as 1931, Alan Blumlein applied for a 
patent on several stereophonic microphone place- 
ment techniques. Despite the advances in record- 
ing technology that have been made since then, the 
methods described in his patent may still be found in 
use today. One example uses two microphones con- 
structed within a single housing. The microphones’ 
diaphragms are at right angles to each other, and 
when the device is placed at a proper location, the 
diaphragms point toward the left and right sides of 
the ensemble. 

The microphone picks up essentially what listen- 
ers would hear if they were in the same room. Instru- 
ments at the rear are heard imperceptibly later than 
those placed closer to the microphone. A complex 
mixture of direct and reflected sound is heard, and 
although a listener would scarcely be conscious of 
these subtle relationships, the brain uses this infor- 
mation to analyze the size of the ensemble and the 
acoustic conditions of the playing room. See HEAR- 
ING (HUMAN). 

The successful use of this technique demands an 
acoustically excellent concert hall or recording stu- 
dio, and an ensemble that is musically well balanced. 
The recording engineer has little or no technical con- 
trol over the recorded sound, beyond moving the 
microphone device to a more favorable location. See 
SOUND-RECORDING STUDIO. 

Binaural sound recording. In this system, two mi- 
crophones are placed on either side of a small acous- 
tic baffle in an effort to duplicate the human listening 
condition. The recording is played back over head- 
phones, so that the microphones are, in effect, an 
extension of the listener’s hearing mechanism. Al- 
though the reproduced program is most faithful to 
the original, headphone listening restricts the lis- 
tener’s freedom to move about and excludes oth- 


Sound recording 


ers from hearing the program. Loudspeakers are, 
of course, more widely used, yet the distance be- 
tween them, as well as their distance from the lis- 
tener, somewhat defeats the binaural effect, and 
other stereophonic microphone placements are apt 
to prove more satisfactory. 

Multitrack sound recording. To give the recording 
engineer more technical control over the record- 
ing medium, many recordings are now made using 
a multiple-microphone technique. In place of the 
stereo microphone, one or more microphones are 
located close to each instrument or group of instru- 
ments. See MICROPHONE. 

In the control room the engineer mixes the out- 
puts of all microphones to achieve the desired musi- 
cal balance. As a logical extension of this technique, 
the microphone outputs may not be mixed at the 
time of the recording, but may be routed to 24 or 
more tracks on a tape recorder, for mixing at a later 
date. 

When many microphones are so used, each instru- 
ment in the ensemble is recorded—for all practical 
purposes—simultaneously. The complex time delay 
and acoustic relationships within the room are lost in 
the recording process, and the listener hears the en- 
tire ensemble from one perspective only, as though 
he or she were as close to each instrument as is 
each microphone. Electronic signal processing de- 
vices may not be entirely successful in restoring the 
missing information, and the listener hears a record- 
ing that may be technically well executed, yet lacking 
the apparent depth and musical cohesiveness of the 
original. 

However, the multitrack technique becomes ad- 
vantageous when it is impractical or impossible to 
record the entire ensemble at once stereophonically. 
Especially in the case of new music, it may be imprac- 
tical to spend the amount of recording time required 
to achieve an ideal balance. By recording the various 
sections on separate tracks, the engineer may experi- 
ment with balances later on, after the musicians have 
left. 

In contemporary popular music, each member of 
a small group may wish to play or sing more than 
one part. For example, a basic rhythm section may be 
recorded, with each instrument routed to a separate 
track on the multitrack tape recorder. To keep the 
instruments acoustically isolated from each other, 
the engineer may place the microphones as close 
up as possible. For further isolation, acoustic baf- 
fles, or goboes, may be placed between instruments, 
and particularly subdued instruments may be placed 
within isolation booths. 

As a consequence of this technique, each micro- 
phone picks up only the direct sound of the instru- 
ment in front of it. The echo and reverberation within 
the room, and the sound of the other instruments, 
are not heard. Therefore, by processing the output 
of any one microphone or tape recorder track, the 
engineer may make major changes in the sound of 
a musical instrument without affecting any of the 
others. 

After the basic rhythm section has been recorded, 
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Fig. 1. Adding tracks to an unused recorder tape. 


the group may wish to add some additional parts. Or 
at a later date an arranger may write a string or brass 
accompaniment, after listening to the rhythm tracks. 
In either case, the musicians listen over headphones 
to the material already recorded and play along in 
accompaniment. The accompaniment is recorded on 
one or more of the previously unused tracks (Fig. 1). 

Since the tape recorder’s playback head is some 
distance from the record head, there is a slight time 
delay after the tape passes the record head until 
it reaches the playback head. If the musicians lis- 
tened to the previously recorded program via the 
playback head, this time delay would place the new 
material on the tape out of synchronization with the 
original program. Consequently, the record head is 
used for monitoring whenever new material is being 
added. Thus it performs a dual function simultane- 
ously: playback of the previously recorded program, 
and recording of the new material on previously un- 
used tracks. 

Multitrack recording console. A modern recording 
console gives the engineer the flexibility required to 
meet the demands of multitrack sound recording. At 
the console, microphones may be quickly routed to 
any track on the tape recorder, and just as quickly re- 
assigned to other tracks as the recording progresses. 
Since the musicians in the studio may not be able 
to hear each other due to the acoustic isolation be- 
tween instruments, the console provides a studio 
headphone monitoring system, which the engineer 
may regulate to give the musicians the balance re- 


quired. Previously recorded material is also routed 
through the console to the musicians’ headphones 
when the performers are to play along in accompa- 
niment. Although any of this material can be moni- 
tored over a loudspeaker in the studio, headphones 
are preferred so that the microphones do not again 
pick up the previously recorded program. 

In the control room the engineer may wish to 
hear a somewhat different balance, and the console 
permits this flexibility. For testing purposes, the engi- 
neer may briefly listen to the output of just one micro- 
phone, without affecting the recorded program. Dur- 
ing playback the balance of the entire program may 
be adjusted without affecting the recording setup. 

Multitrack mixdown session. When all the record- 
ing work is completed, the engineer plays back the 
multitrack tape, mixing and balancing the many sep- 
arate tracks to produce the desired stereophonic 
program (Fig. 2). During this mixdown session the 
outputs of the multitrack tape recorder are routed 
through the console, just as the microphones were 
during the original recording session. In effect, the 
multitrack tape now takes the place of the studio 
musicians, and the various tracks—each with its dis- 
tinctive musical information—are mixed to create 
the final master tape. This master tape is used as the 
first step in the production of phonograph records. 

During the mixdown session a wide variety of 
processing devices may be used to modify the sig- 
nals that were recorded earlier. An equalizer changes 
the apparent frequency response of a musical instru- 
ment, compressors and limiters restrict the dynamic 
range, and expanders attenuate or remove low-level 
signals. 

The natural ambience of a large room may be 
simulated with delay lines and reverberation cham- 
bers. One or two electronically delayed echo signals 
may be mixed with the direct program, while the 
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Fig. 2. Multitrack sound recording and mixdown session. 


reverberation chamber simulates the multiplicity of 
reflections typical of a large concert hall. Signals that 
are delayed just slightly, perhaps 25 milliseconds or 
less, may be mixed with the direct sound to create 
the illusion of a larger group. The delay is not heard 
as such, but simulates the slight imprecision of attack 
that is characteristic of, say, a large group of violins. 
Excessive use of any of these signal-processing de- 
vices may create special effects that the musicians 
might find impossible to produce on their instru- 
ments. 

In the case of a monophonic or stereophonic 
sound recording, the engineer is unable to isolate 
and modify, or remove entirely, one or more parts 
of the ensemble. However, the recording may con- 
vey an excellent sense of depth, with the complex 
relationships between instruments well preserved. 

At the cost of losing some of the ensemble’s spatial 
relationships within the room, the multitrack sound- 
recording technique offers the recording engineer 
an unprecedented degree of technical control over 
the entire recording and record production process. 
See SOUND-REPRODUCING SYSTEMS. John M. Woram 
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of Recording Engineering, 2d ed., 1992; D. M. 
Huber, Modern Recording Techniques, 5th ed., 
2001; J. Woram, Sound Recording Handbook, 1989; 
A. J. Zaza, Mechanics of Sound Recording, 1991. 
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Sound-recording studio 


An enclosure treated acoustically to provide con- 
trolled recording of music or speech. Professional 
sound-recording studios range from small home stu- 
dios (such as a converted garage or basement) to 
large facilities using state-of-the-art equipment. They 
consist of two rooms, the main studio or stage (per- 
formance area) and the audio control room. The 
stage includes one or more microphones, is acousti- 
cally isolated to ensure that external sounds do not 
interfere with the recording, and can be broken up 
into small booths to separate individual performers. 
The audio control room contains the mixing console 
and recording devices. 

Motion picture sound stages. These large enclo- 
sures, with a volume of 1,000,000 ft? (28,000 m3), 
are used for photography (to allow long shots), video 
recording, and sound recording. As well as keeping 
out extraneous noises, the interior walls and roof 
are treated with sound absorption material covering 
up to 85% of the total interior surface to achieve a 
short reverberation time (RT). This assures intelligi- 
ble speech recording, as well as the realistic simu- 
lation of outdoor scenes. The sound mixing person 
(mixer) is on the floor of the stage. See CINEMATOG- 
RAPHY; OPTICAL RECORDING; REVERBERATION. 

Television stages. These are smaller than motion 
picture sound stages and require separate audio and 
video control rooms. Emphasis is on dialog; some 
television stages allow audience participation, such 
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Fig. 1. Concrete housing for air-conditioning equipment to attenuate both airborne and 
solid-borne sound transmission to a nearby studio. 


as panel shows and short comedies where audience 
clapping is an essential part of the program. The mi- 
crophone operators are on the stage with the camera 
operators. Audio mixing of a video program takes 
place in the audio control room. Still another en- 
closure is required for the lighting console. Stages 
are often built in pairs joined by a passageway, al- 
lowing construction on one stage while recordings 
take place in the other. Sound insulation is pro- 
vided between these stages so that hammering in one 
room will not disturb recordings on the other. Sound 
and video recording and lighting features are care- 
fully rehearsed. Light fixture dimming is controlled 
electrically, often with high electrical current pulses 
that necessitate that microphone cables be specially 
shielded. 

HVAC systems. The presence of several hundred peo- 
ple in a television stage requires a high-volume heat- 
ing, ventilation, and air-conditioning (HVAC) system 
of low noise level, which is achieved by large ducts 
purveying air with a velocity less than 500 ft (150 m) 
per minute. HVAC condensers, chillers, and cooling 
towers are remote from the television stage in an 
isolated enclosure at ground level or underground 
(Fig. 1). Mechanical equipment is supported on 
vibration isolators. Enclosure walls are lined with 
sound absorption material. Heat removal ventilation 
may include noise reduction openings. See AIR COOL- 
ING. 

Studio floors. Motion picture and television stages 
have concrete floors for durability, load-bearing ca- 
pacity, and absence of squeaks. They are isolated 
from the walls to minimize the transmission of exte- 
rior mechanical vibrations. Floors may include drain 
pipes for rain scenes on the stage, and pits may be 
built into the floor to allow video recording of simu- 
lated below-ground scenes without building up the 
set. 

Figure 2 shows a typical foundation detail of two 
adjoining television stages separated by a distance 
of at least 2 ft (0.6 m). Sand and decomposed gran- 
ite form a cushion for the concrete floors, which are 
structurally isolated at the perimeters by a 1-in.-thick 
(2.5-cm) elastic seal. The walls are lined with 2-in.- 
thick (5-cm) fiberglass sound-absorbent treatment, 
covered with fireproof facing. The interspace be- 
tween the stages may be covered with sheet lead. 
See TELEVISION STUDIO. 

Music-recording studios. Music-recording studios 
vary in size up to 100,000 ft* (2800 m°), the largest 
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Fig. 2. Adjoining television stages. (a) Vertical cross 
section. (b) Top view. 


being for the recording of symphonies and opera 
music by large recording companies. However, the 
majority of music-recording studios are owned and 
operated by smaller independent producers, or are 
rented studios or empty theaters. The reverberation 
time will vary. The recording microphones may be 
placed near the artists for individual clarity, or hung 
at strategic locations just above and in front ofa stage. 

Multitrack recording. Multitrack magnetic recording 
appeared first as only 3-track audio recordings, then 
4-track, then 8-track. Now some of the larger record- 
ing companies employ 32 and 64 tracks. Each track 
contains the sound from only one or two musicians. 
The conductor is replaced by the recording engineer, 
who mixes the many music parts in the audio con- 
trol room. Concomitant with this type of recording is 
the installation of drum cages, piano chambers, vocal 
booths, trumpet stalls, and various other cubicles to 
achieve clean individual section tracks. A number of 
adjacent enclosures have windows in the partitions 
for performers to see each other to keep in time. See 
MAGNETIC RECORDING. 

Figure 3 illustrates an audio/video recording stu- 
dio that allows the mixer a direct view of each in- 
strument section. The video mixers have only a par- 
tial direct view. They watch their program material 
over a closed-circuit television circuit anyway for 
best videography. 

Special acoustical devices. Special acoustical features, 
such as sound traps and bass traps, may be installed 
in small recording studios to achieve distinction in 
recording music for records, audio cassettes, video 
cassettes, and compact disks (CDs). 

Built in a pit between adjoining instrument sec- 
tions, a sound trap (Fig. 4) replaces a barrier that 


would not allow neighboring performers to see each 
other. The trap is generally filled with 2-in.-thick 
(5-cm) fiberglass panels spaced 1 in. (2.5 cm) apart. 
When a performer sits over a sound trap 3 to 4 ft 
(1 to 1.3 m) square, much of the sound from the 
instrument is absorbed into the trap due to the 
low sound pressure over the trap compared to 
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Fig. 3. Plan of audio/video recording studio for multitrack 
recording. 
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Fig. 4. Sound trap used to minimize reflections from 
overhead instrument to adjacent instrument microphones. 
Diffracted sound rays are absorbed into the pit where 
sound pressure is low. 


the adjoining floor. Trap depth D should be a quarter 
wavelength of the lowest frequency to be absorbed. 
The trap is covered by a metal grid to support the 
performer. See SOUND. 

An entire studio can be built from such sound traps 
and still have sound reflective walls, floor, and ceil- 
ing. Both the traps and the reflective surfaces are 
practically frequency-independent in their absorp- 
tion and reflection characteristics so that such en- 
closures do not exhibit the coloration (spectral am- 
plitude variations) from sound reflected from room 
boundaries. 

A bass trap absorbs sound below about 100 Hz. 
This may be a sound trap deep enough to absorb low- 
frequency sound, a resonant absorber (Helmholtz 
chamber), or a thin panel that resonates on the layer 
of air trapped between it and a rigid wall near which 
it is placed. A common drywall on studs will act as a 
bass trap in the range 80-160 Hz. 

Sidewall sections. Another device used in music 
recording is acoustically variable sidewall sections. 
These allow parts of a wall in a rectangular room 
to be turned to be either sound absorbent or reflec- 
tive without producing discrete or structured echoes 
(Fig. 5). 

Control room mixing and audition. A control room is used 
to monitor a musical program performed in a studio. 
A modern control room also serves as a venue for the 
mixer to later adjust a multitrack recording for vol- 
ume levels, frequency response (equalization), and 
reverberatory notes, and to expand or compress cer- 
tain musical passages. The mixer does this by listen- 
ing to the program reproduced by one or more loud- 
speakers in the control room and therefore assumes 
the role of the conductor. The three optimal acous- 
tic treatment theories for such a control room are 
live-end-dead-end (LEDE), EDEL (LEDE spelled back- 
wards), and MOD (moderate). 

For LEDE, a dead or absorbent treatment is placed 
on surfaces near the loudspeaker. The opposite end 
of the room is made reflective or diffusive (to scat- 
ter sound) [Fig. 6a]. This arrangement is intended 
to prevent coloration of the reproduced program by 
first-order reflections from the sidewalls and ceiling 
about the loudspeaker. It is, of course, impossible 
to prevent first-order low-frequency reflections from 
the window below the loudspeaker, but the origina- 
tors of the system claim that these reflections are not 
important for the purpose of mixing sound tracks. 

For EDEL, the acoustic treatment has the same al- 
lotment of hard and soft surfaces, but with the ab- 
sorbent end behind the mixer (Fig. 6b). Reflective 
or diffusive surfaces are placed near the loudspeaker 
and above the mixing console in the control room, 
while the rear wall is made absorptive. 

For MOD, the acoustic treatment comprises a uni- 
form disposition of the sound-absorbent material all 
about the control room. 

There is little consensus whether LEDE, EDEL, or 
MOD treatment is best for the mixer. This is un- 
derstandable, since the reverberation time in such a 
room is generally very short, approximately 0.1 dog 
V), where V is the volume of the control room in 
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Fig. 5. Variable sidewall treatments. (a) Rotatable cylinders. 
(b) Rotatable triangular prisms. (c) Rotatable cylindrical 
sections. (d) Folding splays. (e) One stationary and one 
sliding perforated panel. (f) Retractable curtains. 

(g) Rotatable flat panels. 


cubic feet (1 m? = 35 ft*). This reverberation time 
may be only 0.4 second for a large control room of 
10,000 ft? (280 m°). None of these treatments will 
improve the intelligibility of speech, singing, or mu- 
sic. Rather, each treatment supports subjective pref- 
erences that may achieve the individual esthetic goal 
of the mixer. 

ADR room. The automatic dialog replacement 
(ADR) or postsynchronizing room (Fig. 7), tradition- 
ally of the motion picture industry, also has wide 
use in television production. The performer watches 
himself or herself on the screen and then repeats 
the original speech with lip synchronism. ADR em- 
ploys forward-and-backward transportation of a film 
or magnetic tape sound track so that short scene 
bites can be repeatedly reproduced visually and au- 
rally for an actor who did not intelligibly record his 
or her speech during original filming or video record- 
ing. This may have been because the performer had 
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Fig. 6. Control rooms. Top and side views are shown for each type of room. (a) LEDE room with live rear wall and 
sound-absorbent environment at the frontal sidewalls to prevent lateral reflections from mixing with the direct signals from 
the loudspeaker. (b) EDEL room with absorbent rear wall and live front end as in conventional listening locales. 


laryngitis or because unavoidable noise was present, sary to credibly recreate the ambiance (acoustical 
such as seaside surf or nearby vehicular or aircraft liveness) and the soundscape (original background 
traffic. sound) where the original dialog took place. This 

To achieve realism in the recording, it is neces- can be done by placing the actor in an acoustically 
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Fig. 7. ADR (automatic dialog replacement) or postsynchronizing studio with Foley pits containing various materials for 
generating sound effects. 


adjustable environment within the ADR room. Such 
“variable acoustics” is achieved on one or more 
sidewalls of the ADR facility with hinged reversible 
panels or rotatable cylindrical columns that are ab- 
sorbent on one side and reflective on the other 
(Figs. 5 and 7). 

It is frequently necessary to add synthetic sound 
effects to a scene. When played back, the original 
recording of a slamming door or a pistol shot may 
not sound natural. ADR rooms are often equipped 
with Foley pits (Fig. 7) containing materials to imi- 
tate sound effects that could not be recorded well 
during filming or video recording, or devices that 
provide realistic sound signals for the events pictured 
on the screen. Such devices may include a boardwalk 
to allow the recording of footstep sounds on wood 
when the original scene was performed on a stage 
with a concrete floor. A parquet dance floor is used 
to duplicate footsteps shown on the screen, espe- 
cially in tap dancing, where the dancer listens to the 
music over headphones, so that his or her recorded 
footstep signals are not marred by extraneous music. 
Soundscapes such as birds or surf may be added from 
a library of environmental sounds. Either mixing con- 
sole shown in Fig. 7 may be used, depending on the 
mixer’s preference or requirements for recording the 
sound of a scene. 

Optimal studio room shape. Sound recording stu- 
dios are often highly sound absorbent (“dead”), in 
which case room shape is unimportant. However, 
for studios that cannot be made dead, or where 
some reverberance is desirable, room shape is criti- 
cal. Small rooms of regular shape (cubes and squares) 
have redundant resonances (degenerate modes) that 
will color sound reverberating in that room. Table 1 
shows some acoustically recommended dimension 
ratios to minimize the number of degenerate modes. 
Such rectangular rooms require treatment for flut- 
ter echoes. A recording studio having the harmonic 
ratio of 1:2:3 will provide the greatest floor area com- 
pared to other ratios of rooms of equal volume, ac- 
commodating the most performers in a given space. 
A room with a ratio of 2:3:5 has the highest ceiling 
compared to the other cited ratios, providing a max- 
imum number of sidewall reflections (diffusion) and 
hence good stereophonic sound reception, as may 
be desired for a concert hall. Adherence to any of 
these ratios is more important for reverberant rooms 
than it is for studios having the prescribed short re- 
verberation time of 0.1 (log V). 
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Reverberation time. This is predicted by the Sabine 
formula RT = 0.049 V/A, where Vis the room volume 
in cubic feet and A is the sum total of all the acous- 
tic absorption in the room, measured in sabins. dn 
metric units, RT = 0.161 V'/A!, where V! is the room 
volume in cubic meters and A! is the total absorption 
in metric sabins.) In large studios and stages, the re- 
verberation time may be greater than predicted. The 
reported absorption coefficient, determined from 
tests on small samples, can be greater than unity, but 
when the same material is placed over the majority 
of walls and ceiling areas, the resulting reverbera- 
tion time will be longer, indicating that the material 
absorption coefficient is then as low as 0.80. 

Sound and television stages must have a very short 
reverberation time to simulate an outdoor acousti- 
cal atmosphere. Music recording studios, ADR cham- 
bers, and control rooms may have a more live inte- 
rior. The recording engineer may wish to change the 
reverberation time. Performers often prefer a live en- 
vironment for orchestral or ecclesiastical music. Jazz 
and rock-and-roll performers prefer a shorter rever- 
beration time. Figure 5 shows ways to change the 
sound reflectivity of the walls in a music recording 
studio or an ADR room, thus changing the sound live- 
ness near such walls. These devices minimally affect 
the reverberation time found in the middle of the 
room unless they are installed on all surfaces of that 
room. To change the reverberation time in a large 
studio, it is necessary to move either one of its walls 
or its ceiling. This is rarely done because of the ex- 
pense. See ARCHITECTURAL ACOUSTICS. 

Windows. Recording studio windows for control 
rooms and public viewing must present high sound 
insulation. They may consist of two thick glass panes 
widely spaced and sealed to the adjoining wall sur- 
face. To avoid first-order sound reflections, they may 
be triple-pane units, with the center pane made of a 
3/4-in.-thick (19-mm) to 7/8-in.-thick (22-mm) noise 
barrier glass. The outside panes are 1/4-in.-thick 
(6-mm) Lucite or Plexiglas sheets cold-bent through 
an arc of 60° with a radius equal to the window 
width. 

Noise control. Recording studios’ background 
noise level must be very low. This is particularly im- 
portant for digital recording, because of its very wide 
dynamic range of 80-90 dB. Heating, ventilating, and 
air-conditioning equipment must not be installed on 
the roof of such studios. Airflow control or augment- 
ing devices (VAV boxes) must not be located at, over, 

















TABLE 1. Rooms with recommended ratios of dimensions 

Rooms normalized Rooms normalized 

for equal volume for equal height 
Ratio of room Normalized Relative Normalized Relative 
dimensions Name of ratio ratio floor area ratio floor area 

1:2:3 Harmonic 12:3 6 123 6 
1:6:3:4 Knudsen 1.08:2.03:2.71 5:5 11:88:25 4.7 
3:5:8 European 1.1:1.84:2.95 5.43 1:1,.67:2.67 4.45 
61225) Volkmann 1.14:1.82:2.85 5.19 EGE 275) 4 
(61/2 — 1):2:(51/2 + 1) Golden section 1.12:1.82:2.94 5.35 11162:2162 4.24 
2:3:5 Sabine ilelivelehor2.92 Sa 1215:25 3.75 
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Fig. 8. Room background noise balanced criterion curves. 


or near the studio space. Large HVAC mechanical 
equipment (boilers, chiller pumps, condensers, and 
cooling towers) should be located on the ground at a 
distance from the studio (Fig. 1). It may be necessary 
to employ a sound attenuator to the outdoor ventila- 
tion intake and exhaust openings in that enclosure. 

Noise criterion (NC) curves. L. Beranek’s balanced NC 
curves (Fig. 8) provide a convenient measure of the 
annoyance of background noise in buildings. Match- 
ing the noise level found in a room to any of these 
curves is done as a tangent criterion, where the noise 
background level in every frequency band must not 
exceed the indicated curve. A dBA level computed 
from the sum of all bands is often numerically 5 deci- 
bels or more greater than the corresponding NCB 
value. Table 2 shows the recommended noise crite- 
rion balanced (NCB) value for occupied spaces. 

Use of the limiting-curve criterion alone can lead 
to dubious results for low-frequency mechanical 
equipment noise (“rumble”). Low-frequency sound 
(and vibration) above 60 dB can cause audible rat- 
tles in ceiling and studio fixtures. The noise from an 
air-conditioning installation could exceed an NCB-30 
specification, or the alternative RC-30 specification, 


by 2 or 3 dB in the octave band whose midfrequency 
is 1000 Hz and thus be deemed unacceptable, while 
the spectrum of another installation may hug the 
NCB-30 at all points on the curve and be declared 
acceptable. Vibration and discomfort can result from 
low-frequency sound. Thus, in Fig. 8, additional lim- 
its Ç - -, moderate; — —, severe) are placed on the 
lowest-frequency noises. To minimize these prob- 
lems, the alternative RC curve set with suppressed 
lower and higher frequency levels is used as an HVAC 
design tool by mechanical engineers. 

Background limit. This may be NCB-15 (about 20 dBA) 
for high-quality sound recording studios or NCB-25 
(G0 dBA) for television studios (Fig. 8). For an ordi- 
nary studio it should not exceed NCB-20 (25 dBA). 
When many performers are in the room or photogra- 
phy must take place there, the limit should be NCB- 
25 (30 dBA). See NOISE MEASUREMENT. 

Exteriornoise. The outdoor noise at a prospective site 
is surveyed over at least one 24-hour period and over 
several days if possible. Figure 9 shows the 24-hour 
noise history of a prospective studio site. Hours of 
maximum noise exposure, especially at low frequen- 
cies (for example, from diesel locomotive and aircraft 
passages), are evident. The spectrum of these maxi- 
mum noise events must be known for airborne sound 
insulation design of studio exterior walls, roof, doors, 
and windows. 

Wall noise isolation design. Concrete walls are good 
noise barriers. They may be solid formboard-cast, 
ground-cast and later raised with a crane (tilt-up 
walls), or concrete block (CMU). One side of a CMU 
is plastered to seal joint leakage paths for sound. Solid 
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Fig. 9. Results of a 24-hour noise exposure level survey at 
a planned studio site. L4, L10, and Lso represent the 
C-weighted noise level that was exceeded 1, 10, and 50% 
of the observation period. Dots are aircraft flyover noise 
maximum values. 





TABLE 2. Recommended background noise criteria for 
various rooms 





Noise criterion A-weighted 











Enclosure balanced (NCB) value sound level, dB 
Concert halls 15-20 25-30 
Recording 15-20 25-30 

studios 
Theaters 20-25 30-35 
Homes 25-30 35-40 
Offices 30-35 40-45 
Stores 35-40 45-50 





concrete walls approximately obey the acoustical 
mass law for the transmission line (TL) in decibels 
for a sound of frequency, f, 


TL = 14.5logM + 20 logf — 31 + 20 log cos 0 
= 14.5logM' + 20logf — 41 + 20 log cos 0 


where M is the surface density in 1b/ft? (M! is the 
surface density in kg/m®, 0 is the angle of sound 
incidence. For instance, at 500 Hz (dog f = 2.7) and 
0° angle of sound incidence (log cos 0 = 0), a 6-in.- 
thick (15-cm) solid concrete wall weighing 70 1b/ft? 
dog M = 1.85) or 340 kg/m? (log M! = 2.53) exhibits 
a transmission loss of 50 dB. 

The enclosing floor, walls, and ceiling of studios 
located in buildings housing other activities require 
similar analysis and treatment. Ceiling noise isolation 
is improved by suspending a grid on vibration isola- 
tors that will carry several layers of drywall. The air 
gap so formed is filled with one or more layers of 
sound-absorbing insulation. See ACOUSTIC NOISE. 

Ground vibrations. Studios should be isolated from 
ground vibrations from trains, subways, and indus- 
trial plants. Studios located on upper floor levels 
are subject to vibrations from mechanical equipment 
anywhere in the building such as HVAC, transform- 
ers, and pumps. Ground vibration and its spectral 
content is measured with an accelerometer that is 
either buried in the ground or attached to the af- 
fected floor, preferably over a 24-hour period. A stu- 
dio concrete floor is isolated from these vibrations 
by “floating” it on pads 2-3 in. thick, on coil springs 
of appropriate deflection or on air springs over a 
similar air gap. The resonant frequency of this sup- 
port and gap must be a quarter or less that of the 
lowest audio frequency expected to be reproduced, 
for example, a 10-Hz resonance for a 40-Hz lowest 
reproduced audio frequency. See ACCELEROMETER; 
MECHANICAL VIBRATION; SOUND RECORDING; VIBRA- 
TION ISOLATION. Michael Rettinger; Angelo Campanella 
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Studio Monitoring Design: A Personal View, 1995; 
D. N. Nordantonio, Sound Studio Production Tech- 
niques, 1990; A. Nisbett, The Sound Studio, 6th ed., 
1995; P. White, Creative Recording II: Microphones, 
Acoustics, Soundproofing, and Monitoring, vol. 2, 
1999. 


E 
Sound-reinforcement system 


A system that amplifies or enhances the sound of 
a speaking person, singer, or musical instrument. 
Sound-reinforcement systems are used to increase 
the clarity of the original sound as well as to add loud- 
ness and reverberation. Those that improve loudness 
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and clarity are often called public address systems, 
and those that improve reverberation characteristics 
are called reverberation enhancement systems. Sys- 
tems for increased loudness are used when listeners 
are located far from the sound source, as in a large 
audiorium or outdoors. Systems for increased clar- 
ity are used to overcome the effects of unwanted 
noise or excessive reverberation in a room (such 
as concert halls and churches with reverberation 
characteristics designed for music). Reverberation 
enhancement systems are used to add sound field 
components, such as reverberation to rooms hav- 
ing unsuitable levels of direct, early reflected sound 
and reverberant sound or having short reverberation 
times. See ARCHITECTURAL ACOUSTICS; REVERBERA- 
TION; SOUND; SOUND FIELD ENHANCEMENT. 

Basic elements. The basic elements of a sound- 
reinforcement system are (1) one or more micro- 
phones, (2) amplification and electronic signal pro- 
cessing systems, and (3) one or more loudspeakers. 
The signal processing equipment may include level 
controls, equalizers or tone controls, systems for pre- 
vention of howl and ringing, delays and reverber- 
ators, as well as frequency dividing and crossover 
networks. 

Figure 1 illustrates the signal flow diagram of a 
simple system. In the simplest systems, all elements 
are joined as one physical unit. An example of a sim- 
ple public address system is the hand-held battery- 
driven electronic megaphone, consisting of a micro- 
phone, a frequency-response-shaping filter, an ampli- 
fier, and a horn loudspeaker (Fig. 2). Typical use of 
the megaphone by police and others is to address a 
crowd of people. In order to obtain high sound levels 
in combination with high intelligibility, the frequncy 
response of a megaphone is limited to save battery 
power. This may result in poor sound quality. 
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Fig. 1. Basic sound-reinforcement system. 
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Fig. 2. Electronic megaphone, with all the basic elements 
of a sound system in one package. 
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Fig. 3. Polar directivity patterns of microphones. 


The “amplified speaker pulpit” is a stationary and 
only slightly more complex system. It is a sound- 
reinforcement system built into a pulpit. Because 
the system is stationary (and most system parame- 
ters are known to the designer), it can be tailored to 
give a very good and natural sound quality. Since the 
loudspeaker is close to the talker, there is no shift in 
apparent source location. 

Most sound-reinforcement systems, however, are 
assembled from individual elements to optimize and 
adapt the system performance for the needs of a par- 
ticular installation. 

Microphones. Microphones often use dynamic, con- 
denser, or electret types of transducers, and may 
have various polar directivity patterns such as omni- 
directional, unidirectional, or bidirectional (Fig. 3). 
In some microphones, it is possible to switch the 
pattern by remote control. Microphones for use on 
stage floors, boardroom tables, and so on are bound- 
ary microphones with special frequency response 
characteristics, and their use can reduce or elimi- 
nate phase cancellation effects from reflections. See 
ELECTRET TRANSDUCER; TRANSDUCER. 

Condenser microphones require a preamplifier 
close to the microphone element. They may be per- 
manently polarized (electret) or require a polarizing 
voltage. In either case, dc power must be sent to the 
preamplifier, usually positive on the signal line pair 
and negative on the shield. See ELECTRET; PREAMPLI- 
FIER. 

Digital microphones use an analog transducer, but 
integrate analog-digital converters and other compo- 
nents to offer improved possibilities for signal identi- 
fication, cabling, and routing, making them less sus- 
ceptible to noise and hum. See ANALOG-TO-DIGITAL 
CONVERTER. 

Radio links may substitute for wire to connect sys- 
tem components. The most familiar radio link is wire- 
less microphone. Wearing such a microphone on the 
body permits the user to move around freely. The 
wireless microphone usually consists of a miniature 
omnidirectional microphone (either electret con- 
denser or dynamic type) worn close to the neck 
on the outside of clothing; a compact FM transmit- 
ter, worn inside clothing; an antenna, usually a wire 
worn along a leg; and a receiver located in a fixed 
position convenient both for connecting to the rest 
of the system and for obtaining unobstructed radio 
pickup from the transmitter (Fig. 4). Miniature wire- 
less microphones may be worn on the head and hid- 


den by hair. See FREQUENCY-MODULATION RADIO; MI- 
CROPHONE. 

Signal control equipment. Control and signal process- 
ing equipment may range from a single volume con- 
trol in a preamplifier to complex control boards 
similar to those used in recording studios, with pos- 
sibly hundreds of input channels, complex equaliz- 
ers for every input channel, feedback control equip- 
ment, reverberation generation equipment and delay 
units, matrixed outputs, and so forth. Modern large 
and complex systems feature preset automatic digital 
control, with the audio signals determining real-time 
settings. See SOUND-RECORDING STUDIO. 

Delay units use analog-digital conversion, digital 
delay, and digital-analog conversion to delay an input 
signal. Reverberation may be generated by using fi- 
nite or infinite impulse response filters, correspond- 
ing to the addition of many delayed sound compo- 
nents. Finite impulse response filters use an assembly 
of delays, and can be based on natural (measured) or 
calculated room impulse responses. In infinite im- 
pulse response filters, some of the delay output sig- 
nals are fed back to the delay inputs and added in spe- 
cial patterns to reduce coloration in the generated 
reverberation. Power amplifiers may range in output 
from 10 watts to thousands of watts, and a system 
may have only one power amplifier or hundreds of 
them. See ACOUSTICS; AMPLIFIER; POWER AMPLIFIER. 

Because sound-reinforcement systems add energy 
to the sound field in a room, there is a possibility 
that some frequencies will be particularly empha- 
sized due to in-phase feedback of sound to the micro- 
phone(s). At a critical gain setting, the system gain 
will be sufficient to let the system start to oscillate 
(feedback). The gain-before-feedback is a measure 
of the loudness the system can produce, with refer- 
ence to the loudness of the live sound source, before 
the system starts to oscillate. At lower gain settings, 
close to oscillation, the system will sound “colored” 
because sound at some frequencies is exaggerated. 
The feedback also creates reverberation, that is, pro- 
longs the decay time of the signal. These effects are 
subjectively annoying. Feedback control equipment 
may be based on techniques such as (1) adaptive 
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Fig. 4. Elements of a wireless microphone. 
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Fig. 5. Loudspeakers used in sound-reinforcement systems. (a) Direct-radiator loudspeaker in a cabinet. (6) Multiple direct 
radiators in a vertical line array or column. (c) Straight-axis horn loudspeaker. (d) Folded or reentrant horn loudspeaker. 

(e) Two-way horn loudspeaker system with a single high-frequency straight directional horn and a separate low-frequency 
horn enclosure with one or two bass loudspeakers. 


notch filters, that is, very sharp (narrow-band) rejec- 
tion filters, (2) frequency shifting, or (3) time-varying 
delay or amplification. All these techniques may in- 
troduce audible artifacts into the sound if not prop- 
erly used, but may add up to approximately 6 dB of 
gain in a system. See GAIN; LOUDNESS. 

Loudspeakers. Loudspeakers may be direct radiators 
or horn-loaded drivers. The direct radiators are 
usually cone-type loudspeakers, but are sometimes 
bending-wave-transducer, electrostatic, or other 
unconventional types. Horn loudspeakers use the 
reentrant horn for simple low-quality paging; diffrac- 
tion horns for somewhat better-quality paging; front- 
loaded directional horns of various types, includ- 
ing the constant directivity horns (which attempt 
to provide the same coverage characteristics over a 
wide frequency range and which have largely dis- 
placed the older multicellular and radial horns in 
high-quality applications); and horn bass enclosures, 
which often employ folds or bends for compactness, 
or else supplement the horn loading with ports to 
bring up bass response (Fig. 5). 

A particular loudspeaker system may consist of a 
single-radiator loudspeaker, a series of direct-radiator 
loudspeakers in either horizontal or radial lines (ver- 
tical line-source loudspeaker systems are often called 
column loudspeakers), combinations of direct radi- 
ators for bass radiation and horn loudspeakers for 
treble radiation, or large enclosures with large low- 
frequency loudspeakers, used with high-frequency 
horns (Fig. 5). Some systems used for high-level mu- 
sic reinforcement divide the frequency range into 
four parts, with subwoofers for the lowest bass 
frequencies, a second set of bass loudspeakers for 
midbass frequencies, midrange constant-directivity 
horns, and supertweeters for the extreme high- 
frequency end of the audio spectrum. Each of these 


loudspeakers may be driven by separate amplifiers, 
which are fed by a four-way crossover network (fre- 
quency dividing network). The use of separate am- 
plifiers for different portions of the sound spec- 
trum permits the lowest distortion and the greatest 
flexibility for adjustment. See LOUDSPEAKER; SOUND- 
REPRODUCING SYSTEMS. 

Loudspeaker placement and selection. Fixed sound 
systems are of two basic types: central and dis- 
tributed. The central system employs loudspeakers 
reasonably close to the source of live sound (but 
not so close that feedback is an unsolvable prob- 
lem), while the distributed system has many loud- 
speakers fairly close to the audience. For central sys- 
tems to work properly, the performing area must be 
defined, as in a typical auditorium or concert hall. 
Spaces where the sound source may be located any- 
where, such as in hotel ballrooms where the head 
table might be along any wall, are usually best served 
by some type of distributed system. 

Central systems. The most common type of central 
system employs a loudspeaker or loudspeaker sys- 
tem (sometimes called a cluster) directly over the 
front of the stage, with the directional characteristics 
of the loudspeaker or system chosen to cover the au- 
dience with good uniformity without directing too 
much energy toward the stage. Such a system is usu- 
ally the first choice for most sound system designers 
because it can be installed most simply and usually 
provides the greatest intelligibility and naturalness 
(Fig. 6). 

Variations on the central system include the use 
of vertical column loudspeakers on each side of the 
stage, the use of a horizontal line source along the 
top of the stage, and the use of directional horn loud- 
speakers on each side of the stage. All these solu- 
tions have their own peculiar problems. Any of the 
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Fig. 6. Central cluster loudspeaker systems. (a) System in a theater. The constant-coverage high-frequency horns are below 
the supporting frame, and a two-loudspeaker straight-horn low-frequency enclosure is above (courtesy of Amir Pishdad). 
(6) System for a gymnasium. Ten multicellular high-frequency horns are below two double-loudspeaker straight-horn bass 


enclosures. 


split-source methods (with loudspeakers on each 
side of the stage) can result in alternate zones of 
cancellation and reinforcement near the centerline 
of an auditorium. Modern column loudspeakers may 
offer steerable array technology, using digital signal 
processing so that the radiation pattern at various 
frequencies can be tailored to give optimal coverage 
over a wide area. 

Distributed systems. The most common type of dis- 
tributed system employs loudspeakers mounted in 
the ceiling, usually directed straight down. Less fre- 
quently used variations include loudspeakers in seat 
backs (the “pew-back” approach, most often used in 
churches) or in building columns. Horizontal rows 
of closely spaced loudspeakers have been used from 
one side of a theater to the other above the prosce- 
nium, as well as down the length of the apex of the 
ceilings of Mormon churches for reasonably uniform 
coverage in spaces without excessive reverberation. 
Loudspeakers in side walls are best used for adding 
reverberance in a room, and for surround sound and 
specific effects in cinema and theater. Low ceilings 
and chandelier systems require wide-dispersion 
loudspeakers, usually about 100-mm-diameter 
(4-in.) or larger coaxial (small high-frequency cone 
and larger bass cone) loudspeakers; however, above 
10 m @3 ft) height directional horns are often 
required. 

Reduction of reverberation. With proper placement and 
selection of loudspeakers, sound systems in live 
rooms can actually reduce the apparent reverbera- 
tion by concentrating the amplified sound energy 
into the sound-absorbing audience or seats, with 
minimum energy directed to wall and ceiling sur- 
faces. The constant-directivity horns have particular 


applications in central systems used for this purpose. 
Such systems can provide high speech intelligibility, 
with naturalness, in highly reverberant spaces that 
have poor speech acoustics. 

Enhancement of reverberation. In a system consisting of 
many sound reinforcement channels, one can ad- 
just the gain so that each channel has low gain and 
coloration. The combined effect of the system will 
yield a prolonged reverberation time over the entire 
operating frequency range of the channels. Systems 
with up to a hundred channels are sometimes used 
for reverberation enhancement purposes in rooms 
perceived to be acoustically “dead”. The subjective 
perception of reverberation may also be enhanced 
by adding reverberation into the signals using a re- 
verberator in the signal-processing system. Unless 
this reverberation is radiated incoherently by the use 
of many loudspeakers in different directions from 
the listener, the reverberation will be insufficiently 
diffuse and inadequate. Sometimes reverberation is 
added selectively to a singer’s voice to enhance the 
quality. 

Signal processing. In addition to proper loud- 
speaker placement and selection, the design of a 
sound-reinforcement system with the desired char- 
acteristics requires the use of signal-processing tech- 
niques such as signal delay and equalization. 

Signal delay. Digital signal delay equipment is now 
often used with distributed systems. This equipment 
places the electronic audio signal in a memory de- 
vice, producing it in time for the appropriate loud- 
speakers to be synchronized with the live sound from 
the performance area. 

Haas or precedence effect. Use of this effect can con- 
tribute greatly to naturalness. If the live signal from 


a talker or singer is heard first and the amplified sig- 
nal arrives up to 25-30 milliseconds later and is not 
more than 10 dB louder, the listener will determine 
the direction of the composite signal as coming from 
the talker or singer. That is, the listener will not be 
aware that the amplified signal is coming from a dif- 
ferent direction. This effect can be used with central 
systems by proper location of the loudspeakers. In 
some central systems and most distributed systems, 
use of the effect requires both proper loudspeaker 
placement and proper selection of delays. 

Equalization. The equalization process can also con- 
tribute to the naturalness of a sound system and 
to its gain-before-feedback in many situations. Be- 
fore equalization, almost every sound system ex- 
hibits some frequency response irregularities that 
are a departure from an ideal frequency response 
curve. In the past, such irregularities (due to deficien- 
cies in either the room or the loudspeakers) were 
accepted. Now, broadband or narrowband equaliza- 
tion usually is applied to a sound system to ensure 
both the smoothest possible response and adequate 
gain-before-feedback. Various types of equalizers and 
frequency response analysis equipment are used in 
the equalization process. The most widely used ap- 
pear to be octave-band or third-octave-band equaliz- 
ers that reduce (or sometimes boost) the audio elec- 
trical signal in adjacent bands, and audio analyzers 
that display the relative acoustical signal level in the 
same bands via a visual readout. Such equipment is 
often used with a “pink” noise generator, giving a ran- 
dom electrical signal that has the same signal level in 
each band, during the analysis and adjustment pro- 
cess. In some cases, the equalization is supplemented 
by the use of (adaptive) notch filters to prevent os- 
cillation. Equipment in digital mode can continually 
analyze signals to provide real-time adaptive equal- 
ization. See EQUALIZER. 

Time-energy-frequency analyzers. Both the setting of cor- 
rect signal delays and proper equalization can benefit 
from the use of analyzers, which display the energy 
of a signal as a function of time and frequency. Simple 
audio analyzers present the total sound energy arriv- 
ing at the microphone with the sound system excited 
by a continuous random-noise generator. Modern an- 
alyzers, however, excite the system with a particular 
pulse, a pseudo-random noise, or a pure-tone signal 
swept in frequency. The analysis equipment uses in- 
formation about the signal emitted to calculate the 
response as a function of time and frequency. This 
technique allows the time alignment of two separate 
signals to set delays and locate loudspeakers, and 
allows independent evaluation of the frequency re- 
sponse of the direct signal (the signal from the loud- 
speaker without reflections) and the various compo- 
nents of room sound (the reflected energy). There 
are situations where equalization of the direct sound 
is preferred to equalization of the total sound field. 
Even with modern equipment, the ear is the final 
judge of any equalization scheme. 

Power requirements. The size of the power ampli- 
fiers needed for an indoor sound system depends on 
the acoustics of the room served, the sound pressure 
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levels needed (much less for a speech-reinforcement 
system than for a music system), and the efficiency 
of the loudspeakers used. Direct-radiator loudspeak- 
ers may be only 1% efficient in converting electrical 
power to acoustical power, while certain types of 
horn loudspeakers may be as high as 25% efficient. 
Generally, low-frequency loudspeakers are less effi- 
cient than high-frequency components in a multiway 
system; thus, if electronic crossovers are used, larger 
amplifiers are needed for low frequencies than for 
high frequencies. A small church might use only 30- 
50 W of power amplification for speech reinforce- 
ment, while a small night club with less efficient 
loudspeakers and higher sound levels might require 
several thousand watts. Temporary outdoor systems 
set up for rock concerts may run into the hundreds of 
thousands of watts, as extremely high levels are pro- 
duced over long distances for hundreds of thousands 
of listeners. Electronic reverberation and surround 
systems, particularly for organ and for orchestra, re- 
quire tremendous amounts of bass power and large 
areas of bass-loudspeaker radiating surface to sound 
convincing. This also holds true for all outdoor music 
reinforcement. See SOUND PRESSURE. 

Outdoor systems. Outdoor systems (Fig. 7) are sim- 
ilar to indoor systems, without the reflections that 





Fig. 7. Cluster of high-frequency horns at the base of the 
loudspeaker tower at a stadium, ensuring even coverage 
and the projection of sufficient sound power to produce 
100-dB levels 700 ft (210 m) away at home plate. The 
mouths of the horns are protected by mesh. 
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Fig. 8. Large control console used for both reinforcement and recording at a theater. The 
window enables the operator to hear the sound in the theater. (Courtesy of Amir Pishdad) 


result from interior room surfaces. (The reflections in 
indoor systems produce reverberation, which can be 
a negative or positive effect, depending on the nature 
of the program. Reflections also can result in greater 
uniformity of sound pressure level throughout the 
audience area to be covered.) Outdoor systems may 
have to cope with such ambient noise problems as 
aircraft flyovers, nearby rail lines or highways, and 
factory noise. This limits the dynamic range, or range 
of loudness, from outdoor systems such that the soft- 
est passages of a musical work must be brought up in 
loudness to override intruding noise, while the loud- 
est passages cannot be so loud as to bother some 
listeners. 

Care must be taken in outdoor facilities to pre- 
vent the sound system from becoming a nuisance 
to the community. For example, outdoor rock con- 
certs should not take place in the midst of pop- 
ulated areas. The use of directional loudspeakers 
(such as constant-directivity high-frequency horns), 
the proper placement and orientation of these loud- 
speakers, and the use of architectural barriers (such 
as walls and berms) can minimize the annoyance cre- 
ated by the use of an outdoor sound system. 

System development. Development of a successful 
sound-reinforcement system begins with analysis of 
the uses of the system: just who or what requires 
reinforcement, and what is the spatial relationship 
of the live sound source with respect to the audi- 
ence. Next comes the acoustical, architectural, and 
electronic design, including choice of a central or 
distributed system, locations for loudspeakers, their 
orientation, choice of microphones, preparation of 
a signal-flow chart which is developed into a line di- 
agram for the electronic portion of the system, and 
proper selection of a sound control position if one 
is necessary. 

The design of a system may be greatly simplified 
through the use of acoustic simulation by computer 
software—based on the image source or ray trac- 


ing methods—to optimize the design with respect 
to loudspeaker directional characteristics, room ge- 
ometry, and acoustic conditions. Using auralization 
(audible sound field simulation based on such simula- 
tion), it is possible to listen binaurally to a simulation 
of the sound-reinforcement system’s performance in 
advance of its being installed. See BINAURAL SOUND 
SYSTEM; VIRTUAL ACOUSTICS. 

Often the sound-reinforcement installation must 
be put to competitive bidding, and this process re- 
quires detailed specifications indicating the overall 
performance expected from the system and from 
critical individual components, the system’s func- 
tional requirements, wiring practices, details of in- 
stallation, documentation, and so forth. This leads to 
bid review by an impartial expert, who may have de- 
signed the system. Careful installation is necessary, 
followed by adjustment and equalization, in turn fol- 
lowed by acceptance tests by the owner’s represen- 
tatives. Instruction of operators takes place at this 
time and is critical to the performance of the sys- 
tem. Good service can keep a high-quality system in 
operation for periods of 20 years or more. Usually 
service is best provided by the installing contractor. 

System control. The sound system operator should 
be located where it is possible to hear the same mix 
of live and amplified sound as heard by the audience. 
Backstage is a poor location for the controls of a the- 
ater sound-reinforcement system. Giving the opera- 
tor a monitor loudspeaker in the control booth, or 
even a “dummy head” (mannequin with two “ear” 
microphones) feeding headphones, can never be as 
good as a large open window in a correctly located 
control booth (Fig. 8). The use of binaural technol- 
ogy may help in controlling the sound in a large 
venue by the use of many dummy heads. Many per- 
forming arts centers have cockpits in the middle of 
the audience area, placing the sound system opera- 
tor directly in the audience. Compromise positions 
at the rear or at the side are more frequent and can 
also work. A frequent control position in churches is 
at a balcony rail. 

Several types of automatic mixers have been devel- 
oped. These devices recognize which microphone 
is being addressed, increase the amplification from 
that microphone while suppressing the signal from 
others, and reduce the gain appropriately when two 
microphones are being addressed to keep the sys- 
tem from feeding back. The best of these devices 
also provide overall gain control, reducing the am- 
plification from the loudest voices while bringing up 
amplification from the softest. For a simple church 
speech-reinforcement system, one of these devices 
can replace a human operator with good results. As 
sophistication of the equipment and techniques of 
use have improved, automatic mixers in theaters as- 
sist, but do not replace, the human operator. 

David Lloyd Klepper; Mendel Kleiner 
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Systems that attempt to reconstruct some or all of 
the audible dimensions of an acoustic event that 
occurred elsewhere. A sound-reproducing system 
includes the functions of capturing sounds with 
microphones, manipulating those sounds using elab- 
orate electronic mixing consoles and signal proces- 
sors, and then storing the sounds for reproduction 
at later times and different places. 

The design of audio products has been progres- 
sively more scientific, thanks to a better understand- 
ing of psychoacoustics, the relationship between 
what we measure and what we hear. Such investi- 
gations can be productive only if it is possible to 
elicit consistent opinions from individual listeners 
and agreement among numbers of different listeners. 
Although some with deteriorated hearing certainly 
do “hear differently,” the majority of listeners, given 
some practice and the opportunity to form opinions 
about sound in circumstances free from biases (such 
as seeing the products, loudness differences, and 
variable program sources), are remarkably consistent 
judges of sound quality. See PSYCHOACOUSTICS. 

When opinions obtained in a controlled manner 
are studied, it is found that they correlate strongly 
with certain measurable quantities. Of all audio com- 
ponents, electronic devices are the closest to tech- 
nical perfection, exhibiting the smallest audible dif- 
ferences. Transducers present many more problems. 
Microphones, which convert sound into an electri- 
cal signal, exhibit fewer problems than loudspeak- 
ers, which do the reverse. This is because micro- 
phones are relatively simple devices, and they are 
small compared to most sound wavelengths. Loud- 
speakers must be physically larger in order to gen- 
erate the required sound levels. Because the sound 
interacts with the acoustical space (a room or auto- 
mobile interior, for example), as it propagates to the 
ears, reflected sounds modify and sometimes dom- 
inate the audible impressions. The acoustical space 
is the final component in a sound-reproduction sys- 
tem. See ARCHITECTURAL ACOUSTICS; LOUDSPEAKER; 
MICROPHONE; TRANSDUCER. 

The original event for most recordings occurs in 
the control room ofa recording studio or the dubbing 
stage of a film studio, where the artistic decisions 
are made about what is to be stored on disk or tape. 
These decisions are made by listening through loud- 
speakers in rooms. The loudspeakers and room used 
in capturing the sound influence what is recorded. If 
these are significantly different from those used dur- 
ing the reproduction, the sound will not be heard as 
it was created. As there are no universal standards 
governing the design and use of loudspeakers and 
rooms, there are substantial variations in the qual- 
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Fig. 1. Loudspeakers influence the sound of recordings 
while they are being made, and then those recordings are 
used to evaluate other audio products, including 
loudspeakers. Subjective evaluation, used alone, is a 
flawed process. 


ity of recordings, as well as how they sound when 
reproduced through different loudspeakers in differ- 
ent rooms. This is a problem for a sound-reproducing 
system (Fig. 1). See SOUND-RECORDING STUDIO. 

Technical variables. Sound-reproduction systems 
deal not only with the sounds of live performers but 
also with synthesized sounds. The same system can 
reasonably be expected to do justice to the nuances 
of chamber music, the dynamics and drama of a sym- 
phony, and World War II gunfire reenacted on film. 
The following technical variables are relevant to the 
reliable reconstruction of audio events. 

Frequency range. Traditionally the audible frequency 
range has been considered to be 20 Hz to 20 kHz. 
However, lower frequencies can generate interesting 
tactile impressions, and it is now argued by some 
that higher frequencies (even 50 kHz and beyond) 
add other perceptual nuances. Whether these fre- 
quencies are significant, or even exist, is a matter 
of current debate. Most scientific data indicate that 
only exceptional persons, in the prime of youth, re- 
spond to sounds beyond about 20 kHz. See HEARING 
(HUMAN). 

Dynamic range. In humans, this is the range of sound 
level from the smallest audible sound to the largest 
sound that can be tolerated. In devices, it is the 
range from background noise to unacceptable dis- 
tortion. The contours of equal loudness show that 
from the threshold of audibility to the onset of dis- 
comfort is about 120 decibels at middle and high fre- 
quencies. In reality, much less than this is required 
to give a satisfactory impression of accurate repro- 
duction. When recording a live performance, micro- 
phones are not a limiting factor since the best have 
dynamic ranges in excess of 130 dB. In the record- 
ing studio, dynamics are manipulated by electronic 
gain compensation, so the ultimate limitation is the 
storage medium. For example, a 16-bit compact disk 
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Fig. 2. Some important reference sound levels relating to what is recorded and what is heard. 


(CD) can exceed 100 dB signal-to-noise ratio, while 
24-bit digital media can exceed any reasonable need 
for dynamic range. Background acoustical noise in 
studios and concert halls sets the lower limit for 
recordings, just as background noise in homes and 
cars sets a lower limit for playback. Quiet concert 
halls and homes are around 25 dB sound pressure 
level (SPL) at middle and high frequencies. Crescen- 
does in music and movies approximate 105 dB, giv- 
ing a working dynamic range of about 80 dB for 
fussy listeners and less for casual listeners. Today, 
this is no challenge for electronics, and it is not 
even difficult for loudspeakers in homes and cars, 
except perhaps at very low frequencies where a lot 
of air must be moved to fill a large room (Fig. 2). 
See ACOUSTIC NOISE; COMPACT DISK; DECIBEL; DIS- 
TORTION (ELECTRONIC CIRCUITS); GAIN; LOUDNESS; 
SIGNAL-TO-NOISE RATIO; SOUND; SOUND PRESSURE. 
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Fig. 3. Just-audible deviations in a measured frequency response when the program 
material is a complex orchestration in a reverberant space. From left to right are shown 
deviations caused by a high-Q, a medium-Q, and a low-Q resonance. The magnitudes of 
the deviations change little as a function of where they are located on the frequency scale. 


Linear distortions—amplitude/frequency response. Varia- 
tions in amplitude and phase as functions of fre- 
quency are known as linear distortions, since (ide- 
ally) they do not vary with signal level. In practice 
they often do, and in the process generate nonlinear 
distortions. In terms of their importance in preserv- 
ing the timbre of voices and musical instruments, 
the amplitude versus frequency characteristic, 
known commonly as the frequency response, is 
the dominant factor. Humans are very sensitive 
to variations in frequency response, the amount 
depending on the bandwidth of the variation. 

It is convenient to discuss this in terms of the qual- 
ity factor (Q) of resonances, since most such vari- 
ations are the result of resonances in loudspeaker 
transducers or enclosures. High-O resonances (such 
as Q = 50) have a narrow bandwidth; they are very 
frequency specific. Such resonances are just audi- 
ble when their deviation from an otherwise flat fre- 
quency response is about 10 dB fora sound that is not 
very revealing (nonreverberant, multimiked pop or 
jazz), to something less for complex orchestrations 
in reverberant spaces, to about 3 dB for pink noise. 
As the Q drops (wider bandwidth), so do the ampli- 
tudes at which the resonances are audible. A broad- 
band OQ of 1 is just audible at an amplitude deviation 
of about 0.3 dB with pink noise (Fig. 3). So, when 
we speak of frequency responses having deviations 
of +/— so many dB, over a frequency range, such a 
statement is meaningless unless there is a curve to 
show the bandwidth of the variations; a narrow spike 
can be much larger than a broad bump, for the same 
audible effect. Critical listeners report that they like 
the sound of devices with flat, smooth, broadband 


frequency responses. See DISTORTION (ELECTRONIC 
CIRCUITS); Q (ELECTRICITY); RESONANCE (ACOUSTICS 
AND MECHANICS); RESPONSE; REVERBERATION. 

Linear distortions—phase/frequency response. A pattern of 
phase shift accompanies the amplitude variation 
caused by a resonance. We can assume that this visual 
clue indicates something potentially audible. How- 
ever, there is no clear evidence that phase shift, by 
itself, is an audible problem. Since both amplitude 
and phase responses must be linear to allow accu- 
rate waveform reproduction (Fourier transform), the 
conclusion is that the reproduction of accurate wave- 
forms is not a requirement for good sound. The re- 
ality of everyday listening is that we hear a combi- 
nation of direct sound and multiple reflections from 
room boundaries and furniture. Moving either the 
head or the sound source can cause huge changes 
in the waveform at the entrance to the ear, yet the 
perception of timbre is hardly, if at all, changed. See 
FOURIER SERIES AND TRANSFORMS; PHASE (PERIODIC 
PHENOMENA). 

Nonlinear distortions. These distortions occur when a 
device behaves differently at different signal levels. 
The waveform coming out of a distorting device will 
be different from the one entering it. The difference 
in shape, translated into the frequency domain, is 
revealed as new spectral components. If the input 
waveform is a single frequency (pure tone), the ad- 
ditional spectral components will be seen to have 
a harmonic relationship to the input signal. Hence 
they are called harmonic distortion. If two or more 
tones are applied to the device, nonlinearities will 
create harmonics of all of the tones (harmonic dis- 
tortion) and, in addition, more new spectral compo- 
nents that are sum-and-difference multiples of com- 
binations of the tones. These additional components 
are called intermodulation distortion. 

It should be remembered that it is the same non- 
linearity that causes all of the distortion, and that the 
measurement methods described here are simply dif- 
ferent ways to quantify the nonlinearity. In reality, we 
listen only rarely to anything resembling a pure tone, 
and only occasionally to multiples of pure tones. We 
listen to much more complicated sounds and, as a re- 
sult, the distortions that result from their interaction 
with the nonlinearity are of many forms, continu- 
ously changing with time. It is therefore impossible 
to predict, from our simple distortion measures, just 
how offensive, or even audible, the unwanted spec- 
tral contamination will be. 

Multiway loudspeaker systems complicate inter- 
modulation distortion measurements because that 
form of distortion occurs only when the multiple 
test signals are reproduced by the same transducer. 
Thus it is possible for a nonlinear system to yield 
similar harmonic distortion data but very different 
intermodulation distortion data, depending on how 
many crossovers there are in the system and the fre- 
quencies at which they occur. For example, a woofer 
that becomes mildly nonlinear at low frequencies 
may be acceptable if it is restricted to frequencies 
below 80 Hz, but annoying if it operates to 1000 Hz 
where important voice and music frequencies can be 
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modulated by simultaneous strong bass signals. 

Perceptually, listeners are aided by a phenomenon 
called masking, in which loud sounds prevent some 
less loud sounds from being heard. This means that 
the music causing the distortion inhibits the ability 
to hear it. Rough guidelines suggest that, in music, 
much of the time we can be unaware of distortions 
measuring in whole percentages, but that occasion- 
ally small fractions of a percent can be heard. The 
only safe guideline seems to be to keep distortion, 
however it is measured, as low as possible. See MASK- 
ING OF SOUND. 

Dispersion, directivity, sound power—of loudspeakers. The 
preceding technical issues apply to both electronic 
devices and loudspeakers. However, loudspeakers 
radiate sound in all directions, so that measurements 
made at a single point represent only a tiny frac- 
tion of the total sound output. In rooms, most of 
the sound we hear from loudspeakers reaches our 
ears after reflections from room boundaries and fur- 
nishings, meaning that our perceptions may be more 
influenced by measures of reflected and total sound 
than by a single measurement, say, on the principal 
axis. 

At low frequencies, common box loudspeakers 
radiate omnidirectionally. At middle and high fre- 
quencies, loudspeakers can be designed to radiate 
in a predominantly forward direction, bidirectionally 
(forward and back) with the transducers in phase 
(the so-called bipole) or out-of-phase (the dipole), 
omnidirectionally, or any of several other multidirec- 
tional possibilities. Many of these designs evolved 
during the era of two-channel stereo, as means of 
enhancing the reflected sound field in rooms, com- 
pensating for the lack of true surround information. 
However, the most popular configuration has been 
loudspeakers with transducers arranged on the front 
surface, facing the listeners. 

To be useful, technical measurements must allow 
us to anticipate how these loudspeakers might sound 
in rooms. Consequently, it is necessary to measure 
sounds radiated in many different directions at points 
distributed on the surface of an imaginary sphere 
surrounding the loudspeaker. From these data, it is 
possible to calculate the direct sound from the loud- 
speaker to the listener, estimates of strong early re- 
flected sounds from room boundaries, and estimates 
of later reflected or reverberant sounds. It is also 
possible to calculate measures of total sound out- 
put, regardless to the direction of radiation (sound 
power) and of the uniformity of directivity as a func- 
tion of frequency. All of these measured quantities 
are needed in order to fully evaluate the potential 
for good sound in rooms. See DIRECTIVITY. 

Power compression of loudspeakers. As more power is de- 
livered to a transducer, the voice coil heats up, the 
resistance increases, and the sound output is lower 
than it should be. This is called power compression. 
In systems with woofers, midranges, and tweeters, 
the different units may heat by different amounts, 
leading to audibly different spectral balance at dif- 
ferent playback levels. Another power-related per- 
formance change has to do with the vent or port in 
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bass-reflex woofer systems. At high sound levels, the 
high air velocity in the port can generate turbulence, 
causing noise and changing the carefully designed 
tuning of the system. Thus loudspeaker systems can 
have quite distinctive sounds at different playback 
levels. See AERODYNAMIC SOUND. 

System configurations. In the earliest systems, 
there was monophonic sound (monaural means one 
ear, not one channel). In the 1930s, Bell Telephone 
Laboratories published research results indicating 
that, to successfully preserve the “auditory perspec- 
tive” of a frontal sound stage, there were two prac- 
tical techniques: binaural (meaning two ears) and 
multichannel. 

Binaural. This refers to a sound-reproduction tech- 
nique in which a dummy head, equipped with 
microphones in the ear locations, captures the orig- 
inal performance. Listeners then audition the repro- 
duced performance through headphones, with the 
left and right ears hearing, ideally, what the dummy 
head “heard.” The system is good, but flawed in 
that sounds that should be out in front (usually the 
most important sounds) tend to be perceived to be 
inside, or close to, the head. Some laboratory 
exercises claim to have solved the problem, but com- 
mon experience is different. Addressing this limita- 
tion, systems have been developed that use two loud- 
speakers, combined with signal processing to cancel 
the acoustical crosstalk from the right loudspeaker 
to the left ear, and vice versa. The geometry of the 
loudspeakers and listener is fixed. In this mode of lis- 
tening, sounds that should be behind are sometimes 
displaced forward, but the front hemisphere can be 
very satisfactorily reproduced. See BINAURAL SOUND 
SYSTEM; EARPHONES; VIRTUAL ACOUSTICS. 

Multichannel audio. This began with stereophonic sys- 
tems using two channels, not because two was an op- 
timum number, but because in the 1950s the technol- 
ogy was limited to one modulation for each groove 
wall of a record. Bell Labs, many years before, had 
recommended many channels and, as a practical min- 
imum, three (left, center, and right) or, for a single 
listener, two. Good stereo imaging is possible only 
for listeners on the axis of symmetry, equidistant 
from both loudspeakers. At its best, it has been very 
enjoyable. Stereo auditioned through headphones is 
much less realistic. See MODULATION; STEREOPHONIC 
SOUND. 

Quadraphonic systems. These appeared in the 1970s, 
and added two more loudspeakers behind the listen- 
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Fig. 4. Evolution of multichannel audio. (a) Two-channel stereophonic reproduction. 
(b) Quadraphonics. (c) 5.1-channel “home theater.” The larger loudspeaker symbol is a 
subwoofer. (d) 7.1-channel multichannel movie and music entertainment system. 


ers, mirroring the ones in front. The most common 
systems exhibited generous interchannel crosstalk 
or leakage. To hear the proper spatial perspective, 
the preferred listening location was restricted to 
front/back as well as left/right symmetry. The lack of 
a center channel was a serious omission. The failure 
to agree on a single standard resulted in the system’s 
demise. 

Dolby Surround. For film sound applications, Dolby 
modified the matrix technology underlying one of 
the quadraphonic systems, rearranging the four ma- 
trixed channels into a left, center, right, and sur- 
round configuration. In cinemas, the single, limited- 
bandwidth, surround channel information is sent to 
several loudspeakers arranged along the side walls 
and across the back of the audience area. For home 
reproduction, the surround channel is split between 
two loudspeakers placed above and to the sides of 
the listeners. To enhance the spatial effect of the sin- 
gle surround channel, some systems introduced elec- 
tronic decorrelation between the two loudspeakers, 
some used multidirectional loudspeakers in the sur- 
round channels, and some did both. 

Digital discrete multichannel systems. Even with the best 
active (electronically steered) matrix systems, the 
channels are not truly discrete. Sounds leak into 
unintended channels where they dilute and distort 
directional and spatial impressions. Several compet- 
ing digital recording systems now provide five dis- 
crete full-bandwidth channels, plus a low-frequency 
special effects channel for movies with truly big 
bass sounds. For homes, 5.1 channels refers to five 
satellite loudspeakers, operating with a common 
low-frequency subwoofer. The subwoofer channel is 
driven through a bass-management system that com- 
bines the low frequencies of all channels, usually 
crossing over at 80-100 Hz so that, with reasonable 
care in placement, it cannot be localized. 

Movies provided the motivation for multichannel 
audio in homes, but music may be where it offers its 
greatest opportunities. Movies tend to limit the use 
of off-screen sounds mainly to vague spatial effects, 
ambient music, reverberation, and carefully rationed 
directional effects. Music, on the other hand, is not 
constrained. Traditional acoustical music takes ad- 
vantage of the greater spatial realism that is possible 
with more channels. Modern music uses the addi- 
tional channels to surround listeners with moving 
and static sounds, instruments, and voices. Listeners 
may find these new directional and spatial effects var- 
iously stimulating and annoying. At least now, with 
more channels and more loudspeakers, what we hear 
at home and in our cars is likely to more closely re- 
semble what was created in the studio. See MUSICAL 
ACOUSTICS. 

The 5.1 channels do not satisfy all listeners. The 
configuration of three across the front and two at 
the side/rear locations originated with films. It as- 
sumed that all listeners would face a screen. Music 
listeners are less disciplined in where they sit and in 
which direction they face. More channels improve 
the situation and open more creative opportunities 


(Fig. 4). 


System components—electronics. The number of signal 
sources is large and growing. The control device of 
any multichannel system must accept analog and dig- 
ital inputs from several stereo and multichannel stor- 
age devices and from real-time delivery systems, in- 
cluding terrestrial broadcasting, cable, satellite, and 
the Internet. Mass storage options now include “juke 
boxes” holding hundreds of CD’s and DVD’s, as well 
as computerlike hard drive storage for compressed 
audio and video. To this complexity must be added 
the variety of encoding/decoding options, including 
different lossy and lossless digital compression al- 
gorithms for multichannel audio and digital video. 
Confusion is common. These are all digital devices 
and, with enough intelligent programming, a com- 
prehensible user interface should make it all work- 
able. Today’s audio/video receivers are much more 
complicated than the stereo devices of just a few 
years ago, yet they are easy to use. These, and es- 
pecially the stand-alone surround processors, are re- 
ally very powerful computers. Sound quality in elec- 
tronic devices is currently excellent and is getting 
even better. See VIDEO DISK. 

System components—loudspeakers. The one audio com- 
ponent that remains stubbornly evolutionary is the 
loudspeaker. In basic form, cone and dome trans- 
ducers of today look and work remarkably the 
same as those of decades ago. Other variations in 
transducers—electrostatic and electromagnetic pla- 
nar panels, horns, compression drivers, arrays, an 
so on—have been developed, but none has pre- 
vailed. The excellent fidelity, versatility, and cost 
effectiveness of moving-coil cone and dome trans- 
ducers justify their widespread acceptance. Better 
mechanical and acoustical measurements, new engi- 
neering design aids, and more accurate performance 
targets have led to considerable improvements in 
sound quality. New diaphragm materials allow res- 
onances to be better controlled or even eliminated 
by forcing them above the highest frequency re- 
produced by the transducer. Acoustical waveguides 
tailor the directivity of both direct radiators and com- 
pression drivers so that loudspeaker systems radi- 
ate sounds more uniformly at all frequencies. The 
benefit is a greatly improved interaction with rooms, 
large and small, matching the perceived timbral sig- 
natures of the direct, early reflected, and reverber- 
ant sounds. See SOUND FIELD ENHANCEMENT; SOUND- 
REINFORCEMENT SYSTEM. 

The marriage of electronics and transducers is a 
welcome trend, as the performance of the entire sys- 
tem can then be optimized. Designers have more 
scope to perfect the crossovers between transduc- 
ers in multiway systems. Digital techniques can re- 
fine the performance of the transducers by reduc- 
ing residual linear and nonlinear distortions. Built-in 
equalization can compensate for acoustical problems 
specific to the listening room. Supplying the signals 
through a digital connection makes it possible to link 
the loudspeakers into a whole-room or whole-house 
plug-and-play audio/video network. All of these func- 
tions have been demonstrated. It remains to be seen 
which will become commonplace. Floyd Toole 
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Sourwood 


A deciduous tree, Oxydendrum arboreum, of the 
heath family, indigenous to the southeastern section 
of the United States, and found from Pennsylvania to 
Florida and west to Indiana and Louisiana. Sourwood 
is hardy and often cultivated in the north. It is usually 
asmall or medium-sized tree, but it sometimes attains 





(b) 


Sourwood (Oxydendrum arboreum). (a) Branch has 
urn-shaped flowers. (b) Leaf, twig, and enlargement of twig 
showing axial bud and leaf scar are below. 
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a height of 75 ft (23 m) and a diameter of nearly 2 ft 
(0.6 m). The leaves are simple, finely toothed, long- 
pointed, and 4-8 in. (10-20 cm) long, and have an 
acid taste which explains the popular name. They 
turn scarlet in the fall. The flowers are white and urn- 
shaped and grow in long clusters (see illus.). The dry 
fruit remains on the tree through the fall and winter. 
The wood is not used commercially. Sourwood is 
also known as sorrel tree, and it is widely planted as 
an ornamental. See ERICALES; FOREST AND FORESTRY; 
TREE. Arthur H. Graves; Kenneth P. Davis 


South America 


The southernmost of the New World or Western 
Hemisphere continents, with three-fourths of it 
lying within the tropics. South America is approx- 
imately 4500 mi (7200 km) long and at its great- 
est width 3000 mi (4800 km). Its area is estimated 
to be about 7,000,000 mi? (18,000,000 km?). South 
America has many unique physical features, such as 
the Earth’s longest north-south mountain range (the 
Andes), highest waterfall (Angel Falls), highest navi- 
gable fresh-water lake (Lake Titicaca), and largest ex- 
panse of tropical rainforest (Amazonia). The western 
side of the continent has a deep subduction trench 
offshore, whereas the eastern continental shelf is 
more gently sloping and relatively shallow. See CON- 
TINENT. 

Regional characteristics. South America has three 
distinct regions: the relatively young Andes Moun- 


tains located parallel to the western coastline, the 
older Guiana and Brazilian Highlands located near 
the eastern margins of the continent, and an ex- 
tensive lowland plains, which occupies the central 
portion of the continent. The regions have distinct 
physiographic and biotic features. 

The Andes represent the highest and longest 
north-south mountain system on Earth. Altitudes 
often exceed 20,000 ft (6000 m) and perpetual 
snow tops many of the peaks, even along the Equa- 
tor (Fig. 1). So high are the Andes in the northern 
half of the continent that few passes lie below 
12,000 ft 600 m). Over most of their length the 
Andes are not just a single range but two or three 
parallel ranges. Within the parallel peaks lie a vast 
series of intermontane basins and plateaus. Ecuador 
contains a string of 10 such basins. Bolivia has the 
Altiplano, an extensive basin about 400 mi (640 km) 
long and up to 200 mi G20 km) wide that is al- 
most entirely surrounded by rugged and lofty 
peaks. The Altiplano is cold and high, averaging 
12,000 ft 600 m), and is mostly level. The Alti- 
plano is distinctive in that it possesses the highest 
navigable fresh-water body, Lake Titicaca, measuring 
approximately 3488 mi? (8965 km?) at 12,497 ft 
(3750 m) elevation and approximately 918 ft (275 m) 
in depth. 

On the northeastern and eastern continental pe- 
riphery lie the Guiana and the Brazilian Highlands. 
These vast areas of hilly uplands and low mountains 
are separated from each other by the Amazon River 
drainage (Fig. 2). Together these two plateaus form 





Fig. 1. Andes Mountains, Chile. The Andes, which parallel the Pacific coast, constitute one of the Earth’s most majestic 
features. Only the Himalayas exceed them in elevation and mass. (United Press International) 





Fig. 2. Representative view of the hilly uplands and low mountain country of the Guiana Highlands. One of five waterfalls on 
the Canaima River, Gran Sabana in southern Venezuela. (Venezuelan Information Service) 


the geologically oldest part of South America. Rocks 
of ancient igneous and metamorphic origin are par- 
tially covered by younger sedimentary beds and sand- 
stones. The vegetation and animal life associated 
with the flat-topped mesas of the highlands, called 
tepuis, show a high proportion of endemicity (in-situ 
speciation) resulting from long periods of isolation 
and ecosystem stability. 

More than 65% of South America’s total area 
is characterized by lowland plains under 1000 ft 
G00 m) in elevation. Some 40% of the continent is 
less than 650 ft (200 m) above sea level. The plains 
lie from about 8°N to 40°S between the lofty Andean 
backbone on the west coast and the Guiana and the 
Brazilian Highlands on the east coast, and between 
the Rio Orinoco in the north and the Rio Colorado 
in the south. Also in this region are the Llanos of 
Venezuela and Colombia, the Amazon plain of Brazil, 
the Paraguayan Chaco, and the Pampas of Argentina. 
Some of these areas are quite flat while others are 
undulating. Some, such as the Llanos and Chaco, are 
alternately flooded and baked. 

Physiographic features. The three Andes regions, 
the lowlands, and the eastern highlands are com- 
posed and bounded by many physiographic features, 
including mountains, intermontane basins, extensive 
lowland plains, coastlines, and rivers. 

Mountains. Because of the vast extent of the Andes, 
a greater proportion of South America than of any 
other continent lies above 10,000 ft 000 m). The 
young, rugged, folded Andean peaks stand in sharp 


contrast to the old, worn-down mountains of the 
eastern highlands. Although the Andes appear to be 
continuous, most geologists believe that they con- 
sist of several structural units, more or less joined. 
They are a single range in southern Chile, two ranges 
in Bolivia, and dominantly three ranges in Peru, 
Ecuador, and Colombia. 

Except in Bolivia, where they attain their maxi- 
mum width of 400 mi (640 km), the Andes are sel- 
dom more than 200 mi (20 km) wide. They do 
not equal the Himalayas in height, but have at least 
30 peaks above 20,000 ft (6000 m). The average 
height of the Andes is estimated to be 13,000 ft 
(3900 m). However, it is only north of latitude 35°S 
that the mountains exceed elevations of 10,000 ft 
(3000 m) [see table]. 

Both active and quiescent volcanic formations are 
common in southern Colombia, Ecuador, central and 
southern Peru, and western Bolivia. These volcanic 
peaks are the surface expression of the subduction of 
the Nazca tectonic plate beneath the South American 
plate, forming the offshore Peru-Chile Trench. The 
Andes were raised up by tectonic forces from 6560 ft 
(1970 m) to an average 13,120 ft G940 m) during the 
Pliocene uplift approximately 5 million years ago, 
and they continue to rise incrementally each year. 
As a result of these powerful tectonic forces, there 
is a spectacular series of volcanic peaks in western 
Bolivia and on each side of the structural depression 
in Ecuador. See PLATE TECTONICS. 

Stratification of climate and vegetation with 
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altitude can be readily observed in the Andes. At their 
eastern base is a zone with hot, humid lowland and 
foothills up to 3300 ft (1000 m), known as tierra 
caliente. Tierra templada, the zone from 3300 to 
6600 ft (1000 to 2000 m), has a relatively mild, frost- 
free climate and was preferred for European settle- 
ment and the production of plantation crops such 
as coffee and coca. From 6600 to 13,200 ft (2000 to 
4000 m) is tierra fria, a montane zone with occa- 
sional frosts. Tierra helada, at 13,200 to 19,800 ft 
(4000 to 6000 m), occupies the zone between the 
daily frost line and the final zone of permanent snow 
called tierra nevada. 

Because of the great north-south extent of the 
Andes, the processes of their erosion and denuda- 
tion have varied. Southward from about 40°S, and 
especially in the far south, the Andes were heavily 
glaciated during the Ice Age, and an extensive area 
north of the Strait of Magellan still has a broad mantle 
of permanent ice. Glaciers descend to the heads of 
many fiords on the Pacific coast or into lakes on the 
eastern side of the mountains. 

Coastal features. From the southern tip of Cape Horn 
north to 41°S latitude, the western coastal zone con- 
sists of a broad chain of islands where a mountain- 
ous strip subsided and the ocean invaded its val- 
leys. This is one of the world’s finest examples of 
a fiorded coast. Nowhere along the Pacific coast is 
there a true coastal plain. South of Arica, Chile, the 
bold, precipitous coast is broken by only a few deep 
streams, the majority of which carry no water for 





Principal Andean peaks* 





Peak Height, ft (m) 





Aconcagua, Argentina 
Ampato, Peru 

Caca Aca, Bolivia 

Cachi, Argentina 
Chimborazo, Ecuador 
Cincel, Bolivia 

Condoriri, Bolivia 
Coropuna, Peru 

Cuzco (Ausangate), Peru 
Del Acay, Argentina 

Dos Conos, Argentina 

Falso Azufre, Argentina-Chile 
Huascaran, Peru 

llampu, Bolivia 

Illimani, Bolivia 

Incahuasu, Argentina-Chile 
Llullaillaco, Argentina-Chile 
Mercedario, Argentina-Chile 
Ojos del Salado, Argentina-Chile 
Payachata, Bolivia 

Pissis, Argentina 

Porongos, Argentina-Chile 


22,835 (6960) 
21,702 (6615) 
20,329 (6196) 
21,326 (6500) 
20,577 (6272) 
20,102 (6127) 
20,043 (6109) 
22,802 (6950) 
20,187 (6153) 
20,801 (6340) 
22,507 (6860) 
22,277 (6790) 
22,188 (6763) 
21,276 (6485) 
21,282 (6487) 
21,720 (6620) 
22,015 (6710) 
21,884 (6670) 
22,573 (6880) 
20,768 (6330) 
22,245 (6780) 
20,512 (6252) 


Pular, Chile 20,342 (6200) 
Sajama, Bolivia 21,390 (6520) 
Sarmiento, Chile 20,670 (6300) 
Socompa, Argentina-Chile 19,787 (6031) 
Tocorpuri, Bolivia-Chile 22,163 (6755) 
Tortolas, de las, Chile 20,018 (6101) 


Tres Cruces, Chile 
Turpungato, Chile 
Valadero, Argentina 


21,720 (6620) 
21,490 (6550) 
20,735 (6320) 





*The elevations are approximate and some are controversial. Data 
based mostly on tables in Goode’s World Atlas, 12th ed., Rand 
McNally, 1964. 














Fig. 3. Uplifted beaches on desert coast, east side of San 

Juan Bay, Departmento de Ica, Peru. The sea cliff is about 
100 ft (30 m) high. Each horizontal line represents a former 
beach. (Photograph by F. Atchley) 


years at a time. Between Arica and Caldera, Chile, 
there are no natural harbors and almost no pro- 
tected anchorages. In fact, South America’s coast- 
line is the least indented of all the continents except 
Africa’s. The only exceptions are the fiorded coast 
and the offshore archipelagoes of southern Chile, the 
Gulf of Venezuela including Lake Maracaibo in the 
north, the Gulf of Guayaquil in the west, and the es- 
tuaries of the Amazon and the Rio de la Plata in the 
east. See FIORD. 

Many raised beaches are seen along the coast 
(Fig. 3), supporting the premise that the Andes are 
of recent origin and that the forces that raised them 
are still at work. Nearly every port in coastal Peru 
is an open roadstead (less enclosed than a harbor). 
Coastal Ecuador, separated by mountain spurs and a 
narrow coastal plain east of the Gulf of Guayaquil, 
varies in width from 50 to 150 mi (80 to 240 km). 
Two important river systems cross it, the Esmeraldas 
in the north and the Guayas in the south. But the most 
important segment of the entire area is the Guayas 
Lowland, east and north of the Gulf of Guayaquil. 
Here are found one-quarter of Ecuador’s total pop- 
ulation, the most important commercial agricultural 
lands, and the principal port, Guayaquil. The climatic 
transition that occurs in coastal Ecuador is one of 
the most pronounced in the world. Within a dis- 
tance of only 425 mi (680 km), one passes from areas 
having a rainy tropical climate, to wet and dry tropi- 
cal, to semiarid tropical, and finally to arid tropical— 
in short, from the world’s rainiest to its driest 
climates. 

The Caribbean coast of Colombia is a lowland 
formed largely of alluvium, deposited by the 
Magdalena and Cauca rivers, and bounded by moun- 
tains on three sides. In Venezuela, the Central 
Highlands rise abruptly from the Caribbean, with 
lowlands around Lake Maracaibo, west of Puerto 
Cabello, and around the mouth of the Rio Tuy of 
the Port of Guanta. The coastal region of Guyana, 
Suriname, and French Guiana is a low, swampy allu- 
vial plain 10-30 mi (16-48 km) wide, and as much 
as 60 mi (96 km) wide along the larger rivers. This 
coastal plain is being built up by sediments carried 
by the Amazon to the Atlantic and then deflected 
westward by the equatorial current and cast upon 
the shore by the trade winds. 


There is no broad coastal plain south of the 
Amazon and east of the Brazilian Highlands to af- 
ford easy access to the interior as is characteristic of 
North America’s coast. The rise from the coastal strip 
to the interior is quite gradual in northeastern Brazil; 
but southward, between Bahia and Rio Grande do 
Sul, the steep Serra do Mar is a formidable obsta- 
cle to transportation. Nonetheless, the first railroad 
was constructed here connecting São Paulo in the 
coffee region with the port of Santos. Though only 
40 mi (64 km) long, it was a remarkable feat since 
the abrupt escarpment of the Serra do Mar behind 
Río de Janeiro posed difficult engineering problems 
and required the use of cog tracks and cables. 

Along coastal Uruguay there is a transition be- 
tween the hilly uplands and plateaus of Brazil and 
the flat Pampas of Argentina, whereas coastal 
Argentina as far south as the Río Colorado, in 
Patagonia, is an almost featureless plain. In Patagonia, 
steep cliffs rise from the water’s edge. Behind these 
cliffs lies a succession of dry, flat-topped plateaus, 
surmounted occasionally by hilly land composed of 
resistant crystalline rocks. Separating southern Pata- 
gonia from Tierra del Fuego is the Strait of Magel- 
lan, which is 350 mi (560 km) long and 2-20 mi G- 
32 km) wide. Threading through numerous islands, 
the strait is lined on each side with fiords and moun- 
tains. 

Major rivers and river systems. There are three great river 
systems in South America and a number of impor- 
tant rivers that are not a part of these systems. The 
largest river system is the Amazon which, with its 
many tributaries, drains a basin covering 2.7 million 
square miles (7 million square kilometers), or about 
40% of the continent. The next largest is the sys- 
tem composed of the Paraguay, Parana, and La Plata 
rivers, the last being a huge estuary. The third largest 
river system, located in southern Venezuela, is the 
Orinoco, which drains 365,000 mi? (945,000 km?) 
of land, emptying into the Atlantic Ocean along the 
northeast edge of the continent. 

Amazon River system. The Amazon River begins 
in the eastern Andean highlands and stretches across 
the north-central part of the continent, draining the 
interior lowlands. Not only is it the longest, but it 
carries the largest volume of water and silt of any 
river in the world. It is also navigable for oceango- 
ing vessels for a greater distance than any other river. 
The Amazon discharges 3.4 billion gallons/min 13 bil- 
lion liters/min and has an average flow of 7.5 million 
cubic feet/s (210,000 cubic meters/s). Its discharge 
is about 18% of that of all rivers of the world. In 
places, for example, between Obidos, Brazil (600 mi 
or 800 km upstream) and the mouth, its depth ex- 
ceeds 300 ft (90 m). It drains an area of 2.3 million 
square meters (6 million square kilometers), and its 
gradient averages only 0.2 in./mi (3 mm/km) or about 
35 ft (11 m) for the last 2000 mi (4200 km) to the 
sea. Considering the enormous amount of silt, sand, 
and alluvial debris the Amazon carries to the Atlantic 
Ocean, it might be expected that a huge delta would 
have formed. But the Amazon has no delta because of 
the steady submergence of the land where the river 


reaches the sea. The Amazon is joined by some 500 
tributaries descending the Andes and the Brazilian 
and Guiana Highlands. 

Paraguay-Parana-La Plata system. From its head- 
waters in southwestern Brazil, the Paraguay River 
courses southward 1300 mi (1780 km), discharging 
into the Parana at the southern edge of Paraguay. The 
western bank of the Paraguay is low, and during the 
rainy season the western side of the entire basin be- 
comes inundated, flooding thousands of square miles 
of low-lying country. 

The Parana ranks among the world’s major rivers. 
It is longer and carries more water than the Missis- 
sippi River in the United States. Its source in the 
Brazilian Highlands is 2450 mi (4920 km) from its 
outlet in the Rio de la Plata. The Parana has cut a 
deep canyon in the flat-topped plateau of its upper 
course. As the river drops over the edge of the lava 
formation, famous waterfalls are found—La Guayra 
and Iguazu, the latter on the Iguazu, a tributary of the 
Parana. However, from Corrientes, where the water 
of the Paraguay is added, the gradient is gentle. Since 
the Parana rises in areas with characteristically heavy 
summer rainfall, its water volume fluctuates widely, 
the variation from high to low water reaching 15 ft 
(4.5 m) in the lower course. At the mouth of the 
Parana is an enormous delta, shared with the emp- 
tying Uruguay River. The delta consists of numerous 
low, flat islands that are submerged for weeks at a 
time. 

The largest indentation on the east coast of South 
America occurs from the head of the estuary of 
La Plata to the open Atlantic and is almost 200 mi 
(320 km) long. At Buenos Aires the estuary is about 
25 mi (40 km) wide, at Montevideo 60 mi (96 km), 
and at its mouth 138 mi (221 km). The Parana and 
the Uruguay transport huge quantities of silt into La 
Plata, some of which settles in navigation channels, 
necessitating costly dredging. 

Navigation for vessels drafting 6-7 ft (1.8-2.1 m) 
is possible as far as Corumba, Brazil, some 1800 mi 
(88 km) upstream from Buenos Aires via the Parana 
and Paraguay rivers. The remainder of the Parana is 
rarely used for navigation, and its hydroelectric po- 
tential is as yet undeveloped because markets are too 
distant. The Paraguay, however, serves the interior, 
and is considered the lifeline to the outside world for 
Paraguay. 

Orinoco River system. The headwaters of the 
Orinoco River lie in southern Venezuela near 
the Brazilian border. Near Esmeraldas, in southern 
Venezuela, is a stretch of about 220 mi (350 km) 
known as the Casiquiare canal, joining the Orinoco 
with the Negro of the Amazon system. 

The Orinoco, approximately 1600 mi (2560 km) 
long, is noted for its variability in volume from wet 
to dry season. It is a broad stream, and is a mile 
wide at Ciudad Bolivar, Venezuela, the “narrows.” 
Rainfall from June to October is so heavy that the 
Orinoco and its many tributaries are unable to han- 
dle all the water, and enormous areas become inun- 
dated. The river rises 39 ft (12 m) at Puerto Ordaz, 
Venezuela. From January to March, the dry season, 
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the waters recede and only the larger rivers flow 
freely. The smaller rivers are gradually converted into 
chains of pools and swamps lying along the valley 
bottoms. Where the Orinoco flows into the Atlantic, 
it forms a low, wide delta consisting of small islands 
and swamps and an intricate maze of distributaries. 

Biotic features. The plants and animals of the South 
American tropics are classified as Neotropical, de- 
fined by the separation of the South American and 
African continents during the Middle Cretaceous 
(95 million years ago). The Parana basalt flow, which 
caps the Brazilian shield in southern Brazil and ad- 
jacent parts of Uruguay and Argentina, as well as 
western Africa, indicates the previous linkage be- 
tween the South American and African continents. 
During the Middle Cretaceous Period, angiosperms 
(flowering plants) expanded in distribution across 
the Earth’s landmasses, largely replacing the gym- 
nosperms, which previously dominated the Gond- 
wana landmass. While there are regions of impor- 
tant endemicity for plants and animals within South 
America, many evolved early enough to share sig- 
nificant similarities with North American, Eurasian, 
and African fauna and flora. South America has many 
biotic environments, including the constantly moist 
tropical rainforest, seasonally dry deciduous forests 
and savannas, and high-altitude tundra and glaciated 
environments. 

Tropical rainforest. Amazonia contains the largest ex- 
tent of tropical rainforest on Earth. It is estimated 
to encompass up to 20% of the Earth’s higher plant 
species and is a critically important source of fresh 
water and oxygen. Structurally complex, the rainfor- 
est is composed of up to four distinct vertical layers 
of plants and their associated fauna. The layers often 
cluster at 10, 20, 98, and 164 ft (3, 6, 30, and 50 m) 
in height. The highest levels of canopy experience 
intense competition for light and water. Epiphytes 
and lianas are often found in the taller trees of this 
layer. The lower canopy and forest floor are usually 
open spaces because of the low intensity of light 
(around 1%) that reaches the forest floor. Over 75% 
of Amazonian soils are classified as infertile, acidic, or 
poorly drained, making them undesirable for agricul- 
ture because of nutrient deficiencies. Most of the nu- 
trients in the tropical rainforest are quickly absorbed 
and stored in plant biomass because the high annual 
rainfall and associated leaching make it impossible 
to maintain nutrients in the soils. In addition to the 
high structural complexity of the tropical rainforest, 
there is considerable horizontal diversity or patchi- 
ness. As many as 300 separate species of trees can 
be found in a square mile (2.6 km?) sample tract of 
rainforest in Brazil. The high complexity and species 
diversity of the rainforest are the result of long peri- 
ods of relative stability in these regions. Many areas 
of Amazonia show unique concentrations of species 
that suggest these territories may have acted as Pleis- 
tocene refugia where plants and animals were able 
to survive the glacial advances of higher latitudes. 
See PLEISTOCENE. 

Deciduous and conifer forest. Deciduous forests are 
found in areas where there is seasonal drought and 


the trees lose their leaves in order to slow tran- 
spiration. The lower slopes of the Andes, central 
Venezuela, and central Brazil are areas where these 
formations are found. Conifer forests occur in the 
higher elevations of the Andes and the higher lati- 
tudes of Chile and Argentina. Many of the species 
of these formations derive from the Late Cretaceous 
Southern Gondwana flora, including the Araucaria 
(star pine), Podocarpus, and Dacrydium. See DECID- 
UOUS PLANTS. 

Savanna. Tropical savannas occupy an extensive 
range in northern South America through southeast- 
ern Venezuela and eastern Colombia. Temperate sa- 
vannas are found in Paraguay, Uruguay, the Pampas 
of Argentina, and to the south, Patagonia. Savannas 
are composed of a combination of grass and tree 
species. The climate in these areas is often quite 
hot with high rates of evapotranspiration and a pro- 
nounced dry season. Most of the plants and animals 
of these zones are drought-adapted and fire-adapted. 
Tall grasses up to 12 ft 5.5 m) are common as are 
thorny trees of the Acacia (Fabaceae) family. Many 
birds and mammals are found in these zones, includ- 
ing anteater, armadillo, capybara (the largest rodent 
on Earth), deer, jaguar, and numerous species of ven- 
omous snake, including rattlesnake and bushmaster 
(mapanare). 

Desert. South America is unique in having a west- 
coast desert that extends almost to the Equator, 
probably receiving less rain than any on Earth (the 
Atacama), and an east coast desert located poleward 
from latitude 40°S (the Patagonian). 

The Atacama-Peruvian Desert dominates the 
Pacific coast for nearly 2000 mi (3200 km), lying 
between the ocean and the higher slopes of the 
Andean ranges. In many places, particularly in Chile, 
the land rises abruptly from the sea to an elevation of 
2000-3000 ft (600-900 m). The driest stretch, one 
of the driest places in the world, is between Arica 
and Caldera, Chile (Fig. 4). Above the seaward scarp 
is a low range of hills and a desert trough, 30-50 mi 
(48-80 km) wide, from which nitrate deposits are 
extracted. The persistence of this desert through lat- 
itudes at which deserts are not found elsewhere is the 
result of the north-moving Humboldt Current (Peru 
Current), accompanied along the immediate shore 
by the upwelling of cold water from considerable 
depths, creating stable, high-pressure atmospheric 





Fig. 4. Port of Matarani on the desert coast of southern 
Peru. This oblique aerial view looks coastwise southward 
over one of the world’s driest and most barren coastal 
zones. (Servicio Aerofotográfico Nacional) 


conditions. Prevailing winds, blowing from the west, 
carry cold air onto the warmer land with a drying 
effect. 

South America’s east-coast desert, the Patagonian, 
is found between the Rio Colorado and the Strait 
of Magellan. This desert is partly located in the rain 
shadow of the Andes, and the meager annual rainfall 
is ordinarily 5-6 in. (13-15 cm). Also contributing to 
the region’s aridity and to the presence of the desert 
near to the shores of the Atlantic are the cold waters 
of the Falkland Current, moving northward off the 
eastern coast, and the ceaseless winds, which has- 
ten evaporation. Wind velocities can exceed 70 mi/h 
G1 m/s). What precipitation there is occurs mostly 
in the winter. The region’s very cold winter temper- 
atures are the result of high latitude and high eleva- 
tion. 

Tundra. In Bolivia and Peru the zone from 10,000 
to 13,000 ft 000 to 3900 m), though occasion- 
ally to 15,000-16,000 ft (4500 to 4800 m), is known 
as the puna. Here the hot days contrast sharply with 
the cold nights. Above the puna, from timberline 
to snowline, is the paramo, a region of broadleaf 
herbs and grasses found in the highest elevations 
of Venezuela, Colombia, and Ecuador. Many of the 
plant species in these environments are similar to 
those found at lower elevations; however, they grow 
closer to the ground in order to conserve heat and 
moisture. 

Deforestation and environmental contamination. The 
1992 Quincentennial and the 1992 Earth Summit 
(United Nations Conference on Environment and 
Development) have drawn widespead attention to 
environmental policy and economic development 
in South American countries that have encouraged 
the destruction of natural vegetation. Intense poli- 
tical discourse has focused on the removal of 
tropical rainforest and the contamination of both 
lowland and highland environments across the con- 
tinent. An estimated 20% of the Amazonian forest 
has already been destroyed, and more than 7700 mi? 
(200,000 km?) are estimated to be replaced by agri- 
culture, cattle ranching, and urbanization each year. 
The loss of tropical rainforest in particular has been 
widely publicized, although many other biotic envi- 
ronments, including deciduous forest, coastal man- 
grove, and tundra environments, are also being de- 
graded by human use. 

Deborah A. Salazar; C. Langdon White 
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South Pole 

That end of the Earth’s rotational axis opposite the 
North Pole. It is the southernmost point on the 
Earth and one of the two points through which 
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all meridians pass (the North Pole being the other 
point). This is the geographic pole and is not the 
same as the south magnetic pole, which is near the 
coast of Antarctica, south of Tasmania. The South 
Pole lies inland from the Ross Sea, within the land- 
mass of Antarctica, at an elevation of about 9200 ft 
(2800 m). 

Since the Earth’s axis is inclined 231/,° from the 
vertical to the plane of the ecliptic, the South Pole ex- 
periences six months when the Sun is continuously 
above the horizon, reaching a maximum altitude of 
231/,°. It also has 6 months when the Sun remains 
continuously below the horizon, from March 22 to 
September 22. During the latter period, however, 
there is an interval of 7 weeks at the beginning and 
at the end when twilight conditions preclude total 
darkness. Twilight prevails when the Sun is located 
between 0° and 18° below the horizon. 

There is no natural way to determine local Sun 
time because there is no noon position of the Sun, 
and shadows point north at all times, there being no 
other direction from the South Pole. See MATHEMAT- 
ICAL GEOGRAPHY; NORTH POLE. Van H. English 

Bibliography. A. H. Strahler and A. Strahler, Intro- 
ducing Physical Geography, áth ed., 2005. 
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All the seas between Asia and Australia and the 
Pacific and the Indian oceans. They form a geograph- 
ical and oceanographical unit because of their spe- 
cial structure and position, and make up an area of 
3,450,000 mi? (8,940,000 km), or about 2.5% of the 
surface of all oceans. 

Ocean floor. The ocean floor consists of two large 
shelves and a number of deep-sea depressions. The 
Sunda Shelf is covered with littoral sediments, and 
the submerged valleys of two large river systems 
are found on it. The Arafura Shelf connects New 
Guinea with Australia. The China Basin, whose max- 
imal depth is 16,450 ft (5016 m), is connected with 
the Pacific Ocean over a sill about 6000 ft (2000 m) 
in depth. The Sulu Basin (maximal depth 18,300 ft or 
5580 m) has the highest sill (1400 ft or 420 m). The 
Celebes Basin (maximal depth 20,400 ft or 6220 m) 
is connected with the Pacific Ocean over three sills 
of about 4200 ft (1400 m) each. The Banda Basin 
is subdivided into several smaller basins, including 
the Weber Deep of 24,400 ft (7440 m); its sill depth 
is 6170 ft (1880 m). In the Celebes, Banda, and 
Flores basins, volcanic ashes are found. See EAST 
INDIES. 

Surface circulation. The surface circulation is com- 
pletely reversed twice a year (Figs. 1 and 2) by the 
changing monsoon winds. 

Monsoon current. During the north monsoon, which 
is fully developed in February, the monsoon current 
is formed in the northern China Sea, flows along the 
coast of Vietnam into the Java Sea, and into the Flores 
Sea. Parts of its water masses return into the Pacific 
Ocean; parts sink in the Banda Sea or flow into the 
Indian Ocean. 
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Fig. 1. Water circulation in South China and Indonesian 
seas, February. Transports in 10° m°/s. 


During the south monsoon in August, the mon- 
soon current is formed by water flowing from the 
Java Sea into the South China Sea and farther into 
the Pacific Ocean. 

The transports of the monsoon current are 
110,000,000 ft?/s (3,000,000 m3/s) in August and 
180,000,000 ft3/s (5,000,000 m/s) in February, but 
are small compared with those in the adjoining parts 
of the Pacific and Indian oceans (Figs. 1 and 2). 

Indonesian throughflow. Indonesian throughflow is a 
complex system of currents within the depth range 
of the thermocline to 1000 m (300 ft) that allows 
water to flow from the Pacific to the Indian Ocean. 
It is the only place in the global oceans where warm 
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Fig. 2. Water circulation in South China and Indonesian 
seas, August. Transports in 10° m°/s. 


equatorial waters pass from one ocean basin into 
another. Throughflow water must be replaced in the 
Pacific by inflowing, colder water at a high latitude in 
the South Pacific. Consequently, Indonesian through- 
flow exports a large amount of heat from the Pacific. 
The shift in heat between ocean basins, and the sub- 
sequent release of the heat to the atmosphere over 
the Indian Ocean, is recognized as a critical com- 
ponent of the global climate system. Consequently, 
Indonesian throughflow has been the subject of in- 
tensive oceanographic research. Attempts to under- 
stand why it exists led to fundamental theoretical 
advances in geophysical fluid dynamics. The studies 
showed that the mean throughflow is a response to 
the wind field over the whole Pacific Ocean. 

The main path of Indonesian throughflow is from 
the Celebes Sea near Mindanao, through Makassar 
Strait, eastward along the Indonesian archipelago 
through the Flores and Banda seas, and finally into 
the Indian Ocean passing south of Timor. Some 
flow passes through the lesser Sunda Islands. The 
strength of throughflow is probably not larger that 
360,000, 000 ft?/s (10,000,000 m3/s), while the tem- 
perature difference from the South Pacific is about 
10°C (18°F). The large temperature difference is the 
factor that makes Indonesian throughflow critical 
in the global climate system. The throughflow is 
strongest during the south monsoon (Fig. 2), when 
upwelling water from the Banda Sea contributes to 
the flow. Its strength varies considerably from one 
year to the next in response to changing winds over 
the equatorial Pacific and Indian oceans. The vari- 
ability is related to the El Nino Southern Oscillation 
phenomenon. See EL NINO. 

Salinity and temperature. The monsoons cause a pro- 
nounced rainy and dry season over most parts of the 
region and consequently strong annual variations of 
the surface salinity. During the north monsoon, rain- 
fall is strongest to the south of the Equator and sur- 
face salinity decreases, while in the China Sea the 
excess of evaporation over rainfall increases salinity. 
During the south monsoon, conditions are reversed; 
strong rainfall over the China Sea decreases its salin- 
ity, while evaporation and upwelling increase salinity 
in the Java, Flores, and Banda seas. Regions of per- 
manent low salinities are in the Malacca Straits, the 
Gulf of Thailand, and the waters between Borneo 
and Sumatra, because of the discharge of the large 
rivers. The surface temperature is always high, be- 
tween 79 and 84°F (26 and 29°C), and drops only 
in the northern parts of the South China Sea and in 
the Arafura Sea to about 75°F (24°C) during the dry 
season. See MONSOON METEOROLOGY. 

Deep circulation. The deep basins are supplied 
with Pacific Ocean deep water that enters over the 
various sills (Fig. 3). Only the Timor and Aroe basins 
are filled with Indian Ocean deep water. The renewal 
of water in the deep-sea basins is sufficiently rapid, 
on the order of 100 years, to maintain an adequate 
level of oxygen for the deep-sea fauna. 

Tides. The tides are mostly of the mixed type. Di- 
urnal tides are found in the Java Sea, Gulf of Tonkin, 
and Gulf of Thailand. Semidiurnal tides with high 
amplitudes occur in the Malacca Straits. Strong tidal 
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Fig. 3. Flow of bottom waters in the eastern part of the 
Indonesian Archipelago. 


currents occur in many of the small shallow passages. 
See INDIAN OCEAN; PACIFIC OCEAN; TIDE. 
Klaus Wyrtki; Gary Meyers 
Geology and geophysics. The Southeast Asian seas 
are characterized by the presence of numerous major 
plate boundaries. Plate boundaries represent sites 
where the Earth’s crust is created or destroyed, or 
where rigid blocks of crust slide past one another. 
In Southeast Asia the plate boundaries are identified, 
respectively, by young, small ocean basins with their 
spreading systems and associated high heat flow; 
deep-sea trenches and their associated earthquake 
zones and volcanic chains; and major strike-slip faults 
such as the Philippine Fault (similar to the San 
Andreas Fault in California). The Southeast Asian seas 
are thus composed of a mosaic of about 10 small 
ocean basins whose boundaries are defined mainly 
by trenches and volcanic arcs. The dimensions of 
these basins are much smaller than the basins of the 
major oceans. The major topographic features of the 
region are believed to represent the surface expres- 
sion of plate interactions, the scars left behind on the 
sea floor. See BASIN; FAULT AND FAULT STRUCTURES. 
The proximity of important metalliferous ore de- 
posits in selected islands of Southeast Asia, coupled 
with the fact that other nearby areas show signifi- 
cant potential for oil and gas, has led many scientists 
to investigate models that could relate plate tectonic 
processes to the formation of ore deposits and oil. 
Ocean basins. The small ocean basins which make 
up the Southeast Asian waters are relatively young 
when compared with the ages of the major ocean 
basins. The basins of Southeast Asia that have been 
dated with some confidence are all younger than ap- 
proximately 50,000,000 years. As part of the Deep 
Sea Drilling Project, the vessel Glomar Challenger 
drilled several holes along an east-west line (near 
18°N) from the central part of the Philippine Sea into 
the Pacific Basin. One purpose of the drilling pro- 
gram was to determine the age and chemical prop- 
erties of the underlying sea floor for comparison with 
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the sea floor of the major ocean basins. The ultimate 
goal of the drilling is to determine the evolutionary 
processes which have given rise to the extremely 
complex tectonic features which are observed in the 
Southeast Asian seas. 

Sediments. The sediments that blanket the sea floor 
in the Southeast Asian seas primarily reflect the prox- 
imity of land (that is, sediment sources) and the age 
and depth of the underlying sea floor. The thickly 
sedimented areas are found close to the continental 
margins and island arcs. The sediments here are com- 
posed mostly of materials transported from the adja- 
cent continents and islands. It is possible that an ad- 
ditional component of the sediments found near the 
margins was first deposited on the deep-sea floor but 
was subsequently uplifted onto the margins when 
certain plates collided. See CONTINENTAL MARGIN; 
MARINE SEDIMENTS. 

Tectonic processes. The study and understanding of 
the tectonic processes active in this region are of 
fundamental importance to many unresolved prob- 
lems of the Earth’s history. For example, it is a widely 
considered hypothesis that the continental regions 
may have evolved through the subduction (under- 
thrusting) and the obduction (overthrusting) of an- 
cient island arc systems with one another and with 
preexisting continental fragments. Another impor- 
tant hypothesis regarding the origin of metallifer- 
ous ore deposits is known as the geostill hypothe- 
sis. This postulates that rich metalliferous deposits 
were formed on or in the ocean crust near sites of 
plate accretion, that they were transported laterally 
to the arc-trench areas by sea-floor spreading pro- 
cesses and there were thrust back into the Earth’s 
interior; eventually they were remelted and migrated 
back to the Earth’s surface. When these metals (for 
example, copper) are found in concentrations such 
as in the Philippines, they are of great economic im- 
portance. 

Finally, the release of energy by earthquakes and 
volcanic eruptions at the collisional plate bound- 
aries underlines the need for more studies so that 
the destructive effects of these events can be mini- 
mized through accurate monitoring and prediction 
programs. See MARINE GEOLOGY; PLATE TECTONICS. 

Dennis E. Hayes 
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Glycine max, a legume native to China that is a major 
source of vegetable protein and oil for human and 
animal consumption and for industrial usage. Intro- 
duced into North America in 1765, soybeans were 
first grown as a curiosity and then as a forage and 
soil-building crop. The valued portion of the plant is 
the seed, which contains about 40% protein and 20% 
oil on a dry-weight basis. 

Economic importance. The United States is the 
world’s largest producer of soybeans. Illinois, 
Iowa, Minnesota, Indiana, Ohio, Missouri, Nebraska, 
Arkansas, South Dakota, Kansas, Michigan, and Mis- 
sissippi are the major soybean producers there. Rank- 
ing after the United States are Brazil, China, and 
Argentina. About 64% of the soybeans produced in 
the United States are crushed or processed domes- 
tically, 31% are exported as whole beans, 3% are 
used for seed, and 2% have miscellaneous uses. Coun- 
tries that import large quantities of whole beans for 
processing include Japan, the Netherlands, Taiwan, 
Spain, Mexico, South Korea, Germany, and Belgium. 
Russia, Canada, Algeria, and Venezuela import large 
quantities of soybean meal; and Pakistan, Morocco, 
Tunisia, Ecuador, and the Dominican Republic are 
the major importers of soybean oil. 

Livestock consume about 97% of the soybean pro- 
tein that is utilized in the United States. There is a 
limited use of the protein for human consumption 
as soy flours, grits, and texturized proteins in combi- 
nation with other foods. Soybean oil is widely used 
throughout the world for human consumption as 
margarine, salad and cooking oils, and shortening. 
A small amount of protein and oil is used for indus- 
trial purposes. 

In China, Taiwan, Japan, Korea, and Indonesia, 
soybeans are a regular part of the human diet as a 
source of protein and oil. Soybean milk, made by 
soaking beans in water, grinding the soaked beans, 
and filtering the insoluble pieces out, is directly con- 
sumed primarily by the Chinese; it is the starting ma- 
terial for preparation of soybean curd, probably the 
most important and popular soybean food in Asia. 
The soft cheeselike curd, known as tou fu, tofu, 
tubu, and other local names, has a bland taste and 
can be flavored with seasonings or blended with 
other foods. Soybean curd is made into a variety 
of products by frying, dehydration, and freezing, 
and is consumed daily in the same manner as high- 
protein foods in Western cultures. Soybeans also are 
prepared by fermentation with microorganisms to 
form highly flavored foods and seasonings such as 
soy sauce and fermented soybean paste. Soybean 
sprouts, immature green seed, and deep-fried mature 
seed provide additional sources of food for human 
consumption. 

Plant characteristics. The soybean is an annual 
plant that reaches a mature height of 25-50 in. (64- 
127 cm). The cotyledons are the parts of the seed 
that emerge from the soil as the seedling develops. 
Two unifoliolate leaves (single leaflets) develop op- 
posite each other above the cotyledonary nodes. 


Other leaves produced on the plant generally are 
trifoliolate (three leaflets). Branches develop from 
the main stem in response to the amount of space 
between adjacent plants. A plant may produce few 
branches and only a few seeds in close spacings or 
many branches and over a thousand seeds in wide 
spacings. 

Flowers of soybeans are self-fertilized and white or 
varying shades of purple. Pubescence (hair) on the 
stems, leaves, and pods is brown or gray. Pod color of 
commercial varieties in the United States is brown or 
tan, although genetic types are available with black 
pods. The pods generally contain one to four seeds 
ranging in size from about 800 to 10,000 seeds per 
pound (1800 to 22,000 seeds per kilogram); how- 
ever, most commercial varieties have two or three 
seeds per pod with a size of 1600 to 3600 seeds per 
pound (3600 to 8000 seeds per kilogram). The seed- 
coat of commercial varieties is yellow, and the com- 
mon colors of the hilum (seed scar showing point 
of attachment to pod) are yellow, gray, buff, brown, 
imperfect black, or black. Varieties of soybean na- 
tive to Asia have yellow, green, black, brown, and 
varicolored seedcoats. 

As a legume, soybeans can obtain nitrogen needed 
for growth from nitrogen-fixing bacteria that live in 
their roots. The bacteria can be applied at the time 
of planting by mixing them with the seed or by di- 
rect application to the soil. They can survive in most 
soils for several years and do not have to be reap- 
plied when soybeans are grown in a field every 2 or 
3 years. See LEGUME; NITROGEN FIXATION. 

The growing season for soybean varieties is con- 
trolled by their response to photoperiod and temper- 
ature; consequently, the most productive varieties 
for the northern United States are different from 
those grown in the South. Number of days from 
planting to maturity can range from about 90 to 180, 
depending on the variety and environmental condi- 
tions. 

More than 15,000 varieties and selections of soy- 
beans have been introduced into the United States 
from other areas of the world, and in the past some 
of them were grown commercially. Current pro- 
duction, however, is entirely devoted to varieties pro- 
duced by the breeding programs conducted by the 
U.S. Department of Agriculture, state agriculture ex- 
periment stations, and private companies. 

Most soybeans in the United States are planted 
in a well-prepared seedbed, although in some lo- 
cations they are planted in unplowed fields to re- 
duce soil erosion. In areas where the growing sea- 
son is long enough, soybeans are planted as a second 
crop immediately after wheat or other crops are har- 
vested. Planting is done in rows ranging in width 
from 7 to 40 in. (18 to 102 cm) with equipment 
also used to plant corn, cotton, wheat, and similar 
crops. Weed control includes the use of chemical 
herbicides, mechanical cultivation, and hand labor. 
Harvest of soybeans with combines begins when the 
leaves have dropped and moisture in the seed is at 
or below the 13% level required for safe storage. 

Walter R. Fehr 


Diseases 


Most diseases of soybean that occur throughout the 
world can be found in the United States. Diseases 
tend to be chronic in nature, occurring with some 
regularity from year to year and taking a small (and 
often unnoticed) toll. Annual losses from all diseases 
have been estimated at 12%, and this figure can fluc- 
tuate considerably, depending upon environmental 
conditions in any one particular season. Many soy- 
bean diseases are usually found in any one field, and 
a disease-free field is rare indeed. 

Although the term disease is often difficult to de- 
fine, it is usually considered that soybean diseases 
can be caused by both living and nonliving enti- 
ties. In the first category are the parasitic diseases 
caused by fungi, nematodes, bacteria, and viruses. 
The nonliving causes of disease include conditions of 
weather and soil and the deficiency or excess of var- 
ious minerals. Iron, potassium, and manganese defi- 
ciencies are common, and varying degrees of chloro- 
sis accompany such deficits. Herbicide and lightning 
damage are also often reported by growers. 

Fungus diseases. Fungi cause more diseases than 
other parasitic organisms. Leaves can be attacked 
by Peronospora manshurica and become severely 
spotted with yellowish-green areas on upper leaf sur- 
faces. If the case is severe enough, the leaves can 
fall prematurely; the leaves more often remain in- 
tact, and the spots take on a dark-brown appearance 
with fungus growth apparent on the lower side of 
leaves. This symptom gives rise to the common term 
mildew. The fungus spreads rapidly from plant to 
plant and may overwinter in fallen leaves or invade 
pods and survive on seeds as a white crust. Many 
pathogenic variants of this fungus are found, and re- 
sistant varieties of soybean are being developed. Up 
to 8% reduction in yield has been reported from this 
disease alone in Iowa. 

In the Midwest, brown spot, caused by Septoria 
glycines, is another disease of importance on leaves. 
It appears early in the season as angular lesions on 
primary leaves and may continue to move up the 
plant and cause defoliation, attack pods, and ulti- 
mately invade seed. Target spot, a disease caused by 
Corynespora cassticola, is found farther south in the 
soybean-growing region and has caused losses in the 
range of 18-32% in Mississippi. Its name is derived 
from concentric dark rings formed as large brown 
and often irregular spots develop on leaves. Fungus 
leaf spots of minor importance include those caused 
by Cercospora sojina, Phyllosticta sojicola, and Al- 
ternaria atrans. 

A series of fungus root and stem rots may 
plague soybeans. Root rot, caused by Phytophthora 
megasperma, is probably the most spectacular and 
serious disease in areas where it occurs. Plants die 
in all stages of growth, usually in low places in fields 
and along end turn rows. Several states and Canada 
have reported damage from this disease, and control 
is effected by the use of resistant varieties. Inheri- 
tance of resistance is controlled by a single factor, 
thus making it relatively simple to incorporate this 


resistance into promising (or preexisting) varieties 
of soybeans. The pathogen is not seed-borne. 

By 1980, 19 pathogenic races had been described, 
and some researchers believe that single-gene resis- 
tance or tolerance no longer is a viable approach to 
control. Either a generalized resistance or tolerance 
has been considered. Most commercial cultivars pos- 
sess considerable field resistance to Phytophthora. 

Brown stem rot, caused by Cephalosporium gre- 
gatum, is a disease of major concern in the Mid- 
west, and the fungus is widespread in Minnesota, Illi- 
nois, Iowa, and Indiana. Crop rotation (on a 3-4-year 
basis) is a well-known control. Resistance as a viable 
method of control was investigated in the 1970s, and 
by the end of that decade at least two major produc- 
ing states were considering this method of control. 
Seed rot and root rot by Pythium and Rhizoctonia 
are common and chronic on seedlings over vast areas 
of the soybean region. Plants able to emerge usu- 
ally outgrow this problem; however, stand reduction 
may be severe enough to warrant replanting. Stem 
canker and pod and stem blight are two stages of 
an important disease caused by Diaporthe phaseolo- 
rum which are favored by high humidity and can be 
controlled to some extent by chemical treatment. A 
widespread and serious seed quality problem in the 
warmer and more humid regions of the South and 
Midwest results from pod infection by Diaporthe. 
Systemic fungicides have provided some improve- 
ment in seed quality and yields. 

Soybean rust, a fungus-induced disease, is assum- 
ing more importance as acreages expand in tra- 
ditional soybean growing areas and as production 
moves into new areas. The disease is serious in Asia, 
especially in tropical and subtropical areas. In Tai- 
wan, Thailand, the Philippines, and Java it is the most 
damaging fungal disease affecting soybean crops. 
The pathogen, Phakopsora pachyrhizi, can infect 
and sporulate on species in 17 known legume gen- 
era but is economically important only on soybean. 
The pathogen has been reported in Africa on cowpea 
but not on soybean. In 1976 soybean rust was found 
on soybean cultivars Biloxi, Hardee, Santa Rosa, and 
Williams in an experimental planting in the highlands 
of western Puerto Rico. This was the first record of 
rust on soybean in the Western Hemisphere, and is 
cause for concern in the Americas. It is likely that the 
pathogen will be found on soybean plantings else- 
where in the Caribbean area. 

Bacterial diseases. Bacterial diseases usually are a 
chronic problem of this crop. Bacterial blight (Fig. 1) 
caused by Pseudomonas glycinea is found primarily 
in the upper Midwest and bacterial pustule caused by 
Xanthomonas phaseoli predominates in the south- 
ern United States. These pathogens enter plants 
through pores or injuries in leaves and are spread 
rapidly by windblown rain. Seed-borne infection is 
common and constitutes the major means of over- 
wintering. Infected seedlings can thus emerge, and 
the disease cycle begins again. Antibiotic sprays 
are impractical, and resistant strains of the blight 
pathogen develop rapidly during their use. Some va- 
rieties of soybean are more resistant to blight than 


Soybean 


55 


56 Soybean 


Ea 


Fig. 1. Bacterial blight symptoms on soybean leaves under 
field conditions. The condition results principally in chlorate 
spots and malformations. 


others (Fig. 2), and inheritance of resistance in the 
soybean host depends upon the particular strain of 
the pathogen being tested. 

A chlorosis (yellowing of leaves) may develop from 
strains of Rhizobia, which are abnormal in that they 
cannot fix nitrogen but are aggressive in entering soy- 
bean roots. These bacteria thus occupy space that 
efficient nitrogen fixers would otherwise occupy. A 
bacterial wilt caused by Corynebacterium flaccum- 
faciens has been reported in certain areas of the Mid- 
west but is not of great economic importance. 

Virus diseases. The viruses may enter soybean 
plants by any of several methods, although each virus 
usually is very specific in its mode of entrance. They 





Fig. 2. Response of soybean varieties to bacterial blight. (a) Complete resistance. 
(b) Complete susceptibility. 





may be injected into the plant by insects or nema- 
todes or through wounds or may go from one gen- 
eration to the next via the seed. Only a few of the 
viruses of this crop have been studied extensively. 

Soybean mosaic is common and is easily transmit- 
ted mechanically or by insects or seeds. It usually ap- 
pears on leaves as crinkling and distortion of young 
tissues, and damage is similar to that produced by 
sprays used in weed control. It affects the seed by 
producing a mottled appearance; the seed becomes 
discolored when pigments from the seed scar appear 
to streak across seed coat surfaces during their de- 
velopment. Control is difficult, but yield losses have 
not been excessive. 

Bud blight, on the other hand, is a very destructive 
virus disease, and no resistance is known. It, too, is 
seed-borne; it is apparently transmitted by insects. 
Usually, isolated plants in a field are found scattered 
at random (originating from infected seed), or plants 
along the edge of fields bordering alfalfa or fence 
rows may be heavily infected. This phase is thought 
to be due to an insect vector moving the virus from 
other plants to soybeans. Loss in yield can be 100% 
of the crop. This virus causes the plant to produce 
a rank green stem which stays green after frost, and 
seed pods are either absent or small; if infected while 
young, the growing points of plants die and curve 
downward, giving rise to the common name “bud 
blight.” The same virus causes ring spot in tobacco. 
See PLANT VIRUSES AND VIROIDS. 

Nematodes. These pathogens are microscopic, eel- 
like worms in the soil. Members of the genus 
Meloidogyne cause root knots or galls which form 
as large swellings and cause severe yellowing and 
stunting of aboveground portions of the plant. Crop 
rotation and the use of resistant varieties constitute 
the usual control methods. The soybean cyst nema- 
tode Heterodera glycines represents a real threat to 
soybean production and is now so widespread that 
quarantine no longer is effective and has been aban- 
doned. It was first found in 1952 in North Carolina 
and since then has been found in isolated places 
in Virginia, Mississippi, Arkansas, Illinois, Kentucky, 
Missouri, and Tennessee. It is now feared from south- 
ern to northern climates. It moved northward in Illi- 
nois for many years, and was found for the first time 
as far north as Minnesota in 1978 and in Iowa in 1979; 
it appears to be destructive in northern as well as 
southern climates. Symptoms vary but are not very 
dissimilar to root knot symptoms on aboveground 
parts of the plants. Below ground, small white cysts 
are noted; once present, these cysts can exist many 
years in the soil, and complete control becomes al- 
most impossible. Yield loss may be severe. Resis- 
tant varieties and rotation with nonsusceptible crops 
are recommended. See NEMATA (NEMATODA); PLANT 
PATHOLOGY. Bill W. Kennedy 


Processing 


Processing of soybeans into oil and defatted cake or 
meal began in the Orient. The oil was used primarily 
in foods, whereas the meal was used as a fertilizer and 
in animal feed. In the United States, soybeans have 
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Fig. 3. Steps involved in the solvent extraction method for soybeans. 


been processed primarily to obtain oil. The oil was 
processed for use in shortenings, margarines, and 
salad dressings. The defatted cake or soybean meal, 
once a by-product of soybean processing, has be- 
come the principal product from soybeans because 
of its protein value for livestock feeding. Soybean 
protein products are prepared primarily for direct 
human consumption. Direct food usage has been lim- 
ited, but has been expected to increase dramatically 
because of population pressure on the food supply 
and limited land and energy resources available for 
expansion of the livestock industry. 

The soybean processing industry converts the oil 
and meal from the soybean seed into numerous food 
and industrial products. The products can be gen- 
erally classified into three categories: soybean oil, 
soybean meal, and soybean protein products. The 
processing steps are oil extraction, oil refining, and 
protein processing. 

Oil extraction. The basic processing of soybeans 
consists of separation of the oil and meal. The equip- 
ment used varies considerably from plant to plant al- 
though the general process, solvent extraction of the 


oil, remains the same. Older, obsolete methods used 
either continuous screw presses or hydraulic presses 
to squeeze out the oil. Solvent extraction has almost 
universally been adopted because of its efficiency 
and low operating cost. 

The steps involved in the solvent extraction of soy- 
beans are outlined in Fig. 3. Storage of soybeans in 
elevators or silos is included as part of the process- 
ing since it ensures a continuous, year-round supply 
of beans for processing without the day-to-day price 
fluctuations of the commodity market. The soybeans, 
prior to storage, are cleaned and dried to 12-14% 
moisture for improved stability until processed. 

Preparation. The processing operation consists of 
preparation, extraction, and meal-finishing phases. 
Preparation for extraction consists of steam- 
conditioning the beans, cracking, dehulling, and flak- 
ing. Cracking the beans into small particles, usually 
six to eight pieces, loosens the hulls from the soy- 
bean meats and assists in the flaking operation. Cor- 
rugated rollers are used for the cracking operation. 
Dehulling or separation of the hull material from the 
cracked meats is necessary to increase extraction 
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efficiencies. Dehulling is performed by a combina- 
tion of air aspiration, screening, centrifugal separa- 
tion, and density separation methods. Flaking the 
cracked, dehulled soybean meats facilitates the ex- 
traction process by disruption of the internal cellular 
structure of the bean and by allowing better contact 
of the extracting solvent with the internal parts of 
the seed. Flaking is done by heating and steaming 
the cracked meats and passing them through differ- 
ential roller mills. The resulting flakes are paper-thin 
(0.009-0.02 in. or 0.25-0.40 mm). 

Extraction. This operation consists of countercur- 
rent washing of the flakes with a petroleum solvent, 
usually hexane. The solvent is kept near 120°F (50°C) 
during extraction to speed the solubilization of oil 
from the flakes. The oil-laden solvent, called micella, 
is then separated from the flakes and distilled for 
recovery of the crude soybean oil. The solvent is 
recovered for further use. The crude soybean oil is 
then pumped to storage tanks for further process- 
ing or to be sold as crude soybean oil. See SOLVENT 
EXTRACTION. 

Finishing. The solvent-wet, defatted flakes are des- 
olventized by steam and heat. The severity of the 
desolventizing depends on the uses for the meal. If 
the extracted flakes are to be used in animal feeds, 
moisture is adjusted and the flakes are heat-treated 
(toasted) to increase their feed value as a protein 
supplement. Flakes to be used for edible soy flour or 
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protein products are subjected to less severe heating 
to prevent heat damage to the proteins present. After 
desolventizing, the flakes are screened and ground. 
The protein content may also be adjusted. Anticaking 
agents may be added to ensure ease in handling. 

Yields. The processing yields based on a 60-lb 
(27.2-kg) bushel are 10.2-11.6 Ib (4.6-5.3 kg) of 
crude oil and 48.4-49.8 Ib (22-22.6 kg) of meal 
and feed products. The maximum yield of crude oil 
is always attempted. The meal from the extraction 
contains 49-50% protein. The hulls, accounting for 
about 5% of the beans, are ground and may be added 
back to the extracted meal for production of a 44% 
protein meal. The quantities of each meal prepared 
depend upon the market value at the time of pro- 
duction. The ground hulls may be stored for later 
use, sold as mill feed to be used as a source of fiber in 
mixed livestock feed products, or used in a patented 
process as a carrier for molasses in feed concentrates. 

Soybean oil processing. Soybean oil is used in salad 
dressings, shortenings, and margarines and in var- 
ious nonfood applications such as paints and var- 
nishes. Most (93%) is used in foods. To be suitable 
for edible products, crude oil from the extraction 
requires further processing. 

The processing of soybean oil, generally referred 
to as refining, is outlined in Fig. 4. Individual pro- 
cesses are degumming, alkali refining, bleaching 
(decolorizing), hydrogenation, deodorization, and 
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Fig. 4. Steps involved in the soybean oil refining process. 


winterization. Semirefined oils are also marketed, de- 
pending upon the customer’s demands. Hydrogena- 
tion is varied, depending upon the final use of the 
oil. All edible oils are deodorized. 

Degumming. Phospholipids, or lecithin, present in 
the crude soybean oil will precipitate during storage 
and contribute to deterioration during use of the oil. 
During degumming, the soybean phospholipids are 
removed from the crude oil. Degumming itself refers 
to the gummy consistency of the wet phospholipids. 
The process involves the addition of a small amount 
of water or steam to the crude oil, resulting in the 
hydration of the phospholipids and their subsequent 
precipitation. Acidic catalysts are sometimes added 
with the water to improve the hydration. The pre- 
cipitated phospholipids are then separated by cen- 
trifugation and are vacuum-dried. The dried phos- 
pholipids are sold as soybean lecithin. 

Lecithin. Commercial soybean lecithin is widely 
used in foods, particularly chocolate, margarine, 
shortening, and greaseless frying compounds. 
Soybean lecithin from the degumming step contains 
60-65% phospholipids and 30-35% entrained soy- 
bean oil. The principal phospholipids present are 
phosphatidylcholine, | phosphatidylethanolamine, 
and phosphatidylinositol. The crude lecithin is 
dark-colored and may be bleached with hydrogen 
peroxide to produce a light, straw-colored product. 
Fatty acids may be added to make a fluid product, or 
the lecithin may be treated with ammonia to form 
a firm, plastic consistency. More refined grades (phar- 
maceutical lecithin) are produced by acetone extrac- 
tion of the entrained soybean oil. The phospholipids 
are insoluble and precipitate. Further processing 
consists of desolventizing and granulating. 

Alkali refining. Free fatty acids decrease the heat 
stability of the oil. The degummed soybean oil is 
alkali-refined to remove the free fatty acids, residual 
phospholipids, and color. Alkali refining consists of 
addition of caustic soda, soda ash, or a combination 
of the two to neutralize the free fatty acids. Careful 
addition of the alkali is required to prevent further 
degradation of the oil. The neutralized free fatty acids 
or soaps are removed from the oil by centrifugation 
or settling. The soaps, referred to as soapstock, are 
sold to soap manufacturers, fatty acid industries, or 
livestock feed formulators in the alkaline form, or in 
an acidulated form after treatment with sulfuric acid. 
The alkali-refined oil is further washed with water to 
remove all traces of alkali and soaps present and then 
is dried. The neutralized oil is often sold to industrial 
users for nonfood applications. 

Bleaching. Bleaching (decolorizing) removes the 
color bodies or pigments present in the oil by ad- 
sorption on activated earth or carbon. The bleaching 
agent is simply dispersed in the oil and removed by 
filtration. The bleached oil is sent to the hydrogena- 
tion unit or to the deodorizers, depending upon the 
final use. Foods requiring improved oxidative stabil- 
ity or plastic fats such as margarines or bakery short- 
enings require hydrogenation. 

Hydrogenation. Development of the hydrogenation 
process for vegetable oils has permitted the substi- 


tution of vegetable oils for animal-derived fats and 
the substitution of the various types of vegetable 
oils with soybean oil. Variations in the hydrogena- 
tion conditions can be used to produce high-stability 
soybean oil, hardened soybean oils that melt rapidly 
at body temperature, and shortening-like fats. 

Hydrogenation is performed by purging the oil 
with hydrogen gas at an elevated temperature and 
pressure in the presence of a catalyst. Nickel catalysts 
are generally used. The addition of hydrogen to the 
fat saturates the molecular structure. The degree of 
saturation depends upon the time, temperature, and 
pressure of hydrogenation. See HYDROGENATION. 

Deodorizing. Deodorization of the oil removes the 
undesirable flavor and odor constituents. Oils must 
be odorless and bland in flavor to be suitable for food 
use. Deodorization is carried out under high tem- 
perature and vacuum with steam injected to assist 
volatization of the undesirable components. 

Winterization. Winterization is performed on lightly 
hydrogenated soybean oil that is to be used as a salad 
oil. The oil, in addition to conforming to oxidative 
stability requirements, must remain clear at refriger- 
ator temperatures. Winterization consists of cooling 
of the oil and filtering off the cloudy haze that forms. 

Reversion. Refined soybean oil is a highly unsatu- 
rated oil and will develop, after storage, characteris- 
tic flavors and odors described as painty or fishy. The 
development of these flavors is called reversion, and 
is possibly a result of the oxidation of the linolenic 
acid component of the oil. Care in the exposure of 
the oil to air during processing and controlled hy- 
drogenation to partially saturate the linolenic acid 
minimizes the tendency of soybean oil to reversion. 
Metal sequestrants, such as citric acid, are sometimes 
added during deodorization to inactivate the proox- 
idant metals such as iron and copper. Antioxidants 
may also be added. See ANTIOXIDANT; FAT AND OIL 
(FOOD). 

Soybean protein products. The soybean protein 
products are prepared from the extracted meal. 
Three distinct product types are prepared: soy flour 
and grits, soy protein concentrates, and soy protein 
isolates. These are used both in foods and in nonfood 
applications. 

Flourand grits. Soy flour and soy grits contain approx- 
imately 50% protein based on the nonlipid fraction. 
Flours and grits are used in processed meats, bak- 
ery mixes, cereals, breads, and baby foods. A large 
amount has been utilized in nutrients for export to 
undeveloped, underfed countries. 

The term soy flour refers to an extremely fine 
powdered product, whereas the term grits refers to 
a granular product. Various products are prepared 
varying in oil content, oil type, particle size, and 
water solubility of the protein. In general, the oil con- 
tent varies from less than 1% (defatted) to 18-20% 
(full fat); the oil source is soybean oil or lecithin. Par- 
ticle size varies from powders that pass a 200-mesh 
screen (flour) to those that pass a 14-mesh (grits) 
one. Water solubility of the protein ranges from 10% 
(toasted) to 80% (untoasted). 

Soy flour or grits is produced similarly to soybean 
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Fig. 5. Production of soy protein concentrates and isolates. 


meal, except with thorough removal of the hulls be- 
fore extraction. Desolventizing and toasting are var- 
ied to alter the water solubility of the protein, with 
minimum heating being required for high protein sol- 
ubility. The defatted, desolventized flakes are ground 
and screened to the desired particle size. 

The oil-containing products are generally prepared 
by blending soybean oil or lecithin back to the ex- 
tracted meals. Full-fat soy flour may also be prepared 
by steaming the unextracted, dehulled flakes to re- 
move the bitter flavors, and then drying, grinding, 
and screening them to the desired particle size. 

Concentrates. Soybean protein concentrates and soy- 
bean protein isolates are generally prepared from soy 
flour or grits. The processing methods vary consid- 
erably between processors. A generalized process is 
shown in Fig. 5. 

Soybean protein concentrates contain 70% pro- 
tein. Concentrates are prepared by washing the 
water-soluble carbohydrates from the meal under 
conditions designed to ensure minimum protein 
solubility. Dilute acids or aqueous alcohol solu- 
tions have been used. The washing is performed 
in agitated tanks. The remaining insoluble proteins 
and carbohydrates are removed from the extraction 
by centrifugation or filtration, washed, dried, and 
ground. Products are marketed with ranges of par- 
ticle sizes and protein solubilities similar to soy flour 
and grits. Soy protein concentrates are usually bland 
in flavor, possess a lighter color, and are more func- 
tional than soy flour (having emulsification, water- 
binding, gelling, and other properties). 


Isolates. Soy protein isolates are the purest form of 
soy proteins marketed (90% protein minimum). The 
isolates are prepared by solubilizing the protein pres- 
ent in the grits by using mildly alkaline solutions, sep- 
arating the protein solution from the fibrous residue 
by centrifugation or filtration, and acidifying the pro- 
tein solution to precipitate the protein as a curd. The 
curd is recovered, washed, resolubilized, and spray- 
dried into a fine, flourlike powder. 

Soy protein isolates are manufactured to possess 
specific functional properties such as gelling, whip- 
ping, foaming, emulsifying, thickening, and binding. 
Isolates are used in meats, simulated dairy products, 
and bakery products. 

Textured proteins. Each of the soybean protein prod- 
ucts can be used to form textured soy proteins. 
Textured soy proteins have a meatlike texture and 
appearance when hydrated and can be used for ex- 
tension of ground and comminuted meats (such as 
ground meat and bologna) and for the preparation 
of simulated meats. When they are used as exten- 
ders, the textured proteins increase cook yield, aid 
in binding of meat constituents, and improve emul- 
sion stability. 

Two general processes (with variations) are used 
for texturizing soy proteins: extrusion cooking and 
spinning. Extrusion cooking consists of moistening 
the protein, working the doughlike mass under high 
temperature and pressure, and forcing the cooked 
mass through an orifice, which permits the heated 
compressed mass to expand. The product is then 
dried and sized. Colored and flavored extruded soy 
protein products are prepared by addition of these 
additives either before or after extrusion. Imitation 
bacon bits that are based on extruded soy flour are 
widely marketed. 

Soy protein isolates are required for spinning. 
Spun proteins are somewhat more adaptable to simu- 
lated meat production than extruded proteins. Spun 
proteins are produced by solubilizing the isolates in 
an alkaline solution and forcing this solution through 
a multiholed die immersed in an acid bath. The 
solubilized protein immediately coagulates in the 
acid bath, forming a continuous filament. The fila- 
ments are stretched to orient the protein molecules 
within the filament. Toughness of the filaments is 
dependent upon the degree of protein orientation. 
Filament diameters are approximately 0.003 in. 
(0.076 mm). Groups of filaments when combined 
with egg white, flavors, and colors, followed by com- 
pression and heat setting, form imitation meats such 
as ham, chicken, or beef. See FOOD MANUFACTURING. 

F. T. Orthoefer 
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Space 

Physically, space is that property of the universe as- 
sociated with extension in three mutually perpen- 
dicular directions. Space, from a newtonian point of 
view, may contain matter, but space exists apart from 
matter. Through usage, the term space has come to 
mean generally outer space or the region beyond 
the Earth. Geophysically, space is that portion of the 
universe beyond the immediate influence of Earth 
and its atmosphere. From the point of view of flight, 
space is that region in which a vehicle cannot ob- 
tain oxygen for its engines or rely upon an atmo- 
spheric gas for support (either by buoyancy or by 
aerodynamic effects). Astronomically, space is a part 
of the space-time continuum by which all events 
are uniquely located. Relativistically, the space and 
time variables of uniformly moving (inertial) refer- 
ence systems are connected by the Lorentz transfor- 
mations. Gravitationally, one characteristic of space 
is that all bodies undergo the same acceleration in 
a gravitational field and therefore that inertial forces 
are equivalent to gravitational forces. Perceptually, 
space is sensed indirectly by the objects and events 
within it. Thus, a survey of space is more a survey 
of its contents. See EUCLIDEAN GEOMETRY; LORENTZ 
TRANSFORMATIONS; RELATIVITY; SPACE-TIME. 

The space around the Earth and extending out per- 
haps 10 earth radii (40,000 mi or 64,000 km) has 
properties which differ from those of interplanetary 
space. The chief reasons for this difference are the 
existence of the gravitational and magnetic fields of 
Earth. These fields affect the densities and motions of 
neutral and charged particles in the vicinity of Earth. 
The corona of the Sun, which was once believed 
to be a static and limited atmosphere, fills the entire 
solar system, although in a highly tenuous form. This 
corona, in the form ofa plasma of hydrogen particles, 
is pervaded by a magnetic field, and it flows past the 
Earth at speeds as high as 250 mi/s (400 km/s). See 
VAN ALLEN RADIATION. 

The geomagnetic field of the Earth, unlike an 
undistorted dipole field, is confined to a limited 
region of space, as data supplied by satellite mea- 
surements suggest. The actual field is distorted by 
radiation belts and primarily by its interaction with 
the solar wind. This volume of space is called the 
magnetosphere, and is bounded by a thin layer called 
the magnetopause that fluctuates in response to the 
solar wind. The magnetosphere appears to have the 
shape of an elongated comet whose tail points away 
from the Sun. 

Between 10 to 20 earth radii, a shock wave is pro- 
duced by the flow of the solar wind around the mag- 
netosphere. 

The gravitational field allows Earth to retain its at- 
mosphere but is not strong enough to prevent escape 
of the atmosphere completely. Above a level of about 
310 mi (500 km), where the atmosphere is so rar- 
efied that collisions between atoms can be neglected, 
the atoms describe ballistic orbits and form the exo- 
sphere. Light atoms, especially hydrogen, but also he- 
lium, occasionally obtain enough energy so that they 


escape completely from Earth’s gravitational field. 
See ATMOSPHERE; MESOSPHERE. 

Earth’s magnetic field deflects moving charged par- 
ticles. Particles of the highest energy, namely cosmic 
rays, are deflected by the magnetic field so that only 
those having an energy of more than 15 GeV can en- 
ter near the Equator. At the poles, however, cosmic 
rays of all energies may enter because there the lines 
of force of the magnetic field are vertical. As a result, 
cosmic ray intensity at the pole is 10-50 times higher 
than at the Equator. See GEOMAGNETISM. 

Solar corpuscular radiation, containing particles 
of much lower energies, is affected more strongly 
by Earth’s magnetic field and is deviated into zones 
at high latitudes. Many of these solar particles are 
trapped in Earth’s magnetic field and remain in it 
for long periods of time, moving back and forth 
along a line of force, but being reflected from each 
end. In this process particles may be accelerated to 
higher energies and then be energetic enough to pro- 
duce luminous displays in the upper atmosphere. In 
their trapped condition these particles contribute to 
Earth’s radiation belt, first observed in early Earth 
satellite experiments. Another source for the radia- 
tion belt comes from cosmic rays which plunge into 
Earth’s atmosphere, disintegrate atmospheric nuclei, 
and throw back into space debris which can be 
trapped. See ASTRONOMY; AURORA; COSMIC RAYS. 

S. F. Singer 
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Collectively, the various biological sciences con- 
cerned with the study of organisms and their com- 
ponents when exposed to the environment of space 
(or of a spacecraft beyond the immediate influence 
of Earth and its atmosphere). Space biology studies 
include all the biological science disciplines, such 
as molecular, cellular, developmental, environmen- 
tal, behavioral, and radiation biology; physiology; 
medicine; biophysics; and biochemistry. Space biol- 
ogy encompasses both basic and applied sciences, 
and focuses on using space research to understand 
and resolve biological problems on Earth, as well as 
alleviating or preventing the deleterious physiologi- 
cal changes associated with space flight or habitation 
in non-Earth environments. See SPACE. 

As crewed space flight developed, medical re- 
search focused on the health and well-being of as- 
tronauts was paramount. This medical research has 
historically been included in the area of aerospace 
medicine. From these research efforts, directed at 
survival of living systems in the space environment, 
has evolved the broader discipline of space biol- 
ogy, which uses the space environment to gain 
an understanding of basic biological processes. See 
AEROSPACE MEDICINE. 

Space biology considers all of the inherent phys- 
ical factors of the space environment and their ef- 
fects on living organisms: (1) factors due to the en- 
vironment of space, such as radiation and reduced 
pressure; (2) factors due to the flight dynamics of 
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spacecraft, including acceleration, weightlessness, 
and vibration; (3) factors due to the environment of 
the spacecraft, such as air composition, atmospheric 
pressure, toxic materials, noise, and confinement; 
and (4) factors due to the environment on nonter- 
restrial bodies, such as the level of gravity, compo- 
sition of soil, and temperature extremes. Of these 
factors, only weightlessness and space radiation are 
unique to the space environment, but the effects of 
the other factors must be considered and mimicked 
in ground-based or flight controls when conducting 
space biological experiments. In addition, space ex- 
periments must take into consideration the physical 
effects of weightlessness on the environment sur- 
rounding and supporting the organisms. For exam- 
ple, neither gases nor liquids behave in a space envi- 
ronment as they do under the conditions found on 
Earth, as sedimentation, buoyancy, and convection 
do not occur in a weightless environment. See AC- 
CELERATION; WEIGHTLESSNESS. 


Space Gravitational Biology 


Among the effects of space flight, weightlessness 
(free fall) associated with orbital flight presents the 
greatest scientific opportunity and poses the great- 
est challenge. All life has evolved under the most 
constant and ubiquitous of the Earth’s environmen- 
tal factors, gravity. The near-weightless (microgray- 
ity) environment of space flight provides the means 
to manipulate gravity below 1 g in order to under- 
stand its effects on biological systems. Studies in mi- 
crogravity provide information as to the influence of 
gravity on living systems, and insights into possible 
life in environments beyond Earth. 

Periods of near-weightlessness, ranging from a few 
seconds to about 15 minutes, can be achieved by 
using drop towers, parabolic airplane flights, and 
sounding rockets. To extend the exposure time pe- 





Fig. 1. Cosmos 782, one of a series of Soviet and Russian spacecraft that carried 
biological experiments. Launched on November 25, 1975, on a 20-day mission, it carried 
more than 20 different species. Scientists from France, Czechoslovakia, Hungary, Poland, 
Romania, the United States, and the Soviet Union participated in these investigations. 


riod to days, weeks, and months requires orbital 
space flight. Biological specimens have been sent 
into space in a variety of satellites and crewed space 
ships (Figs. 1 and 2). The collection of biological 
data in space has been complicated by a number of 
limiting and confounding factors: few flight oppor- 
tunities, short-duration space flights, adverse space- 
craft environmental conditions, complicating space- 
craft flight dynamics, accommodations for only a few 
species, limited means to manipulate specimens in 
space, and the general lack of a 1-g reference control 
centrifuge on board the spacecraft. Yet many signif- 
icant observations have been made that establish a 
foundation for future space research. 

Ground-based research. A variety of ground-based 
experimental methods have been used successfully 
to complement space experiments. Each method de- 
pends upon changing the force imposed on part of 
an organism or the entire organism—for example, by 
using centrifugation to increase the level of gravity 
or changing the gravitational vector by clinostating 
(randomly rotating). These methods are restricted by 
the untoward additional biological effects they cre- 
ate. Organs and tissues such as bone and muscle that 
have evolved to support animals against the Earth’s 
gravitational force have been the most responsive 
to ground-based modeling. Rats exposed to space 
flight and those subjected to 1-g and 2-g centrifuga- 
tion showed that total body calcium was decreased 
by reduced gravitational loads and increased by el- 
evated gravitational loads. Computerized analysis of 
the morphology of the rat inner-ear vestibular sys- 
tem, crucial for orientation and balance, has shown 
that the number of nerve cells innervating these or- 
gans is also gravity-dependent. There are more nerve 
cells in space-flight-exposed tissue, for which the 
gravitational stimulus has been reduced, while the 
number of cells is decreased when the gravitational 
load is increased to 2 g. Further, bone cells unloaded 
by rotation demonstrate differential responsiveness 
to growth mediators compared to static cultures. Im- 
position ofa load to the cells results in cell-type differ- 
entiation. During centrifugation, the increase in the 
level of gravity produces distinct differences in the 
behavior of rodents. Activity is reduced, and there 
are shifts in circadian rhythms opposite to those ob- 
served during space flight. Ground-based research 
using these types of experimental models provides 
direction for and insights into space flight experi- 
ments. 

Effects on unicellular organisms. Space experi- 
ments have been conducted with a variety of unicel- 
lular organisms, including bacteria, fungi, protozoa, 
and algae. Developmental, physiological, and behav- 
ioral effects have been noted. In prokaryotic cells 
(cells without nuclei, such as bacteria), changes in 
metabolism have been repeatedly documented. Bac- 
teria grow two to three times more rapidly in micro- 
gravity, are more resistant to antibiotics, and have 
more complete exchange of nuclear material during 
conjugation than on the ground. Decreased sporula- 
tion is observed, suggesting a change in bacterial dif- 
ferentiation in microgravity. While no differences are 


found in the genetic recombination mechanisms of 
transformation or transduction of Escherichia coli, 
conjugation is increased three- to fourfold in space 
flight. See BACTERIAL GENETICS; BACTERIAL PHYSIOL- 
OGY AND METABOLISM; TRANSDUCTION (BACTERIA); 
TRANSFORMATION (BACTERIA). 

Under microgravity conditions the eukaryotic 
single-celled organism Paramecium tetraurelia dis- 
plays an increased growth rate accompanied by 
increased cell volume and decreased dry weight, 
protein content, and calcium. There are possible 
changes in cell membrane assembly and extrusion 
of trichocysts as well. The alga Chlorella vulgaris 
shows changes in the ultrastructure of the cells, 
and rearrangement of organelles. The degree and 
spectrum of change are correlated with the dura- 
tion of the space flight. Mitochondrial density and 
volume and the number of cristae in plastids are 
the first to undergo change. Chlamydomonas cul- 
tures show increased cell size and a prolongation of 
the active growth phase. The slime mold Physarum 
polycephalum demonstrates that both light and grav- 
ity regulate the contraction frequency of plasmodial 
strands. The results implicate the mitochondria, 
since they play a major role in changing the con- 
traction rhythm, and suggest that the mitochondria 
not only supply most of the energy necessary for the 
contraction but also may be involved in its regula- 
tion and possibly even in the perception of the grav- 
ity stimulus. See CELL ORGANIZATION; CELL PLASTIDS; 
MITOCHONDRIA. 

Effects on animals. Physiological and developmen- 
tal effects of microgravity have been observed in 
many types of animals. 

Physiological effects. Bone in growing rats is subtly 
altered in space flights lasting up to 21 days. The 
mineral concentration, size, and bone resorption are 
minimally changed, while bone formation, mineral- 
ization, and maturation are suppressed. While the 
bone increases in size in growing rats during flight, it 
has reduced strength or resistance to breaking. Bone 
vascularity, nerves, ligaments, tendons, and vertebral 
discs of young rats all respond in some way to space 
flight. 

Skeletal muscles that normally function to over- 
come gravity atrophy during space flight. Although 
muscle mass changes occur quickly, the underlying 
physiological changes may continue even after the 
mass achieves maximum loss. There is a decrease 
in the cross-sectional area of muscle fibers (primar- 
ily of the slow-twitch oxidative fibers), associated 
with a substantial loss of muscle protein, especially 
myofibril proteins; a decrease in alpha-actin mRNA; 
increased glucose uptake in the presence of insulin; 
and increased activities of glycolytic enzymes; while 
oxidation of fatty acids appears to be reduced. In 
cardiac muscle, mitochondrial numbers decrease, 
and increased lipid and glycogen content indicate 
a reduced work capacity. Cardiac muscle cells ap- 
pear deficient in their ability to utilize energy, based 
on adenosine triphosphate (ATP) content and fatty 
acid metabolism. Regulatory proteins in the ventri- 
cle show reduced responsiveness to hormone stim- 


Fig. 2. Payload specialist Mamoru Mohri using a microscope to produce photomicro- 
graphs of mammalian cells during the Spacelab SL-J mission, a joint effort of Japan and 
the United States aboard flight STS 47 of the space shuttle Endeavour in September 1992. 
The biopack contains a health-monitoring experiment. (NASA) 


ulation, and heart cells show mild degradation. See 
CARDIOVASCULAR SYSTEM; SKELETAL SYSTEM. 

In rats flown in space, there is a reduction in the 
volume of red blood cells. Hemolysis of red blood 
cells is about three times greater in rats exposed 
to space flight than in control rats, and the mean 
life-span of these cells is decreased by about 4 days 
during flight. This reduction is in part due to a de- 
crease in plasma levels of erythopoietin, reducing 
production of red blood cells. There is a reduced im- 
mune response to mitogens. Livers from rats exposed 
to weightlessness contain up to 100% more glycogen 
than those from control rats, and bone and other or- 
gans also show increases in lipid content. Analysis of 
the body constituents of space-flight-exposed young 
rats shows an increase in potassium, creatine, nitro- 
gen, and magnesium and a decrease in calcium, phos- 
phorus, and sodium when compared to 1-g controls 
on Earth. Conversely, rats subjected to an increase in 
gravity to 2 g by centrifugation exhibit a decrease in 
potassium, creatine, and nitrogen and an increase in 
calcium, phosphorus, magnesium, and sodium. Pi- 
tuitary oxytocin and vasopressin are decreased, sug- 
gesting in-flight adjustments in the regulation of body 
water content. Rats, like humans, appear dehydrated 
after flight, with changes in water-salt metabolism. 
The extent of this water loss appears to be related to 
the time after landing at which the measurements are 
made. The longer the time after landing, the greater 
the loss of body water. This is corrected within 
3-5 days after return to Earth. Upon return to Earth, 
animals have a pronounced increase in urine output, 
further supporting a reduction in regulation of water 
and electrolyte homeostasis. 

Decreased levels of growth hormone are related 
to increased concentration of growth hormone in 
pituitary cells; these cells release less growth hor- 
mone when cultured postflight, suggesting a secre- 
tory dysfunction. These changes in growth hormone 
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also appear to be associated with a reduction in trans- 
lation, as adequate amounts of mRNA are present. 
Most changes induced by space flight are corrected 
within 1 month of return to Earth, and an artificial 
gravity level of 1 g during flight appears to normalize 
many flight-induced effects. See ENDOCRINE MECHA- 
NISMS. 

In rhesus monkeys, cardiovascular measurements 
taken during space flights indicate a decrease in 
heart rate, increase in blood pressure, and recipro- 
cal changes in blood flow velocity associated with 
adaptation to space flight. Opposite responses are 
noted with exposure to increased gravity produced 
by centrifugation. After space flight, studies of mus- 
cle function and morphology show a shift in muscle 
typing from slow twitch to fast twitch. This is pre- 
dominantly noted in those muscles used to support 
and move the body against gravity. In addition, the 
function of the muscle during movement is altered. 
Circadian rhythms of activity and body temperature 
persist during space flight, but both regulation and 
circadian timing of these systems are significantly in- 
fluenced by microgravity. At the same time, heart 
rate maintains a normal 24-hour rhythm, indicating 
it is unaffected by microgravity. Studies of vestibu- 
lar function suggest that acclimation to microgravity 
causes changes in the way that body orientation in- 
formation is processed in the nervous system. These 
changes in vestibular function, body orientation, and 
muscle morphology and function contribute to al- 
tered posture and movement upon return to Earth. 
There are significant alterations in walking and run- 
ning gait. Fine motor control also appears to be af- 
fected. These changes are similar to those reported 
in astronauts. 

Reproduction and development. The experimental diffi- 
culty in initiating fertilization in space and the short 
duration of most space flights have been limiting fac- 
tors in research on animal reproduction and devel- 
opment in space. The scientific questions fall in two 
main categories: (1) the effects on fertilization and 
early development, and (2) the effects on the differ- 
ent organ systems, since each has a critical develop- 
ment period that is especially sensitive to environ- 
mental perturbation. The majority of the research 
has focused on fertilization and early development, 
and in general has found that microgravity has mini- 
mal effect on the animals tested. 

Normal fertilization and development of fruit flies 
can occur in microgravity. However, larval hatching 
and the number of adults emerging after recovery 
are significantly reduced, oogenesis is affected, and 
there is a slight increase in abnormal head morphol- 
ogy. Stick insects also show a decrease in hatching 
with apparent normal development. The develop- 
mental process in polyps and ephyrae of jellyfish (Au- 
relia aurita) appears to be normal. However, their 
pulsing movement is abnormal and may reflect ab- 
normal development of the graviceptors, the nerve 
net, or the neuromuscular system. Successful fertil- 
ization of the sea urchins Paracentrotus lividus and 
Lytechinus pictus has been achieved; however, sig- 
nificant abnormalities were noted in the centrosome- 


centriole complex of dividing cells. These abnormal- 
ities suggest alterations in developmental processes 
in microgravity. The nematode Caenorhabditis ele- 
gans exhibits normal development and chromosome 
mechanics. 

The fish Fundlulsu reproduces and develops nor- 
mally, and gravid guppies give birth to normal young 
when returned to Earth. Female frogs of the species 
Xenopus laevis have been stimulated to ovulate in 
space, and their eggs have been fertilized in micro- 
gravity. Exposure to microgravity from oocyte matu- 
ration through hatching does not deleteriously affect 
amphibian morphogenesis or permanently alter be- 
havior. The duration of the flights with developing 
frogs has been short, and some developmental dif- 
ferences have been noted that may affect morpho- 
genesis. For example, the lungs of tadpoles reared in 
space are not inflated, either because they are not 
needed for buoyancy or because the young cannot 
break through the air-water interface to inflate them. 
Upon return to Earth, tadpoles fail to swim appropri- 
ately, possibly due to the difference in buoyancy. 

Apparently normal chicks have hatched from fertil- 
ized Japanese quail eggs (Coturnix coturnix japon- 
ica) incubated in space. They react to visual and au- 
ditory stimuli and display normal motor activity but 
have orientation problems. Early chicken embryos 
have died during space flight following early prein- 
cubation on the ground. The deaths have been sug- 
gested to result from the absence of buoyancy during 
space flight. The yolk normally floats at the top of the 
egg, allowing the embryo close contact with the shell 
for easy diffusion of oxygen. During space flight the 
yolk does not float to the top of the egg, and thus 
the embryo may be starved for oxygen and die. Later 
in development, surviving chicks from embryos ex- 
posed to space flight show decreased vestibular sen- 
sitivity to gravitational stimuli. 

In rats, early development during gestation does 
not appear to be affected by microgravity. Pregnant 
rats launched into space early in gestation deliver 
physiologically and behaviorally normal rat pups 
upon return to Earth. However, maternal differences 
occur that could affect the young. The metabolism 
of the mammary glands of rats flown in space is re- 
duced and could result in a decrease in milk pro- 
duction. Young rats flown on the space shuttle prior 
to weaning do show deleterious effect. If they are 
too young, mortality is very high, as the micrograv- 
ity environment affects behavioral interactions be- 
tween the mother and neonate. Animals that survive 
appear to have neurological deficiencies that persist 
into later life when returned to Earth. 

Effects on cultured animal cells. Most mammalian cells 
grown in microgravity (Fig. 2) show a reduced 
growth rate compared to ground controls. Reduced 
proliferation is observed with cultured human lung 
cells, monkey kidney cells, mouse hybridoma cells, 
and human lymphocytes. The rate of glucose utiliza- 
tion may be reduced, and there may be changes in 
membrane structure or receptors. Culture experi- 
ments with cells that are dependent upon attach- 
ment to a substrate are compromised in microgravity, 


where all objects tend to float. Culturing of lympho- 
cytes in microgravity is not similarly impacted be- 
cause the cells are capable of autonomous motility. 
Immune-cell cultures in space produce interferon 
more rapidly and with four to eight times higher 
titers. Human mononuclear cells grown in space 
lose 50-90% of their ability to respond to stimula- 
tion with the mitogen concanavalin A, compared to 
control cells grown on Earth or in 1-g centrifuge in 
space. Immune cells also display major depression 
of blast cell transformation in response to mitogens, 
loss of cytokine production or function, and changes 
in natural killer cell activity and response to colony- 
stimulating factor. Changes in cellular function are 
associated with alteration of gene levels. Human kid- 
ney cells grown in space show alterations in signal- 
ing pathways, and structures within the cell possi- 
bly contribute to functional changes. See CELLULAR 
IMMUNOLOGY. 

Effects on plants. Plants were the first organisms 
to demonstrate a biological role for gravity. Experi- 
mental interest centered on gravitropism, the differ- 
ential growth induced by gravity that causes shoots 
to grow up and roots to grow down. This process 
allows plants to correct for changes in direction im- 
posed by the environment, such as roots growing 
around rocks. See PLANT MOVEMENTS. 

Although seeds and plants have been flown in 
space for over 40 years, experiments have been infre- 
quent and experimental conditions limiting. Major 
environmental influences due to the physical proper- 
ties of the space flight environment have been iden- 
tified. Successes of early plant experiments were in- 
fluenced by the physical properties of water in the 
microgravity environment. For example, free- 
floating water forms into a ball, while contact with 
a surface causes the water to form a film. These 
properties led to inappropriate nutrient delivery. 
Airflow was also limited in early flights due to the 
lack of convection, which allowed the creation of 
boundaries around leaves limiting nutrient delivery 
and increased concentrations of waste products. 
These problems were overcome by engineering in- 
terventions in the design of subsequent habitats to 
grow plants. Results of experiments on plants in 
space suggest an extensive direct influence of grav- 
ity on all aspects of plant growth, development, and 
metabolism. 

Physiology, orientation, and tropisms. Plants are exqui- 
sitely sensitive to gravity. Graviperception (the mag- 
nitude of acceleration required to induce a directed 
growth response) has been calculated at approxi- 
mately 10~> g in shoots and 10~* g in roots in flight 
experiments. 

Orientation of roots and shoots (Fig. 3) is directed 
by external stimuli. In microgravity, light directs the 
orientation of wheat and pea shoots while roots re- 
main disoriented. These studies suggest that, at least 
in these species, gravity directs the orientation of 
root growth, while gravity and light guide shoot 
orientation. Both circumnutation (a gravity-induced 
touch response) and epinastic movements (bending 
of leaves downward due to growth of their upper 





Fig. 3. Shoots in space. 


surface) occur in microgravity. See PLANT GROWTH. 

Flight experiments indicate that microgravity af- 
fects a plant’s mineral balance. It has been postulated 
that cytoplasmic calcium concentration increases, 
calcium ion adenosine triphosphatase activity is al- 
tered, and ion intake by roots is disrupted in micro- 
gravity. In fact, calcium has been linked to both the 
mechanisms of gravitropism and the mitotic defects 
seen in seedlings grown in microgravity. 

Reproduction and development. Space flight does not af- 
fect germination, and its effects on growth are un- 
clear. Generally, by gross observation plants appear 
to grow normally for short periods of time in mi- 
crogravity. Over extended periods of microgravity, 
plants slow or stop growing and then die, most often 
at the flowering stage. Success was finally achieved 
with small, fast-growing Arabidopsis seeds which 
were sown in space, completed a full life cycle, and 
produced fertile seeds in 1982. Growth was not vig- 
orous, few seeds were produced, and the seeds had 
a significant level of abnormalities. Many of these ob- 
servations were demonstrated to be the product of 
the physical properties of the seeds’ space environ- 
ment. Stagnant air layers that form in the absence of 
convection during microgravity stress the plants by 
producing localized high humidity, diminished avail- 
ability of metabolic gases, buildup of harmful gases, 
and high temperatures at the top of the plant near 
light sources. Implementation of protocols to com- 
pensate for these properties has greatly improved the 
reproductive development and germination of Ara- 
bidopsis such that, in every aspect examined, values 
were similar to those of ground control plants. 

Cell division is generally reduced or inhibited in 
microgravity. In fact, in one space shuttle exper- 
iment, oat seedlings had only one-tenth as many 
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Fig. 4. Groups of reconstructed gravity-sensing cells from the root tips of 3-day clover seedlings. (a) Cells grown on Earth. 
Gravity-sensitive particles (amyloplasts) are sedimented to the bottom of the cell. (b) Cells grown in space (microgravity; no 
gravity direction), with larger particles that display grouping. (c) Cells grown in a clinostat where they are subjected to slow 
tumbling (gravity in all directions), with particles that are fewer in number and dispersed, showing that slow tumbling does 
not mimic the effect of space flight on plants. Particle size and number respond to gravity sensitivity, while particle 


distribution responds to biochemical signals. 


dividing cells as ground controls. Carrot and day lily 
plants grown in microgravity from somatic cells de- 
velop a substantial number of binucleate cells, while 
the ground control cells are uniformly uninucleate. 
This may be due to a defect in the ability to put 
down wall material. Chromosomal perturbations, in- 
cluding breakage, translocations, and deletions, are 
seen in a variety of microgravity-grown plants de- 
rived from somatic embryos, tissue-culture plantlets, 
and seeds. Dosimetry data collected in conjunction 
with some of these experiments have ruled out ra- 
diation as a cause of the damage, and in-flight fixa- 
tion of the plants eliminates reentry and 1-g ground 
recovery perturbations as possible causes of the ob- 
served chromosomal damage. Microgravity caused 
disturbances in the mitotic process. Abnormalities 
are believed to be associated with spindle distur- 
bances, although this has yet to be demonstrated. See 
CELL DIVISION; CHROMOSOME ABERRATION; MITOSIS. 

The pattern and rate of cellular differentation may 
be altered by microgravity. Undifferentiated clusters 
of anise cells differentiate more rapidly in space. Car- 
rot and lettuce embryo roots and grape hyacinth ga- 
metophytes show accelerated development as well; 
and roots of lettuce, Crepis, and oat seedlings dif- 
ferentiate prematurely. This could indicate that mi- 
crogravity accelerates organ formation and develop- 
ment. See CELL DIFFERENTIATION. 

Cellular effects. The size of plant cells may be incre- 
ased, decreased, or normal in microgravity. Rounded 
cells are seen in space-grown shoots of pine and 
maize and the roots of maize and Crepis, possibly 
due to the increased effect of surface tension and re- 
duced cell-to-cell mechanical pressure in micrograv- 
ity. Root caps do not regenerate on some plants in 
microgravity; however, this failure could be associ- 
ated with the environment of their growth chamber. 

Growth on Earth is responsible for the arrange- 
ment of at least some cell organelles, for in mi- 
crogravity the distribution of not only amyloplasts 


but also endoplasmic reticulum is altered. Amylo- 
plasts in root caps of plants grown in microgravity 
(Fig. 4 are always distributed randomly. Moreover, 
their structure and starch content are changed. 
Chloroplasts of space-grown pea plants show mor- 
phological changes. See ENDOPLASMIC RETICULUM. 

Nuclei of space-grown pea and Arabidopsis root 
cells show abnormal distribution and increased 
amounts of condensed chromatin, which correlates 
well with the general observation of decreased mito- 
sis and increased chromosomal aberrations in cells 
grown in microgravity. This suggests reduced func- 
tional activity that could indicate an effect on protein 
synthesis. See CELL NUCLEUS. 

Mitochondria of microgravity-grown plants are 
characteristically swollen. This would suggest an 
effect of microgravity on cellular metabolism. 
Microgravity-grown seedling root cells contain 50- 
90% less dictyosomal volume than cells of ground 
control plants. This correlates with the thinning of 
cell walls in space. 

It has been proposed that cellulose and lignin 
should be reduced in space because they function in 
a structural capacity that is unneeded there. Flight ex- 
periments have supported this assumption. See CELL 
(BIOLOGY); SPACE FLIGHT. 


lonizing Radiation 


The physical environment of space includes a broad 
spectrum of ionizing radiation, ranging from the 
ultraviolet to galactic cosmic rays, that is able to pen- 
etrate the life-supporting enclosures in which organ- 
isms live (habitats, spacecraft, and space suits). In ad- 
dition, biological specimens are exposed to products 
derived from interactions of radiation with other bio- 
logical systems and materials in the spacecraft. These 
by-products, such as free radicals and superoxides, 
can impact biological processes. 

The components of space radiation consisting of 
energetic charged particles are the ones of greatest 


concern in terms of possible biological effects. They 
can be categorized as galactic cosmic radiation, geo- 
magnetically trapped radiation (the Van Allen belts), 
and other particles of solar origin. The number of 
particles arriving from the Sun varies as a function 
of solar activity, which has a cycle of approximately 
11 years. During solar particle events, large numbers 
of energetic charged particles, mainly protons, are 
ejected from the Sun and increase the interplanetary 
radiation levels by several orders of magnitude. See 
COSMIC RAYS; MAGNETOSPHERE; SUN; VAN ALLEN RA- 
DIATION. 

On Earth the atmosphere shields the planet 
from short-wavelength ultraviolet radiation and low- 
energy charged particles. The magnetic field of the 
Earth shields the planet from low- and medium- 
energy charged-particle radiation by deflecting the 
particles along magnetic field lines, where they re- 
main captured for long periods of time. The effective- 
ness of this magnetic shielding varies with latitude, 
being highest at the Equator and disappearing at the 
magnetic poles. The displacement of the magnetic 
poles with respect to the geographic poles results in 
regions of higher radiation (for example, the South 
Atlantic Anomaly) within some low-altitude space- 
craft orbits. 

Measurements of galactic cosmic radiation from 
space probes show that these particles consist of 
87% protons (hydrogen nuclei), 12% alpha particles 
(helium nuclei), and 1% energetic heavy nuclei, also 
known as HZE (high charge and energy) particles, 
ranging from lithium to tin. The energy of individ- 
ual particles is very high, reaching 107° eV in some 
instances. Consequently, it is virtually impossible to 
passively shield against these particles. 

Effects of HZE particles. The energy transfer (the 
number of ionizations) that occurs along the track 
of a charged particle is identified in terms of lin- 
ear energy transfer. High linear energy transfer char- 
acterizes highly ionizing radiation, while low linear 
energy transfer characterizes sparsely ionizing radia- 
tion. When HZE particles enter cells, they produce 
intense ionization along their path and generate a 
cascade of secondary ionization events. The number 
of ions produced per unit length of track varies with 
the charge and velocity of the particles. This varia- 
tion in ion distribution within a cell is an important 
factor in the induction of biological damage. See LIN- 
EAR ENERGY TRANSFER (BIOLOGY). 

Radiation hazard and exposure risks are expressed 
in several interrelated units used to define radiation 
levels and biological effects due to exposure. For ex- 
ample, a rad of radiation corresponds to the absorp- 
tion of 100 ergs of energy per gram of any material 
(10~? joule per kilogram, or 10~? gray). However, 
a unified theoretical understanding of the radiation 
quality of cosmic radiation has not been achieved. 
See RADIATION BIOLOGY; UNITS OF MEASUREMENT. 

The major radiation hazards for living systems oc- 
curring during exploration-class missions outside the 
Earth’s magnetosphere are due to solar-particle-event 
protons and to the HZE particles in galactic cosmic 
rays. These hazards are complicated by the fact that 


there may be a much lower probability of observing 
the biological effects of HZE particles (and, to some 
extent, of protons and helium nuclei) than of lightly 
ionizing radiation (x-rays, gamma rays). 

The biological effects of radiation depend on com- 
plex energy transduction factors (including energy 
deposition or dose, dose rate, ionization density, 
radiochemical intermediaries, molecular biology le- 
sions, and cellular response mechanisms and their 
regulation and expression in the intact organism). 
The difference in biological effects due to x-rays and 
gamma rays on the one hand and heavy charged par- 
ticles on the other hand are largely derived from 
the much denser local pattern of ionization around 
charged-particle trajectories. For lightly ionizing ra- 
diation (x-rays and gamma rays), effects are mainly 
dependent on dose and dose rate. For HZE particles, 
the concept of dose is inadequate; in particular, the 
density of energy deposited around a single track is 
high enough that there is no such thing as a low dose, 
even for a single particle. See RADIATION INJURY 
(BIOLOGY). 

Measurement. Measurements of radiation on 
board spacecraft are performed with a large variety 
of detectors. Dosimeters measuring the average 
energy deposited by radiation are used both inside 
and outside spacecraft in order to understand 
the way in which spacecraft materials modify the 
radiation field. Materials suffering permanent (or 
long-term) changes in their physical properties due 
to radiation are used as so-called passive dosimeters. 
Examples are film emulsions, certain plastics, and 
thermoluminescent materials. Active dosimeters 
are radiation detectors whose response to radiation 
can be monitored electronically as it occurs. These 
detectors are used to provide ongoing warnings of 
radiation hazards during space flight activities. See 
DOSIMETER. 

Since the biological effects of HZE particles are 
strongly dependent on the particle type and energy 
(which result in variations in the linear energy trans- 
fer), instruments to detect these particles need to 
identify their charge and energy. Such instruments 
are called particle identification spectrometers. They 
generally consist of stacked semiconductor detectors 
in conjunction with some velocity-sensitive detector. 
The characteristic signature identifying the charge of 
the particle is given by its pattern of energy loss as it 
traverses the detector stack; for particles stopping in 
the stack, the total energy deposition is also a mea- 
sure of the particle energy. See JUNCTION DETECTOR. 

A simplified approximation of particles and their 
spectra is obtained by use of a low-pressure, gas-filled 
ionization chamber of precisely defined geometry de- 
signed to have the same electron density as the tissue 
of humans. The ionization produced in the chamber 
is proportional to the energy loss of the incident ra- 
diation. Theoretical models are then used to inter- 
pret these measurements in terms of the biological 
impact of the radiation. See IONIZATION CHAMBER; 
PARTICLE DETECTOR. 

Ground-based experiments. Ground-based experi- 
ments provide the means to study the effects of 
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individual components of space radiation in a 1- 
g environment uncomplicated by all of the other 
physical factors of space flight, including micrograv- 
ity. The availability of a ground-based accelerator fa- 
cility to simulate galactic cosmic-ray irradiation by 
means of accelerated beams of HZE nuclei is essential 
for the theoretical understanding of the phenomena 
involved. Ground-based and space-flight studies of 
HZE effects have revealed that radiobiological mech- 
anisms of densely ionizing radiation are qualitatively 
different from those of sparsely ionizing radiation. 
Ground-based experiments have shown that x-ray ir- 
radiation followed by heavy ion exposure is far more 
damaging to an organism than irradiation in the re- 
verse order. See PARTICLE ACCELERATOR. 

Experiments in space. The HZE-particle compo- 
nent of galactic cosmic radiation was discovered in 
1948, and in 1952 it was predicted that humans 
exposed to these particles could experience a vi- 
sual light flash sensation. The crews of Apollo 1 and 
subsequent missions reported experiencing a visual 
light flash. Microscopic examination of the eyes of 
rats flown on space flights indicates that light flashes 
and eye damage result from cosmic rays passing 
through the eye retina. The particles stimulate the 
photoreceptors of the eye as they pass through the 
tissue and are thus perceived as light. Even more im- 
portant to the understanding of HZE particle effects, 
however, is the finding that HZE particles produce 
channellike or tunnel lesions. Ground-based accel- 
erator experiments show that each tunnel lesion is 
produced by a single HZE particle. 

Packaged series of monolayers of bacterial spores, 
plant seeds, animal eggs, and lower animals interlay- 
ered with physical track detectors permit evaluation 
of individual tracks, and allow identification of each 
penetrating particle and determination of its possi- 
ble effects on biological matter in its path. It is found 
that the very high concentration of absorbed energy 
produced by a single HZE particle of cosmic radi- 
ation can have serious biological effects. Mutations 
are produced in corn seeds, but in general bacterial 
spores and seeds are highly resistant. The eggs of 
the brine shrimp Artemia salina are extremely sen- 
sitive. An egg that survives a hit by an HZE particle 
demonstrates abnormal growth and behavior when 
it continues development. Two species of bacteria 
show reduced viability and antibiotic sensitivity that 
is believed to be due to defective DNA repair mech- 
anisms. Space radiation also results in an increase 
in sex-chromosome-linked mutations in the fruit fly, 
Drosophila melanogaster. Radiobiological models 
derived from ground-based models fall significantly 
short of reproducing the observed space effects. See 
MUTATION. 

Both synergistic and antagonistic relationships are 
observed between space radiation and the micro- 
gravity environment. However, embryonic develop- 
ment is more sensitive to HZE particle damage under 
microgravity conditions than inside a centrifuge sim- 
ulating the acceleration of gravity at the Earth’s sur- 
face (1 g). When the eggs of the stick insect, Ca- 
rausius morosus, are exposed in space both at 1 g 


and in a weightless environment, statistically signif- 
icant differences in development are observed be- 
tween them. These results indicate that microgravity 
affects biological processes directly at the cell level, 
and that these changes in turn modify or interfere 
with the expression of the effects of the various com- 
ponents of space radiation. It has been proposed that 
microgravity may affect cell repair and recovery pro- 
cesses, as well as cytoskeleton and membrane func- 
tions, thereby making cells more sensitive to radia- 
tion damage. Charles Edwin Wade 
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The net electric charge within a given volume. If 
both positive and negative charges are present, the 
space charge represents the excess of the total posi- 
tive charge diffused through the volume in question 
over the total negative charge. Since electric field 
lines end on electric charge, the space-charge den- 
sity p may also be defined in terms of the divergence 
of the electric field E or the laplacian of the electric 
potential V by Eq. (1) (Poisson’s equation). Here € is 
£ —div E = V?’ V 
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the dielectric constant of the medium and x, y, and 
z are rectangular coordinates defining the position 
of a point in space. If, under the influence of an ap- 
plied field, the charge carriers acquire a drift velocity 
v, the space charge becomes j/v, where j is the cur- 
rent density. For current carried by both positive and 
negative carriers, such as positive ions and electrons, 
the space charge density is given by Eq. (2). Here the 


pat 
VL Us 
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subscripts + and — indicate the current density and 


drift velocity for the positive and negative carriers, 
respectively. Thus a relatively small current of slow- 
moving positive ions can neutralize the space charge 
of a much larger current of high-velocity electrons. 
See CHILD-LANGMUIR LAW. Edward G. Ramberg 
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Communications between a vehicle in outer space 
and Earth, using high-frequency electromagnetic 
radiation (radio waves). Provision for such commu- 
nication is an essential requirement of any space mis- 
sion. The total communication system ordinarily in- 
cludes (1) command, the transmission of instructions 
to the spacecraft; (2) telemetry, the transmission of 
scientific and applications data from the spacecraft 
to Earth; and (3) tracking, the determination of the 
distance (range) from Earth to the spacecraft and its 
radial velocity (range-rate) toward or away from Earth 
by the measurement of the round-trip radio trans- 
mission time and Doppler frequency shift (magni- 
tude and direction). A specialized but commercially 
important application, which is excluded from con- 
sideration here, is the communications satellite sys- 
tem in which the spacecraft serves solely as a relay 
station between remote points on Earth. See COM- 
MUNICATIONS SATELLITE; DOPPLER EFFECT; MILITARY 
SATELLITES; SATELLITE (SPACECRAFT); SCIENTIFIC AND 
APPLICATIONS SATELLITES; SPACE FLIGHT; SPACE NAV- 
IGATION AND GUIDANCE; SPACE PROBE; SPACECRAFT 
GROUND INSTRUMENTATION; TELEMETERING. 

General features. Certain characteristic constraints 
distinguish space communication systems from their 
terrestrial counterparts. Although only line-of-sight 
propagation is required, both the transmitter and 
the receiver are usually in motion. The movement 
of satellites relative to the rotating Earth, for exam- 
ple, requires geographically dispersed Earth stations 
to achieve adequate communication with the space- 
craft on each orbit. The signal power received is ex- 
pressed by 


x 2 
P, = KP;G;,G; | — 
on (+) 


where P, = received signal power, P, = transmit- 
ter power output; G, = transmitter antenna gain; G, 
= receiver antenna gain; K = a factor to allow for 
system losses, such as cable and connector losses, 
or losses due to antenna pointing error; R = range 
(the distance between transmitter and receiver); and 
à = wavelength of the transmitted signal in the same 
units as the range. 

Because enormous distances are involved (over a 
billion miles to the planets beyond Jupiter), the signal 
received on Earth from deep-space probes is so small 
that local interference, both artificial and natural, has 
to be drastically reduced. For this purpose, the trans- 
mitted frequency has to be sufficiently high, in the 
gigahertz range, to reduce noise originating in the 
Milky Way Galaxy (galactic noise background). The 
receiver site must be remote from technologically ad- 
vanced population centers to reduce artificial noise, 
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Fig. 1. Goldstone 210-ft (64-m) antenna. (Jet Propulsion Laboratory) 


and at a dry location such as Goldstone, California 
(Fig. 1), or in the Australian desert, to avoid pre- 
cipitation attenuation of the radio signal as well as 
the higher antenna thermal noise associated with 
higher atmospheric absolute humidity and relatively 
warm cloud droplets. The receiving antennas must 
be steerable and large, typically 85 ft (26 m) or at 
times 210 ft (64 m) in diameter, to enhance the re- 
ceived signal strength relative to the galactic noise 
background. Special low-noise preamplifiers such as 
cooled masers are mounted on the Earth receiver an- 
tenna feed to reduce the receiver input thermal noise 
background. Sophisticated digital data processing is 
required, and the ground-receiver complex includes 
large high-speed computers and associated process- 
ing equipment. See MASER; PREAMPLIFIER; RADIO RE- 
CEIVER; RADIO TELESCOPE. 

The spacecraft communications equipment is con- 
strained by severe power, weight, and space lim- 
itations. Typical communications equipment mass 
ranges from 25 to 220 Ib (12 to 100 kg). Another 
major challenge is reliability, since the equipment 
must operate for years, sometimes for decades, unat- 
tended, in the difficult radiation, vacuum, and ther- 
mal environment of space. Highly reliable compo- 
nents and equipment have been developed, and 
redundancy is employed to eliminate almost all 
single-point failures. For example, it is not unusual 
to have as many as three redundant command re- 
ceivers operating continuously, because without at 
least one such receiver in operation no command can 
get through, including a command to switch from a 
failed command receiver to a backup radio. Power 
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can be saved by putting some or all of the redundant 
radios on timers, and to switch to a backup receiver 
if no commands have been received through the pri- 
mary receiver within a predetermined interval; but 
the saved power may come at the cost of a possible 
delay in emergency response initiation. 

Spacecraft power is always at a premium, and 
other techniques must also be used to minimize its 
consumption by the communication system. The 
transmitter is a major power consumer, so its effi- 
ciency must be maximized. All aspects of data trans- 
mission must contribute to error-free (very low bit 
error rate) reproduction of the telemetry data using 
no more power or bandwidth than is absolutely es- 
sential. Pulse-code modulation is a common tech- 
nique which helps meet this goal. In general terms, 
space communication systems are far less forgiving 
than terrestrial systems and must be designed, con- 
structed, and tested to much higher standards. See 
PULSE MODULATION; SPACE POWER SYSTEMS; SPACE 
TECHNOLOGY. 

Before describing several typical space communi- 
cation systems, a brief examination of some impor- 
tant subsystems is in order. 

Data sources. The kinds of data that must be trans- 
mitted by the space communication system vary, but 
certain basic types (internal spacecraft environment, 
scientific, and applications) may be distinguished. 

Internal spacecraft environment (housekeeping) data. On- 
board sensors provide information from selected 
locations about a number of functions of internal 
spacecraft behavior, such as switch position, temper- 
ature, pressure, acceleration, voltage, and current. 
Earth-based monitoring of such data is essential to 
detect and correct or bypass malfunctions as well 
as to evaluate trends such as battery, solar cell, and 
thermal coating degradation. 

Scientific data. These data contain the information 
needed to accomplish the objectives of the mission 
being flown. For the most part, they originate in sen- 
sors of electromagnetic radiation at wavelengths as 
long as radio waves, ranging to photons as energetic 
as primary cosmic rays. The sources of this radia- 
tion vary from violent astrophysical events which 
occurred in deep space as long as 10 billion years 
ago, to solar flare outbursts whose timing and dura- 
tion over the 23-year solar magnetic-field (sunspot) 
cycle are still very difficult to understand and to pre- 
dict. Many of these data are collected and transmit- 
ted to Earth in a raster-scan image similar to that used 
on computer and television displays. Like terrestrial 
television, the desired data bandwidths are often in 
the multimegahertz range. Deep-space probes, for 
which such unchallenged bandwidth needs would 
produce prohibitively massive and costly spacecraft, 
trade more transmission time for less bandwidth. 
Should the mission be a planetary flyby, the data 
can be acquired in a few hours, stored in solid- 
state broadband recorders in the probe, and trans- 
mitted relatively slowly over the following days or 
weeks. The ratio of the data collection time inter- 
val to the transmission time interval equals the ratio 
of the required transmission bandwidth to the orig- 


inal data collection bandwidth. Additional bit-error- 
rate reduction may be accomplished by the statisti- 
cal treatment of successive transmissions of the same 
image from the space probe. 

Another class of scientific investigations has to 
do with the immediate external environment of 
the spacecraft, particularly involving the measure- 
ment of the complex terrestrial and solar vector 
(magnitude and direction) magnetic fields and 
associated energetic plasma (ionized particles). In 
lower-altitude denser terrestrial and other planetary 
atmospheres, measurements of the neutral and ion- 
ized atmospheric composition, pressure, tempera- 
ture, and density are also valuable. Such investiga- 
tions require telemetry bandwidths in the range of 
tens of kilohertz, substantially less than that which 
is required for images. 

Applications data. A third category includes data 
which are supplied regularly as a service to users 
of images of Earth’s land and water surfaces (Earth 
resources), and of the atmosphere (weather and cli- 
matology). The former use is exemplified by Landsat 
7 images in the 0.45-12.50-micrometer bands, and 
the latter by the National Oceanic and Atmospheric 
Administration (NOAA) polar and Geostationary Op- 
erational Environmental Satellite (GOES) images in 
both visible and infrared spectral bands. Such images 
again use telemetry bandwidths in the multimega- 
hertz range. Another applications data type is infor- 
mation transmitted from data collection platforms 
on Earth and relayed to ground processing stations 
via telemetry. Here the data bandwidth is in the kilo- 
hertz range. In general, applications data are part of 
a guaranteed service; that is, if an operational space- 
craft fails, another already has been or shortly will be 
placed in orbit to continue the service. 

Antennas. Antennas for space communication 
may be classified as ground or spacecraft antennas 
according to their function. 

Ground antennas. The use of paraboloidal (dish) re- 
flectors for large-aperture antennas is practically uni- 
versal. As the size increases, so do cost and struc- 
tural problems, in step with the volume which is 
proportional to the aperture diameter cubed. The 
signal collection area increases only as the diameter 
squared. The magnitude of reflector surface irregu- 
larities sets an upper limit to the usable frequency. 
Standard reflector diameters in the National Aeronat- 
ics and Space Agency (NASA) deep-space network 
are 85 ft (26 m) and 210 ft (64 m). The feed system has 
to be carefully designed for optimum performance. 
For large antennas a common and efficient feed sys- 
tem uses the Cassegrain configuration in which a hy- 
perboloidal reflector is interposed between the focal 
point of the main antenna and its reflecting surface 
(Fig. 1). Its main advantage is that the feed horn and 
associated microwave receiving equipment are near 
the surface of the paraboloid rather than at its focus, 
which results in critical operational advantages and 
higher efficiencies. 

Microwave antennas for near-Earth missions usu- 
ally have smaller diameters, typically 40 or 30 ft (12 
or 9 m). Smaller antennas are used for this purpose 


because they cost considerably less, and larger sys- 
tems are not needed because the range is so much 
smaller. For the reception of lower-frequency VHF 
(very high frequency) signals from near-Earth space- 
craft, an array of crossed-dipole Yagi antennas is often 
used. See ANTENNA (ELECTROMAGNETISM); YAGI-UDA 
ANTENNA. 

Spacecraft antennas. An important requirement for a 
spacecraft antenna is high reliability in the space en- 
vironment, since it is usually a component in line to 
a single-point failure of the communication system. 
For deep-space missions, spacecraft microwave an- 
tennas have diameters up to 16 ft ( m), depend- 
ing upon transmitter power, data rate, and range in- 
volved. Near-Earth spacecraft use both high-gain and 
omnidirectional antennas to transmit telemetry data, 
the particular design depending mainly upon data 
rates, type of orbit, and the receiving system being 
employed. Nearly all spacecraft use omnidirectional 
antennas for receiving commands and transmitting 
low-data-rate housekeeping information. With this 
mode of operation, if a partial system failure occurs, 
the problem can be analyzed and corrective actions 
taken via the command system to keep the spacecraft 
in operation. 

Modulation techniques. Pulse-code modulation 
(PCM), in which the data are time-multiplexed, sam- 
pled, digitized, and transmitted as a binary stream 
of pulses grouped into code words, is employed 
almost universally for transmitting scientific data 
and command signals. This information is usually 
phase-modulated on a very stable transmitter carrier 
to optimize system performance. Techniques such 
as block coding or convolutional coding are often 
used to improve the system sensitivity by up to 6 dB 
(a factor of 4 in power). These techniques require 
more bandwidth for a given data rate and additional 
complexity of signal-processing equipment. 

Spacecraft transponders. The major function of a 
spacecraft transponder is to provide a means of mea- 
suring range and range rate of a spacecraft by a 
ground station. This information is then used to com- 
pute spacecraft orbits and trajectories. The basic 
principle of operation is to use a phase-locked loop 
(PLL) to lock an oscillator coherently to the carrier 
of the received signal by an exact ratio (typically 
240:221 for NASA S-band systems). Comparison of 
the transmitted and received signals at the ground 
station provides a coherent two-way Doppler-shift 
measurement which translates directly into range 
rate. A ranging signal is modulated on the ground 
transmitter carrier and returned to the ground 
station via the transponder carrier; the time delay be- 
tween transmission and reception on the ground pro- 
vides a means of computing the range of the space- 
craft. Ranging modulation is usually in the form of 
pseudonoise (PN) PCM codes or coherently related 
subcarriers. By suitable multiplexing, the transpon- 
der also performs the functions of a command re- 
ceiver and a telemetry transmitter. See DOPPLER 
EFFECT; PHASE-LOCKED LOOPS. 

Tracking and Data Relay Satellite System. The Track- 
ing and Data Relay Satellite System (TDRSS) con- 
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sists of a series of geostationary spacecraft and an 
Earth terminal located at White Sands, New Mex- 
ico. The purpose of TDRSS is to provide telecommu- 
nication services between low-Earth-orbiting (LEO) 
user spacecraft and user control centers (Fig. 2). 
A principal advantage of the system is the elimina- 
tion of the need for many of the worldwide ground 
stations for tracking such spacecraft. The Tracking 
and Data Relay Satellite (TDRS) provides no pro- 
cessing of data; rather, it translates received signals 
in frequency and retransmits them. User orbits are 
calculated from range and range-rate data obtained 
through the TDRS by using transponders on the user 
spacecraft. 

The first TDRS was launched in 1983, and the first- 
generation spacecraft will near the end of their oper- 
ational life early in the first decade of the new millen- 
nium. The upgraded second-generation replacement 
series was initiated in 2000 with the launch of TDRS 
H (the eighth of the series). 

Forward and return link data are transmitted 
over three basic services in the second-generation 
TDRSS. 

Multiple access. The multiple-access (MA) service 
provides simultaneous return-link service from as 
many as 20 LEO user spacecraft, with data rates 
up to 3 megabits per second (Mbps) for each user. 
Forward-link service to the user spacecraft is time- 
shared with a maximum data rate of 300 kilobits 
per second (kbps), one user at a time. The multiple- 
access service is provided at S-band frequencies of 
2106.4 MHz for the forward link and 2287.5 MHz 
for the return link. Multiple-access users operat- 
ing on the same frequency are discriminated by 
unique pseudonoise codes and antenna beam point- 
ing. Multiple-access antennas on each TDRS are adap- 
tive arrays in which signals received by each antenna 
element are transmitted to White Sands on separate 
carriers. Ground-based processing provides the abil- 
ity to form multiple antenna beams simultaneously. 

S-band single access. The S-band single-access (SSA) 
service provides return-link data rates up to 6 Mbps 
for each user spacecraft. The forward S-band access 
link provides data at 300 kbps. Carrier frequencies 
are assigned in the bands from 2025 to 2120 MHz 
for the forward link and 2200 to 2300 MHz for the 
return link. 

Ka- and Ku-band single access. The K-band single-access 
(KSA) service provides wide-band forward and re- 
turn access links. Both forward KSA links provide 
data at 25 Mbps. The return-link data rates are up 
to 300 and 800 Mbps for the Ku and Ka bands, re- 
spectively. The Ku forward link carrier operates at 
13.775 GHz, the return link at 15.0034 GHz. Cor- 
responding Ka assignments are 22-23 and 25- 
27.5 GHz. 

Complete systems. Two examples—the automated 
Landsat 7 and the crewed space shuttle—illustrate 
TDRSS service to modern space systems. 

Landsat 7. This applications spacecraft is designed 
to monitor the resources and environment of Earth 
and supply these data to users worldwide. The 
U.S. Geological Service (USGS) and NASA share 
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Fig. 2. Tracking and Data Relay Satellite System. (Goddard Space Flight Center) 


the responsibility for Landsat 7 operations, data 
processing, archiving, and distribution. The Earth- 
observing sensor system on board Landsat 7 is 
the Enhanced Thematic Mapper Plus (ETM+), op- 
timized for global change studies, land-cover mon- 
itoring and assessment, and large-area monitoring. 
Seven Earth-observing spectral bands are provided 
at wave-lengths between 0.45 and 12.50 um, as well 
as a panchromatic band. Applications of Landsat 
7 data include crop and forestry monitoring and 
resource mapping, land-use mapping, geological fea- 
ture mapping, surface water coverage, coastal re- 
source mapping, and environmental change moni- 
toring. See AGRICULTURE; REMOTE SENSING. 

Landsat 7 is controlled by the Mission Operations 
Center (MOC) at the NASA Goddard Space Flight 
Center. Communication with this satellite may be 
divided into two parts: the wideband user data gen- 
erated by the ETM+ sensor, and the narrow-band 
housekeeping telemetry and command traffic. 

Landsat 7 has the capability to transmit wideband 
ETM+ data to as many as three ground stations simul- 
taneously. A ground network receives these data via 
an X-band direct downlink at a data rate of 150 Mbps. 
The primary receiving station is at the USGS Data 
Center in Sioux Falls, South Dakota. Processing of the 
image data is done after receipt at the data center by 
the Landsat 7 processing system (LPS). The image 
data may be transmitted as it is collected, in real time, 
or it may be stored on an onboard solid-state recorder 
(SSR) for delayed playback to the ground. 


The X-band wideband data system transfer is done 
by the use of gimbaled X-band antennas (GXA) 
[Fig. 3]. Three frequencies are employed: 8082.5, 
8212.5, and 8342.5 MHz. Quadrature phase-shift 
keying (QPSK) is used, permitting division of each 
150-Mbps link into two 75-Mbps data streams, one 
of which is modulated on the in-phase Œ) channel 
and the other on the quadrature (Q) channel. Each 
QPSK modulator drives a solid-state power amplifier 
(SSPA) which transmits the amplified modulated car- 
rier at the appropriate frequency to the ground sta- 
tion through the designated GXA using right-hand 
circular polarization (RHCP). 

Telemetry and commanding of Landsat 7 is done 
through two S-band transponders which operate 
continuously in their receive mode. Communica- 
tions with Landsat 7 can be done either directly 
from ground stations of the NASA Spaceflight Track- 
ing and Data Network (STDN) or through the 
TDRSS. Each transponder is equipped with a trans- 
mit/receive S-band omnidirectional antenna which is 
left-hand circular-polarized (LHCP). Each receiver in 
these S-band transponders simultaneously searches 
for TDRSS pseudonoise code correlation and STDN 
continuous-wave (CW) signals greater than the de- 
tection threshold. Upon acquisition of either com- 
mand signal type, the other mode is inhibited until 
loss of the initial signal and a subsequent return to 
the search mode occurs. 

The S-band telemetry downlink has three data 
rates: 1.216, 4.864, and 256.0 kbps. This downlink 
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Fig. 3. Landsat 7 satellite. (Goddard Space Flight Center) 


is also used for two-way Doppler tracking. The nom- 
inal output frequency is 2287.5 MHz. The teleme- 
try data are phase shift-key (PSK) modulated onto 
a 1.024-MHz subcarrier. When the recorded data 
playback is commanded, the 256-kbps SSR data are 
phase-modulated onto the baseband channel. The 
PSK subcarrier is summed with the baseband chan- 
nel data, and this composite signal phase-modulates 
the transponder transmitter. Command uplinking to 
the user spacecraft is done through the S-band sys- 
tem at a data rate of 2.0 kbps. 

Space shuttle. The space shuttle is a crewed space- 
craft system which is used to provide a heavy-lift 
reusable space transportation capability to low Earth 
orbit. Since 2000 its primary use has been to sup- 
port the construction, continued occupation, and 
long-term microgravity research aboard the Interna- 
tional Space Station USS). The launch configuration 
consists of the orbiter, external tank, and solid rocket 
boosters (SRBs). Shortly after launch, the SRBs are 
separated from the space shuttle and are recovered 
by oceangoing tugs for reuse on later flights. The 
external tank is jettisoned when the main orbiter en- 
gines are shut down, and the orbital maneuvering 
system (OMS) engines provide the principal orbiter 
propulsion for the rest of the mission. The space 
shuttle carries up to seven crew and mission special- 
ists, and each orbiter is designed to be reused for up 
to 100 missions. It carries a mass of up to 65,000 Ib 
(29,000 kg) of payload in the cargo bay (Fig. 4 on 
ascent and up to 30,000 Ib (14,000 kg) on landing. 
Major cargo includes logistics support items to and 
from the /SS; massive modules to the ISS during its 
construction, such as its robotic arm; and automated 
spacecraft for launch or recovery from orbit for re- 
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furbishment on Farth and possible reuse. See SPACE 
STATION. 

The Orbiter Communications and Tracking Sys- 
tem operates with the NASA ground-based STDN, 
the TDRSS, the U.S. Air Force Satellite Control Fa- 
cility (SCF), the JSS, and crew members performing 
extravehicular activities. The shuttle uses UHF-, S-, 
and Ku-band frequencies. 

The Ku-band system is designed to operate as a 
radar during a space rendezvous and, when not so 
employed, as a two-way link with the TDRSS. The Ku- 
band antenna is a 3-ft (0.9-m) monopulse paraboloid 
tracking antenna with a gain of 38 dB and a beam 
width of 0.36°. This antenna is deployed after the or- 
biter is in space and the cargo bay doors are opened. 
The transmitter uses a 50-W traveling-wave tube am- 
plifier. The Ku-band data from the orbiter occupy a 
192-kbps channel for voice and operations data, and 
a channel of up to 50 Mbps for science data. On 
command, the 50-Mbps channel can be replaced by 
a 4-MHz analog channel for television transmission. 
Television is used for monitoring onboard activities, 
for monitoring the cargo bay area, and by astronauts 
in extravehicular activities. See TRAVELING-WAVE 
TUBE. 

The S-band communication system is composed 
of two independent subsystems: the network subsys- 
tem and the payload subsystem. The network subsys- 
tem provides tracking and two-way communication 
direct to ground stations or through TDRSS. The pay- 
load subsystem provides two-way communication 
with automated orbiting spacecraft in the vicinity 
of the orbiter. All S-band antennas are low-gain and 
flush-mounted. Locations are chosen to favor the de- 
sired direction of coverage. The flush antennas are 
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Fig. 4. Cutaway view of the space shuttle orbiter. (NASA) 


overlaid with thermal protective material to enable 
them to survive the heat of entry. 

The network subsystem is designed to be com- 
patible with both the NASA and the Department 
of Defense (DOD) networks. The NASA-compatible 
transponders use a transmit-to-receive frequency 
ratio of 240:221 and are phase-modulated. When 
transmitting to NASA ground stations, a power out- 
put of 2 W is used. Transmissions to the TDRS use 
100 W, and the data are convolutionally encoded 
to increase the link margins. The DOD-compatible 
transponders use a frequency ratio of 256:205 and 
transmit to the SCF with a power of 2 W. See PHASE 
MODULATION. 

These transponders provide the capability of two 
digitally encoded voice channels plus a command 
channel to the orbiter, and a low-data-rate teleme- 
try and two digital voice return channels. A special- 
purpose frequency-modulated (FM) transmitter 
operates on a frequency of 2250 MHz and has a mini- 
mum power output of 10 W. This transmitter is used 
primarily for television video data transmission to 
ground stations. See FREQUENCY MODULATION. 

The S-band payload subsystem operates with a 
large variety of spacecraft communication systems. 
It is used during the release and recovery of space- 
craft by the orbiter and is effective at ranges from a 
few feet to over a mile. The maximum power output 
is 2 W, with the ability to transmit at reduced power 
levels when the spacecraft is nearby. 

The UHF communication system is used during ex- 
travehicular activities by astronauts and operates at 
frequencies of 260 and 297 MHz. These transmitters 
are amplitude-modulated (AM), with direct modula- 
tion of voice plus a 5.4-kHz subcarrier of biomedi- 
cal data. Two orbiter ultrahigh-frequency (UHF) an- 
tennas are used, on the inside of the air lock and 





on the lower portion of the orbiter. See AMPLITUDE 
MODULATION; SPACE SHUTTLE. 
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Say 
Space flight 

The penetration by humans into the reaches of the 
universe above the terrestrial atmosphere, and in- 
vestigation of these regions by automated, remote- 
controlled and crewed vehicles. 

The purpose of space flight is to provide signifi- 
cant contributions to the physical and mental needs 
of humanity on a national and global basis. Such con- 
tributions fall specifically in the areas of (1) Earth re- 
sources of food, forestry, atmospheric environment, 
energy, minerals, water, and marine life; (2) Earth 
and space sciences for research; (3) commercial ma- 
terials processing, manufacturing in space, and pub- 
lic services. More general goals of space flight in- 
clude expansion of knowledge; exploration of the 
unknown, providing a driving force for technol- 
ogy advancement and, hence, improved Earth-based 


productivity; development and occupation of new 
frontiers with access to extraterrestrial resources and 
unlimited energy; strengthening of national prestige, 
self-esteem, and security; and providing opportunity 
for international cooperation and understanding. 

This article focuses on crewed space flight. For 
discussion of other missions see APPLICATION SATEL- 
LITES; COMMUNICATIONS SATELLITE; METEOROLOG- 
ICAL SATELLITES; MILITARY SATELLITES; SATELLITE 
(SPACECRAFT); SATELLITE NAVIGATION SYSTEMS; SCI- 
ENTIFIC SATELLITES; SPACE PROBE. 

To conduct crewed space flight, the two leading 
space-faring nations, the United States and Russia, 
formerly the Soviet Union, have developed space- 
craft systems and the necessary ground facilities, 
research and development base, operational know- 
how, planning experience, and management skills. 
In the United States, crewed space programs are con- 
ducted by the National Aeronautics and Space Ad- 
ministration (NASA), a federal agency established in 
1958 for the peaceful exploration of space. In Russia, 
crewed space flights have been under the auspices 
of the U.S.S.R. Academy of Sciences; they are now 
the responsibility of the Russian Space Agency 
(Rosaviakosmos, or RKA). The first spacecraft with 
a human on board, the Vostok 1, piloted by Yuri A. 
Gagarin, was launched on April 12, 1961, from the 
Baikonur Cosmodrome in Kazakhstan and returned 
after completing one revolution of the Earth. The 
first American space flight of a human took place 
3 weeks later, when NASA launched Alan B. Shepard 
on May 5 on the Mercury-Redstone 3 for a 15-min 
suborbital test flight. 

The early spacecraft of both nations were built 
for only one space flight. The first multiply reusable 
space vehicle, the space shuttle, was launched by 
the United States on April 12, 1981. By December 
31, 2006, 906 men and 109 women had flown a total 
of 254 missions. Of these, the United States alone, 
in a total of 148 missions, had put 668 men and 101 
women into space (some of them more than once). 
Up until permanent occupancy of space began on 
November 2, 2000, with the boarding of the Interna- 
tional Space Station USS) by its first resident crew, 
humans had spent a total of 11,008 days 3 hours 
in space (or about 519,525 crew-hours, accounting 
for crew sizes). By the beginning of permanent resi- 
dency, the United States had accumulated a total of 
1195 days in space (or about 149,174 crew-hours; 
Table 1). 

In November 1987, the 13 member nations of the 
European Space Agency (ESA) agreed to become the 
third major power engaging in human space flight. 
European efforts today focus on a two-pronged pro- 
gram: participation as a partner in the International 
Space Station, primarily with the Columbus Orbital 
Facility (COF), and extension of the Ariane family 
of expendable launch vehicles with the heavy-lift 
carrier Ariane 5. By joining the original partnership 
behind the space station and assigning astronauts 
to United States space shuttle missions (as did the 
European Space Agency and Russia), Canada and 
Japan have also entered the ranks of space-faring 


nations, while other countries, such as Brazil, are 
preparing to join. 


Crewed Spacecraft 


A crewed spacecraft is a vehicle capable of sustaining 
humans above the terrestrial atmosphere. In a more 
limited sense, the term “crewed spacecraft” is usually 
understood to apply to vehicles for transporting and 
sustaining human crews in space for time periods 
limited by prestored on-board supplies, as distinct 
from orbital space stations which support theoret- 
ically unlimited habitation of humans in space by 
autonomous systems, crewed maintenance, and pe- 
riodic resupply. 

Reentry. The basic requirements of crewed space- 
craft are quite different from those of crewless space 
probes and satellites. The presence of humans on 
board necessitates a life-sustaining environment and 
the means to return safely to Earth. The major com- 
mon feature of all crewed spacecraft, therefore, is 
the atmospheric return element or reentry module. 
It consists basically of a pressure-tight cabin for the 
protection, comfort, and assistance of the crew, sim- 
ilar to the cockpits of fighter aircraft, but shaped ex- 
ternally in accordance with the desired airflow and 
its forces, and surrounded by heat-resistant material, 
the thermal protection system (TPS), to cope with 
the high frictional and radiative heating accompany- 
ing the energy dissipation phase of the atmospheric 
return flight to Earth. 

To better deal with this heating, the design of the 
forward-facing portion of the reentry module is usu- 
ally relatively blunt and symmetrical. Temperatures 
on and around the stagnation point of the blunt face 
can reach 3000°F (1700°C) and more. When they are 
too high for metallic materials, a class of materials 
called ablators is used. Ablators can be composed of 
phenolic, silicone rubber, or epoxy compounds. At 
high temperatures, a great deal of heat is absorbed 
and carried away from the body by outgassing, char- 
ring, and ablating. The remaining heat influx can be 
dealt with by the insulating properties of the thermal 
protection system to keep the backface temperatures 
within the limits of conventional structural materials. 
See NOSE CONE. 

Ballistic reentry. A ballistic reentry, flown with a 
blunt, symmetrical heat shield, is the simplest, most 
direct mode of return from space, but it also in- 
curs the highest heating on the thermal protection 
system and the heaviest deceleration loads on the 
crew since the orbital energy has to be dissipated in 
the short time of the plunge. This mode also allows 
no further control of the landing point once the at- 
mosphere has been entered. The first generation of 
spacecraft of both the United States and the Soviet 
Union was designed for ballistic reentry and landing 
by parachute. While the United States since 1981 has 
gone on to lifting reentry for human space flight, Rus- 
sia still depends on ballistic reentry with the Soyuz 
spacecraft for crewed missions and the automated 
Progress drones for logistics flights. 

Lifting reentry. To provide more flexibility to the 
spacecraft for selecting its touchdown point and to 
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TABLE 1. Crewed space flights 1961-2006 





A. Weight 
B. Revolutions 











Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
Apr. 12, 1961 Vostok 1 10,419 Ib 1 h 48 min First person in space, conducted radio and TV communication 
Yuri A. Gagarin 1 rev with Earth. Spacecraft contained life-support systems, telemetry, 
203 mi ejection seat, and recovery system. Cosmonaut and spacecraft 
landed at preselected area. Automatic control with cosmonaut 
available as backup. Two-gas air supply at sea-level pressure. 
May 5, 1961 Freedom 7 2855 Ib 15 min Mercury spacecraft launched in ballistic suborbital trajectory by 
Alan B. Shepard, Jr. — Redstone booster (Mercury-Redstone 3). Downrange distance 
116 mi 302 mi. First United States person in space; weightless for 5 min 
with no ill effects. Astronaut exercised manual control of spacecraft. 
July 2-21, 1961 Liberty Bell 7 2836 Ib 15 min Mercury spacecraft launched into ballistic trajectory by Redstone 
Virgil |. Grissom — booster (Mercury-Redstone 4). Downrange distance 303 mi. 
118 mi Weightless for 5 min with no ill effects. Hatch opened prematurely 
during recovery, spacecraft filled with water and sank in 2500 
fathoms in Atlantic Ocean. Astronaut was recovered. 
Aug. 6-7, 1961 Vostok 2 10,432 Ib 25h11 min First test of prolonged weightlessness; cosmonaut ate, worked, and 
Gherman S. Titov 16 rev slept in space. Monitored by TV and radio. Vestibular disturbances 
152 mi produced motion sickness but apparently no significant aftereffects. 
Feb. 20, 1962 Friendship 7 2987 Ib 4h 55 min Mercury spacecraft launched into orbit by Atlas booster (Mercury- 
John H. Glenn, Jr. 3 rev Atlas 6). First United States crewed orbital flight, achieving original 
162 mi Project Mercury objectives of placing a human in orbit, observing 
reactions to space environment, and making a safe recovery. No 
adverse physiological effects from the 4.5 h of weightlessness. 
May 24, 1962 Aurora 7 2975 Ib 4h 56 min Mercury spacecraft launched into orbit by Atlas booster (Vercury- 
M. Scott Carpenter 3 rev Atlas 7). Flight plan similar to Friendship 7. Manual attitude control 
167 mi exercised during retrofire. Landed 250 mi beyond predicted impact 
point. 
Aug. 11-15, 1962 Vostok 3 10,412 Ib 94 h 22 min Launched as first half of tandem mission with Vostok 4. Radiation 
Andrian G. Nikolayev 60 rev measurements taken and effects of biological specimens studied. 
146 mi Extensive physiological measurements recorded during prolonged 
exposure to space environment with no adverse effects noted. TV 
monitoring, radio and TV contact with Vostok 4. Cosmonaut floated 
free of restraint for a total of 3.5 h, was ejected to land separately 
from spacecraft. 
Aug. 12-15, 1962 Vostok 4 10,425 Ib 70 h 57 min Launched from same facility as Vostok 3 into tandem flight. Passed 
Pavel R. Popovich 45 rev to within 4 mi of Vostok 3 at closest point. Extensive radiation and 
147 mi physiological experiments conducted. TV monitoring, radio and TV 
contact with Vostok 3. Cosmonaut floated free of restraint for a total 
of about 3 h, was ejected to land separately from spacecraft. 
Oct. 3, 1962 Sigma 7 3029 Ib 9h 13 min Mercury spacecraft launched into orbit by Atlas booster (Wercury- 
Walter M. Schirra, Jr. 5 rev Atlas 8). Automatic reentry mode utilized, landed 9000 yd from 
176 mi prime recovery ship for first recovery in Pacific Ocean. Experiments 
included camera, radiation sensors, and ablative samples on the 
neck of spacecraft. 
May 15-16, 1963 Faith 7 3033 Ib 34 h 20 min Mercury spacecraft launched into orbit by Atlas booster (Wercury- 
L. Gordon Cooper, Jr. 21 rev Atlas 9). Long-endurance mission with no adverse physiological ef- 
166 mi fects as astronaut ate, slept, and worked in the space environment. 
Reentry sequence was with astronaut orienting spacecraft and 
firing retrorockets manually. A very accurate landing was made in the 
Pacific Ocean 7000 yd from prime recovery ship. Experiments in- 
cluded infrared and standard photography, radiation measurement, 
ejection of flashing light (sighted in fifth and sixth orbits), and obser- 
vation of lights on the ground. 
June 14-19, 1963 Vostok 5 10,408 Ib 119h6 min New time and distance space-flight records set, last 45 revolutions 
Valery F. Bykovsky 76 rev in tandem with Vostok 6. Radio contact with Vostok 6. Direct TV 
146 mi broadcast to Earth in real time. Extensive medical and biological 
experiments conducted. 
June 16-19, 1963 Vostok 6 10,392 Ib 70h 50 min Orbited first woman and first person not trained as a pilot. Passed 
Valentina V. Tereshkova 45 rev within proximity of Vostok 5 on first orbit, in different orbital plane. 
143 mi Radio contact with Vostok 5. Direct TV broadcast to Earth in real 
time. Parachuted separately from spacecraft on landing. 
1lb=0.45kg.  mi=1.6km. 1f=0.3m. 1yd=09m. 1 fathom=1.8m. 1 Ib/in2 =6.9kPa. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 








A. Weight 
B. Revolutions 
Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
Oct. 12-13, 1964 Voskhod 1 11,731 Ib 24h 17 min First multicrewed space flight; first “shirt-sleeve” environment 
Vladimir M. Komarov 15 rev in space flight, no pressure suits; record altitude for crewed 
Konstantin P. Feoktistov 254 mi space flight. Feoktistov, scientist, and Yegorov, physician, as 
Boris B. Yegorov experimenters and observers in addition to Komarov as pilot. 
Both Feoktistov and Yegorov experienced motion sickness and 
disorientation illusions. Crew worked, ate, drank, and slept 
unrestrained. Crew remained in spacecraft through landing. 
Mar. 18-19, 1965 Voskhod 2 12,529 Ib 26 h 2 min New record altitude for crewed space flight. During second 
Pavel |. Belyayev 16 rev orbit Leonov took first space walk with autonomous life-support 
Aleksei A. Leonov 308 mi system, moved 17 ft from spacecraft on tether, 23 min in space 
environment, including 11 min in air lock through which he exited 
from spacecraft. While outside, Leonov was observed through the 
spacecraft TV camera. First Soviet landing sequence using manual 
system in lieu of automatic. Crew remained in spacecraft through 
landing. Planned landing point was overshot: landed in deep snow. 
Mar. 23, 1965 Gemini 3 7111 Ib 4h 53 min Gemini spacecraft launched into orbit by modified Titan 2 booster 
Virgil |. Grissom 3 rev (Gemini-Titan 3). First United States two-person space flight, 
John W. Young 140 mi Grissom first person in space for second time. One orbital 
maneuver was conducted in each of the three orbits. Manual 
control was exercised throughout the reentry phase, using the 
limited lifting characteristics of the spacecraft to steer toward 
touchdown. 
June 3-7, 1965 Gemini 4 7879 Ib 97h 56 min Gemini spacecraft launched into orbit by modified Titan 2 
James A. McDivitt 62 rev booster (Gemini-Titan 4). Endurance mission of 4 days to test 
Edward H. White II 184 mi crew and spacecraft in buildup to longer missions. Eleven 
scientific and engineering experiments, in addition to basic 
medical checks on 4 days of exposure to the space environment. 
White spent 22 min outside the spacecraft in EVA, propelling 
himself with an oxygen-jet gun. During space walk he was 
attached to the spacecraft with a 25-ft tether-and-oxygen line; 
he carried emergency oxygen and camera. Spacecraft computer 
malfunction necessitated a ballistic reentry sequence. 
Aug. 21-29, 1965 Gemini 5 7947 Ib 190 h 55 min Gemini spacecraft launched into orbit by modified Titan 2 
L. Gordon Cooper, Jr. 120 rev booster (Gemini-Titan 5). Endurance mission of 8 days confirmed 
Charles Conrad, Jr. 217 mi the physiological feasibility of Apollo lunar landing mission. 
First flight of fuel-cell electrical power system. On Aug. 23 
a simulated rendezvous was conducted with a series of four 
maneuvers through two orbits which raised the perigee to 124 mi 
and apogee to 192.6 mi. Five medical experiments measured 
physiological effects; extensive weather and terrain photography 
was conducted. Voice communication was conducted with Sea 
Lab 2 under the Pacific Ocean. 
Dec. 4-18, 1965 Gemini 7 8076 Ib 330 h 35 min Gemini spacecraft boosted into orbit by modified Titan 2 
Frank Borman 206 rev booster (Gemini-Titan 7). Endurance mission of 14 days, 
James A. Lovell, Jr. 204 mi longest scheduled flight in Gemini program. Station-keeping 
conducted with booster second stage after separation; spacecraft 
maneuvered to serve as rendezvous target for Gemini 6. Accurate 
controlled reentry was made, landing 7.6 mi from target. One 
astronaut at a time doffed his pressure suit to enhance comfort. 
No adverse physiological effects from 2 weeks in the space 
environment. Twenty experiments were incorporated into the 
mission, ranging from detailed medical measurements and sleep 
analysis to Earth photography and radiometry. 
Dec. 15-16, 1965 Gemini 6 7817 Ib 25h 51 min Gemini spacecraft launched into orbit by modified Titan 2 booster 
Walter M. Schirra, Jr. 16 rev (Gemini-Titan 6). Rendezvous with Gemini 7 as target. A series of 
Thomas P. Stafford 193 mi posigrade maneuvers began the rendezvous chase and shortened 


the initial 1200 mi separation distance to 730 mi. During second 
revolution the distance was reduced to 300 mi with another 
thruster firing, followed by a plane change to match Gemini 
7’s orbit. Orbit was adjusted to near circular at beginning of 
third revolution; as fourth revolution began, the terminal phase 
of rendezvous was initiated, resulting in approach to within 1 ft 
of Gemini 7. Station-keeping and fly-around maneuvers were 
conducted for 5 h 19 min. Controlled reentry to landing within 
13 mi of prime recovery ship. 





1 Ib = 0.45 kg. 1 mi = 1.6 km. 1 ft = 0.3 m. 1 yd = 0.9 m. 1 fathom = 1.8m. 1 Ib/in.2 = 6.9 kPa. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 





Dates launched 
and recovered 


Designation 
and crew 


A. Weight 


B. Revolutions 
C. Max. distance 


from Earth 


Duration 


Remarks 





Mar. 16, 1966 


June 3-6, 1966 


July 18-21, 1966 


Sept. 12-15, 1966 


Nov. 11-15, 1966 


Gemini 8 
Neil A. Armstrong 
David R. Scott 


Gemini 9a 
Thomas P. Stafford 
Eugene A. Cernan 


Gemini 10 
John W. Young 
Michael Collins 


Gemini 11 
Charles Conrad, Jr. 
Richard F. Gordon, Jr. 


Gemini 12 
James A. Lovell, Jr. 
Edwin E. Aldrin, Jr. 


8351 Ib 
62 rev 
186 mi 


8268 Ib 
45 rev 
194 mi 


8248 Ib 
43 rev 
476 mi 


8509 Ib 
44 rev 
853 mi 


8297 Ib 
59 rev 
187 mi 


10h 41 min 


75 h 21 min 


70 h 47 min 


71h17 min 


94 h 35 min 


Gemini spacecraft launched into orbit by modified Titan 2 booster, 
1h 41 min after Gemini-Agena target vehicle (GATV)- Atlas booster 
combination. Rendezvous with the GATV was accomplished in 
fourth orbit, 6 h after launch, and first docking of two vehicles in 
space was achieved 33 min later. After 28 min of flight in the docked 
configuration, a thruster in the orbital-attitude maneuvering system 
stuck open and the spacecraft began to spin. Gemini spacecraft 
was undocked from GATV and, after 25 min, was stabilized by use 
of reentry control system. Mission terminated early with landing in 
secondary recovery area in western Pacific. 


Mission utilizing augmented target docking adapter (ATDA) as the 
target was placed into orbit on June 1 with an Atlas booster. The 
Gemini was launched into orbit on June 3 by its modified Titan 
2 booster. Rendezvous with ATDA achieved on third orbit. Crew 
confirmed that shroud covering the docking collar had failed to 
separate, and docking was canceled. Two additional rendezvous 
were performed as planned, one using visual techniques, the last 
from above the ATDA. After 1 h of EVA, Cernan’s visor accumulated 
fog, and communications between Stafford and Cernan were poor. 
Planned use of the self-contained life-support and propulsion 
systems to maneuver was canceled. Total EVA was 2 h 7 min. 
Controlled reentry to 0.42 mi of target. 


GATV-Atlas booster combination launched 100 min before Gemini 
spacecraft launched into orbit by modified Titan 2 booster. 
Rendezvous and docking with GATV 10 accomplished in fourth 
orbit. GATV engine used to propel the docked combination to 
record altitude, then proper orbit to rendezvous with GATV 8. After 
separation from GATV 10, Gemini 10 effected rendezvous with 
the passive target. First period of standup EVA was terminated 
at 50 min when both crew members suffered eye irritation. Second 
EVA period, after rendezvous with passive GATV, consisted of 
Collins on umbilical moving to GATV and recovering micrometeorite 
experiment package. 


GATV-Atlas booster combination launched 1 h 37 min before 
Gemini spacecraft was launched into orbit by modified Titan 2 
booster within 2-s launch window. Rendezvous accomplished 
on first orbit using on-board information exclusively. Docking at 
1 h 34 min after launch. Docking accomplished twice by each 
crew member. During umbilical EVA, Gordon removed the nuclear- 
emulsion experiment package and attached tether between Gemini 
spacecraft and GATV. Excessive fatigue from these activities 
caused early termination of EVA at 33 min. Using GATV primary 
propulsion system, docked configuration was propelled to an 
altitude of 853 mi. Hatch opened third time for 2 h 10 min standup 
EVA, during which scheduled photography was accomplished. The 
two spacecraft undocked, and station-keeping by use of a 100-ft 
nylon tether was exercised with a slow spin rate imparted to the 
tethered combination. After separation of the tether, a second 
rendezvous with GATV was conducted from 25 mi separation. 
Reentry sequence was fully automatic, with impact 2.9 mi from 
the aiming point. 


GATV-Atlas booster combination launched 1 h 38 min before 
Gemini spacecraft was launched into orbit by modified Titan 2 
booster. Rendezvous was accomplished in third orbit with docking 
4 h 14 min after Gemini launch. Two standup EVA periods, on 
first and third days, totaled 3 h 34 min. On second day Aldrin 
conducted 2 h 6 min umbilical EVA, testing restraint devices to 
overcome body-positioning problems experienced on prior flights. 
Resting frequently, Aldrin used portable hand rails, foot restraints, 
and various tethers. He completed 19 tasks, demonstrating that 
useful EVA work was feasible with proper planning and restraining 
devices. Gemini and GATV spacecraft undocked, remaining 
joined by 100-ft tether for station-keeping and gravity-gradient 
stabilization experiment. Fourteen experiments were conducted. 
Automatic reentry sequence was used for second time. 





1 Ib = 0.45 kg. 





1 mi = 1.6 km. 


1 ft = 0.3 m. 


1yd=0.9m. 


1 fathom = 1.8 m. 


1 Ib/in.2 = 6.9 kPa. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 





A. Weight 
B. Revolutions 





Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
Apr. 23-24, 1967 Soyuz 1 14,000 Ib 24 h 45 min After a 2-year hiatus, Soviet Union crewed space flight resumed with 
Vladimir M. Komarov 18 rev Soyuz 1. During reentry on its eighteenth orbit, after 25.2 h of flight, 
139 mi Soyuz 1’s parachutes fouled and the 14,000-Ib spacecraft crashed, 
killing Komarov. 
Oct. 11-22, 1968 Apollo 7 87,382 Ib 260 h 9 min 3 s First crewed flight of Apollo command and service modules. 
Walter M. Schirra, Jr. 163 rev Launched by a Saturn 1B, this was an engineering test flight and was 
Donn Eisele 244 mi a complete success. Rendezvous and simulated docking exercises 
Walter Cunningham with the Saturn upper stage were practiced. A TV camera was 
carried, and a series of in-flight programs were staged by the 
astronauts, showing their activities within the command module. 
Transmissions were relayed live to home receivers all over the United 
States. 
Oct. 26-30, 1968 Soyuz 3 14,000 Ib 94h 51 min First successful crewed flight of the Soyuz spacecraft. Accom- 
Georgi T. Beregovoi 64 rev plished orbital rendezvous with the uncrewed Soyuz 2, which was 
151 mi orbited several days previously. Docking was not attempted. Auto- 
matic control was used to bring the two craft within 650 ft of each 
other. At the point the pilot assumed manual control, and he made 
at least two simulated dockings. Television pictures of the inside of 
the spacecraft were relayed to home receivers in the Soviet Union 
and Europe. 
Dec. 21-27, 1968 Apollo 8 87,000 Ib 146 h 59 min 49s First crewed circumlunar orbital mission. Also first use of Saturn 5 
Frank Borman 10 rev launch vehicle for a crewed flight. Remained in an orbit about 69 mi 
James A. Lovell, Jr. 231,000 mi (lunar above the lunar surface for approximately 20 h (10 revolutions). The 
William Anders module) ability to use lunar landmarks for navigation to the lunar landing 
sites was successfully verified. Communications with Earth were 
excellent, and TV pictures of good quality, of both the lunar surface 
and the Earth from lunar and translunar distances, were transmitted 
to the Earth from circumlunar reentry speed. Landing was within 
3 mi of the recovery ship. 
Jan. 14-17, 1969 Soyuz 4 14,000 Ib 71h 14 min Rendezvoused and docked with Soyuz 5 to form the “first 
Vladimir A. Shatalov 48 rev experimental space station.” Cosmonaut Shatalov in Soyuz 5 
158 mi performed the actual docking. The two spacecraft remained docked 
for 4h 35 min, during which time cosmonauts Khrunov and Yeliseyev 
transferred from Soyuz 4 to Soyuz 5, spending about 1 h outside the 
spacecraft in the process. 
Jan. 15-18, 1969 Soyuz 5 14,000 Ib 72 h 46 min After docking with Soyuz 4, cosmonauts Khrunov and Yeliseyev 
Boris V. Volynov 50 rev transferred to Soyuz 5 and cosmonaut Volynov remained in the 
Yevgeny V. Khrunov 158 mi spacecraft. Details of the transfer operation were transmitted 
Aleksei S. Yeliseyev over TV. 
Feb. 28-Mar. 13,1969 Apollo 9 95,231 Ib 241 h 1 min Earth orbital flight designed to give the lunar module its first crewed 
James A. McDivitt 151 rev flight test. This vehicle was crewed for 28 h by astronauts Schweikart 
David R. Scott 240 mi and McDivitt. During this period the hatches of both command 
Russell L. Schweikart module and the lunar module were opened. Schweikart got out onto 
the “front porch” of the lunar module and thus was able to exercise 
the suit and backpack designed for operation on the lunar surface. 
Some curtailment of this activity was deemed advisable because 
of Schweikart’s previous nausea. Subsequently, the lunar module 
was separated from the command module, and after 6 h 24 min 
was rendezvoused and docked with the command module. The 
astronauts in the lunar module reentered the command module and 
then jettisoned the lunar module. Numerous navigation and control 
exercises and Earth photography experiments were accomplished. 
Because of weather conditions in the prime recovery areas, the 
landing area was moved 500 mi south. Splashdown occurred within 
3 mi of the recovery ship. 
May 18-26, 1969 Apollo 10 96,000 Ib 192 h 3 min 23 s Lunar module tested in lunar orbit. Astronauts Stafford and Cernan 
Thomas P. Stafford 31 rev entered the lunar module, which separated from the command 
John W. Young 251,000 mi (lunar module, descended to within 50,000 ft of the lunar surface, and 
Eugene A. Cernan module) then rendezvoused and docked with the command module, which 


had remained in an orbit about 69 mi above the lunar surface. Trans- 
mitted to the Earth were color TV pictures of the lunar surface, of the 
Earth from lunar and translunar distances, and of the activities of 
the astronauts inside the command module. This flight successfully 
verified that the Apollo spacecraft was ready to attempt a lunar 
landing on the next flight. 





1 Ib = 0.45 kg. 


1 mi = 1.6 km. 


1 ft = 0.3 m. 


1yd =0.9m. 1 fathom = 1.8 m. 


1 Ib/in.2 = 6.9 kPa. 
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Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
July 16-24, 1969 Apollo 11 96,000 Ib 195h17min The objective of the Apollo program (a crewed lunar landing) was 
Neil A. Armstrong 31 rev 12s achieved in this flight. Astronaut Armstrong emerged from the lunar 
Michael Collins 250,000 mi module on July 20, 1969, followed shortly afterward by Aldrin. This 
Edwin E. Aldrin, Jr. (lunar CM) event was recorded by a TV camera and relayed to home receivers in 
many parts of the world by satellite. The astronauts gathered samples 
of the lunar surface material for return to Earth. After deploying a United 
States flag, they installed a passive seismic instrument and a laser 
reflector on the lunar surface so that data of scientific interest could be 
obtained after their return to Earth. The astronauts found that mobility 
on the lunar surface in the one-sixth gravity environment presented no 
unanticipated problems, and walking was relatively easy. During the 
whole flight all the systems functioned correctly and the other parts of 
the flight followed Apollo 10 pattern closely. 
Oct. 11-16, 1969 Soyuz 6 14,000 Ib 118h 42min Tested effectiveness of various welding techniques in space. Along 
Georgi S. Shonin 79 rev with Soyuz 7 and 8, tested multiple launch capacity of Soviet Union 
Valeri N. Kubasov 144 mi and performed various scientific investigations. 
Oct. 12-17, 1969 Soyuz 7 14,000 Ib 118h41 min Along with Soyuz 6 and 8, tested capacity of Soviet Union for multiple 
Anatoly V. Filipchenko — launch and rendezvous; also performed navigation experiments and 
Vladislav N. Volkov 141 mi various scientific investigations. 
Viktor V. Gorbatko 
Oct. 13-18, 1969 Soyuz 8 14,000 Ib 119 h Along with Soyuz 6 and 7, tested capacity of Soviet Union for multiple 
Vladimir A. Shatalov — launch and rendezvous; also tracked other ships’ maneuvers and 
Aleksei S. Yeliseyev 140 mi performed various scientific investigations. 
Nov. 14-22, 1969 Apollo 12 96,000 Ib 142h52 min More lengthy exploration of the Moon’s surface. 
Charles Conrad, Jr. 45 rev 
Richard F. Gordon, Jr. 250,000 mi 
Alan L. Bean (lunar CM) 
Apr. 11-17, 1970 Apollo 13 97,000 Ib 142h52min Rupture of a fuel-cell oxygen tank in service module forced 
James A. Lovell, Jr. — cancellation of the third crewed trip to the Moon. The rupture 
Fred W. Haise, Jr. 250,000 mi was caused by high pressure which developed when the failure of 
John L. Swigert, Jr. two thermal switches responsible for regulating the tank heaters 
led to abnormally high temperatures within the tank, produced 
by the combustion of Teflon in the supercritical (cryogenic liquid) 
oxygen. Lunar module life-support system enabled crew to return to 
Earth. 
June 1-19, 1970 Soyuz 9 14,000 Ib 424h59min Predominantly biomedical flight, testing human ability to live and 
Andrian G. Nikolayev 300 rev work for prolonged periods of time in spacecraft environment. Earth 
Vitali I. Sevastyanov 167 mi resources applications experiments were also performed. 
Jan. 31-Feb. 9,1971 Apollo 14 98,000 Ib 216 h 2 min Third lunar landing. Precision touchdown accomplished by Shepard 
Alan B. Shepard Jr. 34 rev and Mitchell in “Antares.” On two EVAs, crew deployed Apollo 
Stuart A. Roosa 250,000 mi lunar surface scientific experiment package (ALSEP), completed other 
Edgar D. Mitchell (lunar CM) scientific tasks. First extraterrestrial use of a two-wheeled cart, 
modular equipment transporter (MET). New record surface time, 9 h 
24 min. Surface samples returned to Earth, 94 Ib. 
‘Apr 19, 1971 Salyut 1 40,000 Ib 175 days First Soviet space station. Prototype. Launched crewless. Reentered 
Crewless — Oct. 11, 1971. 
176 mi 
Apr. 23-24, 1971 Soyuz 10 14,500 Ib 47 h 46 min Three-person spacecraft, docked with Salyut 7 space station. 
Vladimir A. Shatalov 32 rev Remained docked for 5.5 h, but no crew transfer took place, probably 
Aleksey S. Yeliseyev 165 mi because of mechanical difficulties. 
Nikolai Rukavishnikov 
June 6-30, 1971 Soyuz 11 14,300 Ib 570 h22 min Second docking with Salyut 1. Crew boarded Salyut and remained 
Georgi T. Dobrovolskiy 385 rev for 23 days 17 h 40 min, breaking previous Soyuz 9 record. After 
Vladislav N. Volkov 140 mi extensive on-board activities, crew transferred to Soyuz for Earth 
Viktor |. Patsayev return and was killed during reentry by pulmonary embolisms, 
caused by rapid decompression of cabin through leaking hatch 
valve. 
July 26-Aug. 7,1971 Apollo 15 107,000 Ib 295 h Perfect landing by Scott and Irwin in “Falcon” at Hadley Rill. First use of 
David R. Scott 74 rev lunar roving vehicle (LRV), driven a total of 17.5 mi during three surface 
Alfred M. Worden 250,000 mi EVAs. Brought back more than 170 Ib of samples. First subsatellite, 
James B. Irwin (lunar CM) carrying particles and fields (P&F) experiment, launched in lunar orbit 
by crewed spacecraft. First inflight EVA performed by Worden in deep 
space on return leg. 
1 Ib = 0.45 kg. 1 mi = 1.6 km. 1 ft = 0.3 m. 1 yd = 0.9 m. 1 fathom = 1.8 m. 1 Ib/in.2 = 6.9 kPa. 
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A. Weight 


B. Revolutions 


Remarks 





Landing by Young and Duke in “Orion” at Descartes. Second use of 
LRV, for a total of 16.6 driven miles. Three EVAs totaling 20 h 14 min 
yielded 209 Ib of samples. Fourth ALSEP station was deployed on 
surface and second P8&F subsatellite launched in lunar orbit. On return 
leg, Mattingly undertook second deep-space EVA to retrieve film from 
external bay. 


Last Apollo lunar landing, by Cernan and Schmitt, a geologist, in 
“Challenger” in mountainous region at Taurus-Littrow. Third LRV was 
driven 21 mi; fifth ALSEP deployed. Longest lunar stay time (74 h 
59 min 38 s), with three EVAs setting new record of 22 h 5 min 6 s. 
Most samples returned to Earth, 250 Ib. Third deep-space film retrieval 
EVA, by Evans. 


Second Soviet space station. Failed in orbit. 


First United States (experimental) space station, launched crewless 
on two-stage Saturn 5. At 60 s after launch, the workshop 
meteoroid/thermal shield tore loose. Primary solar array wing was lost, 
the other jammed closed by debris. As a result, inside temperatures 
became too high and power output too low for long-term human 
habitation. Station was saved through determined effort by NASA 
engineers on ground and astronauts in space, to become successful 
beyond expectations. Station reentered July 11, 1979. 


Crew launch on Saturn 1B postponed from May 15 to analyze 
problems. After entering Skylab, crew deployed parasol-like thermal 
blanket on workshop outside through small air lock. On June 7, Conrad 
and Kerwin performed hazardous EVA to repair jammed solar wing. 
Three EVAs totaled 5 h 51 min. 


Launch vehicle, Saturn 1B. Crew erected new twin-pole sun shield on 
EVA, installed new rate gyros. Extended solar observations with Apollo 
telescope mount (ATM), and 39 Earth observation passes. Three EVAs 
totaled 13 h 44 min. Space manufacturing and maneuvering unit tests 
added to mission schedule. 


First flight after Soyuz 11 tragedy. Short-duration test of a modified 
Soyuz with new hatch/valve design, improved maneuvering and 
manual control capabilities. Tested new space suit design and cabin 
suit loop. 


Launch vehicle, Saturn 1B. Last Skylab mission with 4 EVAs for 
film exchange and mechanical repairs, totaling 22 h 21 min. Mission 
included first exoatmospheric observation of a comet, Kohoutek. 


Similar to Soyuz 12, with addition of Orion 2 telescope system for solar, 
stellar, and cometary studies. Also included were biomedical studies, 
such as Levka brain-bloodflow device. 


Third Soviet space station. Military type. New solar panel and other 
systems. 


Crew docked and transferred to Salyut 3 space station. Purpose was 
to test the new space station. Stay on board was 14'/ days. 


Intended as second crew transfer to Sa/yut 3, but unable to dock 
because remotely controlled docking system failed. 


Soviet test and simulation of ASTP mission, including new compatible 
docking system and reduction of spacecraft pressure from 14.7 to 
10 Ib/in.? in flight. To approximate the United States docking system for 
docking tests, a dummy structural ring was taken along and separated 
in orbit. 


Fourth Soviet space station. Civilian type. 





Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration 
Apr. 16-27, 1972 Apollo 16 103,000 Ib 265 h 51 min 
John W. Young 64 rev 5s 
Thomas K. Mattingly II 250,000 mi 
Charles M. Duke (lunar CM) 
Dec. 7-19, 1972 Apollo 17 103,000 Ib 301 h 51 min 
Eugene A. Cernan 75 rev 
Ronald E. Evans 250,000 mi 
Harrison H. Schmitt (lunar CM) 
Apr. 3, 1973 Salyut 2 40,000 Ib — 
Crewless — 
May 14, 1973- Skylab 1 170,000 Ib (without — 
July 11, 1979 Crewless payload shroud) 
276 mi 
May 25- Skylab 2 31,000 Ib 28 days 
June 22, 1973 Charles Conrad, Jr. 404 rev 49 min 
Joseph P. Kerwin 276 mi 
Paul J. Weitz 
July 28- Skylab 3 31,000 Ib 59 days 1 h 
Sept. 25, 1973 Alan L. Bean 858 rev 9 min 
Jack R. Lousma 270 mi 
Owen Garriott 
Sept. 27-29, 1973 Soyuz 12 14,460 Ib Approx. 48 h 
Vasily G. Lazarev 32 rev 
Oleg G. Makarov 216 mi 
Nov. 16, 1973- Skylab 4 31,000 Ib 84 days 1h 
Feb. 8, 1974 Gerald P. Carr 1214 rev 16 min 
William R. Pogue 270 mi 
Edward Gibson 
Dec. 18-26, 1973 Soyuz 13 14,400 Ib 7 days 20h 
Pyotr |. Klimuk — 55 min 
Valentin V. Lebedev 169 mi 
June 25, 1974 Salyut 3 40,000 Ib 214 days 
Crewless — 
172 mi 
July 3-19, 1974 Soyuz 14 14,400 Ib 15 days 17h 
Pavel R. Popovich — 39 min 
Yuri P. Artukhin 169 mi 
Aug. 26-28, 1974 Soyuz 15 14,400 Ib 48 h 12 min 
Gennady V. Sarafanov 32 rev 
Lev S. Demin 183 mi 
Dec. 2-8, 1974 Soyuz 16 14,400 Ib 5 days 22 h 
Anatoly V. Filipchenko — 24 min 
Nikolai Rukavishmikov 140 mi 
Dec. 26, 1974 Salyut 4 40,000 Ib 770 days 
Crewless — 
226 mi 
1 Ib = 0.45 kg. 1 mi = 1.6 km. 1ft=0.3m. 1yd = 0.9 m. 1 fathom = 1.8 m. 


1 Ib/in.2 = 6.9 kPa. 
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Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
Jan. 11-Feb.9,1975 Soyuz17 14,400 Ib 29 days 13h Crew docked and transferred to Salyut 4 space station. 
Aleksei Gubarev — 20 min 
Georgiy Grechko 225 mi 
Apr. 5, 1975 Soyuz 18A 14,400 Ib Approx. 20 min Suborbital flight. Mission aborted after launch due to vehicle failure 
Vasiley Lazarev — after approximately 20 min of flight. 
Oleg Makarov = 
May 24-July 26, Soyuz 18 14,400 Ib 62 days 23 h Second crew transfer to Salyut 4, which had completed 3352 
1975 Pyotr |. Klimuk — 20 min revolutions at the time. Flew part of the time simultaneously with 
Vitali I. Sevastyanov 220 mi ASTP. 
July 15-24, 1975 Apollo-Soyuz Test First international link-up in space (docking at 31 mi max. distance 
Project (ASTP) from Earth for 1 day 23 h 17 min) and last flight of an Apollo 
United States: 32,500 Ib 9 days 1 h spacecraft. Apollo launched on Saturn 1B 7"> h after Soyuz. First 
Thomas P. Stafford 138 rev 28 min handshake between Stafford and Leonov occurred 3:19 p.m. EDT 
Vance D. Brand 143 mi on July 17. Crews performed interspacecraft transfer four times 
Donald K. Slayton and conducted joint activities. A total of 31 experiments were 
Soviet Union: 15,000 Ib 5 days 22 h performed by American crew, some of them jointly with Soviets, 
Aleksely A. Leonov 96 rev 31 min and docking tests with Soyuz were practiced. After final undocking, 
Valerity N. Kubasov — each spacecraft continued on own mission. Soyuz returned July 21. 
June 22, 1976 Salyut 5 40,000 Ib 412 days Fifth Soviet space station. Civilian type. 
Crewless — 
169 mi 
July 6-Aug. 24, Soyuz 21 14,4000 Ib 49 days 6 h Crew docked on July 7 and transferred to Salyut 5 space station, 
1976 Vitaly Zholobov — 24 min launched June 22. On-board activities included earth resources 
Boris V. Volynov 169 mi photography, weather observations; formation of crystals and 
metals-melting in zero g; plant and insect growth for genetic 
studies. Crew reportedly suffered sensory deprivation effects. 
Flight had been preceded by uncrewed Soyuz 20 (launched Oct. 17, 
1975, docked to Salyut 4 on Oct. 19). 
Sept. 15-23, 1976 Soyuz 22 14,400 Ib 7 days 21 h Earth surveillance mission. Test of a special East German camera 
Valery F. Bykovsky 127 rev 54 min system for use on space stations. 
Vladimir Aksenov 175 mi 
Oct. 14-16, 1976 Soyuz 23 14,400 Ib 2 days 6 min Attempt to dock with Salyut 5 space station failed. Crew returned 
Vyacheslav Zudov 32 rev to Earth. 
Valery Rozhdestvenskiy 172 mi 
Feb. 7-25, 1977 Soyuz 24 14,400 Ib 17 days 17h Crew transferred to Salyut 5 on Feb. 8. Space station had mostly 
Viktor V. Gorbatko 285 rev 23 min military mission with special photo reconnaissance runs. 
Yuri Glazkov 176 mi 
Sept. 29, 1977 Salyut 6 42,000 Ib = Uncrewed launch of new Soviet space station. Reentered July 29, 
Uncrewed — 1982. 
221 mi 
Oct. 9-11, 1977 Soyuz 25 14,400 Ib 2 days 46 min Attempt to dock with Salyut 6 failed. Crew returned to Earth. 
Vladimir Kovalyonok 32 rev 
Valery Ryumin 221 mi 
Dec. 10-Jan. 16, Soyuz 26 14,400 Ib 37 days 10h Crew transferred to Salyut 6 for a 96-day mission, breaking United 
1977 Yuri Romanenko 590 rev 6 min States record of 84 days on Mar. 4, 1978. Crew returned on Mar. 16, 
Georgiy Grechko 221 mi Crew duration: 1978, with Soyuz 27. 
96 days 10h 
Jan. 10-16, 1978 Soyuz 27 14,400 Ib 64 days 22h Crew transferred to Salyut 6 to join Soyuz 26 crew. Resupplied by 
Vladimir Dzhanibekov 1024 rev 53 min first crewless logistics flight, Progress 1 (launched Jan. 20). On 
Oleg G. Makarov 219 mi Crew duration: return to Earth, crew used the “older” Soyuz 26 spacecraft, leaving 
5 days 22 h Soyuz 27 for the long-duration crew (returned Mar. 16). 
59 min 
Mar. 2-10, 1978 Soyuz 28 14,400 Ib 7 days 22h Crew transferred to Salyut 6 on Mar. 4 to join Soyuz 26 crew. First 
Aleksei Gubarev 125 rev 17 min non-Soviet cosmonaut: Remek (Czechoslovakia). 
Vladimir Remek 221 mi 
June 16-Sept. 3, Soyuz 29 14,400 Ib 79 days 15 h Crew transferred to Salyut 6 on June 17 to establish new duration 
1978 Vladimir Kovalyonok 125 rev 23 min record (140 days). Returned on Nov. 2 with Soyuz 31 spacecraft. 
Alexander Ivanchenko 230 mi Crew duration: Resupplied by Progress 2 (launched July 7), Progress 3 (launched 
140 days Aug. 7), Progress 4 (launched Oct. 3). 
1 Ib = 0.45 kg. 1 mi = 1.6 km. 1ft=0.3m. 1yd =0.9m. 1 fathom = 1.8m. 1 Ib/in.? = 6.9 kPa. 
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Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
June 27-July 5, 1978 Soyuz 30 14,400 Ib 7 days 22h Crew transferred to Salyut 6 to join Soyuz 29 crew. Second non- 
Pyotr |. Klimuk 125 rev 4 min Soviet cosmonaut: Hermaszewski (Poland). 
Miroslav 225 mi 
Hermaszewski 
Aug. 26-Nov. 2, 1978 Soyuz 31 14,400 Ib 67 days 20h Crew transferred to Salyut 6 to join Soyuz 29 crew. Third non-Soviet 
Valery F. Bykovsky 1070 rev 14 min cosmonaut: Jaehn (East Germany). Crew returned with Soyuz 29 
Sigmund Jaehn 221 mi Crew duration: spacecraft. 
7 days 20h 
49 min 
Feb. 25-June 13, Soyuz 32 14,400 Ib 108 days 4h Crew transferred to Salyut 6 to establish new endurance record 
1979 Vladimir Lyakhov 1711 rev 24 min (175 days). Returned on Aug. 19 with Soyuz 34. Soyuz 32 returned 
Valery Ryumin 232 mi Crew duration: | uncrewed. Resupplied by Progress 5 (launched Mar. 12) and 
175 days Progress 6 (launched May 13). 
Apr. 10-12, 1979 Soyuz 33 14,400 Ib 1 day 23h Docking with Salyut 6 failed. Crew returned to Earth. Fourth non- 
Nikolai Rukavishnikov 31 rev 1 min Soviet cosmonaut: Ivanov (Bulgaria). 
Georgi Ivanov 221 mi 
June 6-Aug. 19, 1979 Soyuz 34 14,400 Ib 73 days 18 h Launched crewless and docked to Salyut 6. Returned Soyuz 32 
1152 rev 17 min crew. Resupplied by Progress 7 (launched June 28). New type of 
257 mi crewless Soyuz (Soyuz T1) launched Dec. 16 on flight test to dock 
with Salyut 6. 
Apr. 9-June 3, 1980 Soyuz 35 14,400 Ib 55 days 22 min Crew transferred to Salyut 6 for record-duration mission (184 days). 
Leonid Popov 869 rev Crew duration: Returned Oct. 11 with Soyuz 37 spacecraft. Resupplied by 
Valery Ryumin 220 mi 184 days 20 h Progress 8 (launched Mar. 27), Progress 9 (launched Apr. 27), 
12 min Progress 10 (launched June 29), Progress 11 (launched Sept. 28). 
May 26-July 31, Soyuz 36 14,400 Ib 65 days 20 h Crew transferred to Salyut 6 to join with Soyuz 35 crew. Returned 
1980 Valery Kubasov 1040 rev 54 min June 3 with Soyuz 35 spacecraft. Fifth non-Soviet cosmonaut: 
Bertalan Farkas 220 mi Farkas (Hungary). 
June 5-9, 1980 Soyuz T2 14,400 Ib 3 days 22 h First crewed test of new type spacecraft. Crew transferred to Salyut 
Yuriy Malyshev 62 rev 41 min 6 on June 6. Tenth crewed visit to Salyut 6. 
Vladimir Aksenov 220 mi 
July 23-Oct. 11, Soyuz 37 14,400 Ib 79 days 15h Crew transferred to Salyut 6 on July 25 to join with Soyuz 35 
1980 Viktor V. Gorbatko 1258 rev 17 min crew. Returned July 31 with Soyuz 36 spacecraft. Sixth non-Soviet 
Pham Tuan 220 mi Crew duration: | cosmonaut: Pham Tuan (Vietnam). 
7 days 20h 
42 min 
Sept. 18-26, 1980 Soyuz 38 14,400 Ib 7 days 20h Crew transferred to Salyut 6 to join with Soyuz 35 crew. Returned 
Yuri Romanenko 124 rev 43 min Sept. 26 with Soyuz 38. Seventh non-Soviet cosmonaut: Mendez 
Arnaldo T. Mendez 220 mi (Cuba). 
Nov. 27-Dec. 10, Soyuz T3 14,400 Ib 12 days 19h Crew transferred to Salyut 6. Thirteenth crewed visit to Salyut 
1980 Leonid Kizim 202 rev 8 min 6. First three-person mission since ill-fated Soyuz 17 in 1971. 
Oleg G. Makarov 220 mi Returned Dec. 10. Salyut 6 resupplied by Progress 12 (launched 
Gennady Strekalov Jan. 24). 
Mar. 12-May 26, Soyuz T4 14,400 Ib 75 days Crew transferred to Salyut 6 on Mar. 14. Savinykh became 100th 
1981 Vladimir Kovalyonok 1183 rev human to go into space. To date: 43 U.S. astronauts, 57 East-bloc 
Viktor Savinykh 220 mi cosmonauts. 
(estimated) 
Mar. 22-30, 1981 Soyuz 39 14,400 Ib 7 days 20 h Crew transferred to Salyut 6 on Mar. 23 to join with Soyuz T4 crew. 
Vladimir Dzhanibekov 125 rev 43 min Returned on Mar. 30 with Soyuz 39. Eighth non-Soviet cosmonaut: 
Jugderdemidiyn 220 mi Gurragcha (Mongolia). 
Gurragcha 
Apr. 12-14, 1981 STS 1 205,000 Ib 2 days 6 h First crewed orbital test flight of the U.S. space shuttle Columbia, 
John Young 36 rev 20 min 59 s for a full, successful orbital check of the revolutionary reusable 
Robert Crippen 172 mi spacecraft and its many subsystems. Return to Earth, featuring at- 
mospheric entry at Mach 25, hypersonic/supersonic maneuvering 
flight and subsonic unpowered landing at Edwards AFB, California, 
was celebrated by millions around the world. Time of landing on 
Apr. 14: 1:20 pm. EST. 
1 Ib = 0.45 kg. 1 mi = 1.6 km. 1 ft = 0.3 m. 1 yd = 0.9m. 1 fathom = 1.8 m. 1 Ib/in.2 = 6.9 kPa. 
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Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
May 14-22, 1981 Soyuz 40 14,400 Ib 7 days 20h Crew transferred to Salyut 6 on May 15 to join with Soyuz T4 crew. 
Leonid Popov 124 rev 41 min Returned May 22 with Soyuz 40. Ninth non-Soviet cosmonaut: 
Dumitru Prunariu 220 mi Prunariu (Romania). 
Nov. 12-14, 1981 STS 2 210,000 Ib 2 days 6h Second crewed orbital test flight of the U.S. space shuttle 
Joe Engle 36 rev 13 min Columbia. Launch at Cape Canaveral after two postponements 
Richard Truly 166 mi and a liftoff delay due to hardware problems. Mission duration 
reduced from the planned 124 h to 54 h due to malfunction of on- 
board fuel cell. Despite shortened mission, crew achieved all ob- 
jectives. Landing on Rogers Dry Lake, Edwards AFB. 
Mar. 22-30, 1982 STS 3 210,000 Ib 8 days 4 min Third crewed orbital test of space shuttle Columbia. Launch at 
Jack R. Lousma 130 rev Cape Canaveral after 1 h delay. Fully successful. Flight extended 
C. Gordon Fullerton 150 mi 1 day to enable landing at Northrup Strip, New Mexico, after rains 
rendered Rogers Dry Lake, California, too soft. 
Apr. 19, 1982 Salyut 7 42,000 Ib = Crewless launch of new Soviet space station, replacing Salyut 6 
Crewless — (deorbited July 29,1982). Eleventh anniversary of launch of Salyut 1. 
163 mi Salyut 7 and an attached module Cosmos 1686 remained in orbit 
for almost 9 years, reentering on Feb. 6, 1991, after about 50,200 
orbits of the Earth. 
May 13-Dec. 10, Soyuz T5 14,400 Ib 106 days Crew transferred to Salyut 7 on May 14 and went on to establish 
1982 Anatoliy Berezovoy Approx. 3380 rev Crew duration: new duration record of 211 days. Was host to two visiting crews, 
Valentin Lebedev (crew) 211 days T6 and 77, and returned with Soyuz T7 spacecraft. 
216 mi 
June 25-July 2, Soyuz T6 14,400 Ib 7 days 21 h Crew transferred to Salyut 7 on June 25 to join with T5 crew. 
1982 Vladimir Dzhanibekov 125 rev 51 min Returned July 2 with T6 spacecraft. Tenth non-Soviet cosmonaut: 
Aleksandr lvanchenkov 194 mi Jean-Loup Chrétien (France). 
Jean-Loup Chrétien 
June 27-July 4, STS 4 210,000 Ib 7 days 1h Fourth and last orbital test flight of Columbia, after 74 workdays 
1982 Thomas K. Mattingly 112 rev 10 min of refurbishment. Mission accomplished and flight test program 
Henry W. Hartsfield 150 mi concluded with great success. Landing at Edwards AFB. 
Aug. 19-27, 1982 Soyuz T7 14,400 Ib 113 days Crew transferred to Salyut 7 on Aug. 20 to join T5 crew. Returned 
Leonid Popov 125 rev (crew) Crew duration: Aug. 27 with T5 spacecraft. Second woman in space: Svetlana 
Aleksandr Serebrov 194 mi 7 days 21h Savitskaya. 
Svetlana Savitskaya 52 min 
Nov. 11-16, 1982 STOSS 210,000 Ib 5 days 2 h First operational mission of shuttle Columbia successfully delivers 
Vance D. Brand 81 rev 14 min reimbursable cargo of two commercial communications satellites 
Robert Overmyer 150 mi (comsats) to orbit. First four-person crew. After landing at Edwards, 
Joseph Allen Columbia returns to manufacturer for changes necessitated by 
William Lenoir Spacelab payload (STS 9). 
Mar. 10-Aug. 14, Kosmos 1443 42,000 Ib 157 days Salyut 7 temporarily enlarged by docking of crewless “Kosmodul” 
1983 Crewless = Kosmos 1443 resupply/reentry vehicle. 
214 mi 
Apr. 4-9, 1983 STS 6 250,000 Ib 5 days 24 min First flight of space shuttle Challenger featuring first EVA by shuttle 
Paul Weitz 80 rev astronauts (Musgrave and Peterson) after 9 years. Cargo of 2- 
Karol Bobko 183 mi ton Tracking and Data Relay Satellite (TDRS) deployed successfully 
Donald Peterson but attained geostationary orbit (GEO) later as planned after major 
Story Musgrave rescue program by NASA (58 days) involving 39 maneuvers by 
mini-thrust-control jets. 
Apr. 20-22, 1983 Soyuz T8 14,400 Ib 48 h 18 min Mission aborted. Crew returned after failure of link-up with Salyut 
Vladimir Titov 32 rev 7 space station when rendezvous radar deployed incorrectly. 
Gennady Strekalov 194 mi 
Alexandr Serebrov 
June 18-24, 1983 STS 7 250,000 Ib 6 days 2h Shuttle Challenger with first five-person crew including first 
Robert Crippen 97 rev 24 min American woman in space (Ride). Successful deployment of two 
Frederick Hauck 187 mi commercial comsats (Anik C-2, Palapa B-1). First deployment/ 
John Fabian retrieval of free-flying experiment-carrying platform, Shuttle Palette 
Sally Ride Satellite (SPAS 01) of West Germany, with Canadian-built robot- 
Norman Thagard arm remote manipulator system. 
June 27-Nov. 24, Soyuz T9 14,400 Ib 149 days 9h Crew transferred to Salyut 7/Kosmos 1443 on June 28, conducted 
1983 Vladimir Liakov 2390 rev 46 min close to 300 experiments during their stay—which was extended 
Aleksandr Aleksandrov 214 mi unexpectedly when a planned follow-on mission did not come 
because of explosion of carrier rocket during the launch attempt 
1 Ib = 0.45 kg. 1 mi = 1.6 km. 1 ft = 0.3 m. 1 yd = 0.9 m. 1 fathom = 1.8m. 1 Ib/in2 = 6.9 kPa. 
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B. Revolutions 





Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
June 27-Nov 24, on Sept. 26. Its crew, Titov and Strekalov, already ill-fated on 
1983 (cont.) T8, escaped serious injury. T9 crew added solar arrays, delivered 
by Kosmos 1443, to Salyut 7, and returned on Nov. 24 with T9 
spacecraft. 
Aug. 30-Sept. 5, STS 8 250,000 Ib 6 days 1h 8 min Third flight of Challenger, featuring first night launch (Canaveral) 
1983 Richard Truly 97 rev and first night landing (Edwards). First black NASA astronaut in 
Daniel Brandenstein 193 mi space (Bluford). Successful delivery of Indian comsat /nsat 1-B, 
Dale Gardner and test of robot-arm remote manipulation system with 4-ton 
William Thornton proof mass. First inflight investigations into space motion sickness 
Guion Bluford phenomenon (space adaptation syndrome; SAS) by physician 
Thornton. 
Nov. 28-Dec. 8, STS 9/SL 1 250,000 Ib 10 days 7h First flight of European-built (ESA) Spacelab aboard shuttle 
1983 John Young (Spacelab: 47 min Columbia. First six-person crew. First foreign shuttle astronaut: 
Brewster Shaw 26,000) Merbold (West Germany). Work aboard Spacelab on more than 70 
Byron Lichtenberg 166 rev experiments in many scientific-technical disciplines went so well 
Ulf Merbold 156 mi that mission was extended by 1 day. Landing at Edwards, after 
Owen Garriott mission had conclusively proved the practical worth of humans in 
Robert Parker space. 
Feb. 3-11, 1984 STS 41-B* 250,000 Ib 7 days 23h Tenth flight of shuttle and fourth flight of Challenger, carrying 
Vance D. Brand 127 rev 16 min two reimbursable commercial comsats (Westar 6, Palapa B-2). 
R. Gibson 185 mi Satellites deployed successfully, but payload assist module (PAM) 
Bruce McCandless upper stage failed in both cases. Comsats retrieved later by mission 
Robert Stewart 51-A. First free (untethered) EVA performed by McCandless, 
Ron McNair wearing manned maneuvering unit (MMU) back-pack system, to 
300 ft from orbiter. Second black astronaut (McNair). First landing 
at Kennedy Space Center, Florida. 
Feb. 8-Oct. 2, Soyuz T10 14,400 Ib 63 days Crew transferred to Salyut 7 on Feb. 9 and went on to set new 
1984 Leonid Kizim 3790 rev (crew) Crew duration: | endurance record of 237 days. During the 34-week stay, Kizim and 
Vladimir Solovyov 200 mi 236 days 21h Solovyov made six EVAs and spent almost 23 h outside the station, 
Oleg Atkov 50 min mainly for major repair work on Salyut 7. Crew took more than 
25,000 photographs of Earth and was joined twice by visiting crews 
(T11 and T12). Resupplied by Progress 19 on Feb. 23, Progress 
20 on Apr. 17, Progress 21 on May 8, Progress 22 on May 30, 
and Progress 23 on Aug. 15. Returned on Oct. 2 with Soyuz T11 
spacecraft. 
Apr. 2-11, 1984 Soyuz T11 14,400 Ib 183 days Crew transferred to Salyut 7 to join T10 crew. Returned with more 
Juri Malishev 125 rev (crew) Crew duration: than 1000 photographs of India and other Earth areas with Soyuz 
Grennady Strekalov 200 mi 7 days 21 h T10 spacecraft. Eleventh non-Soviet cosmonaut: Sharma (India). 
Rakesh Sharma 42 min 
Apr. 6-13, 1984 STS 41-C 250,000 Ib 6 days 23 h Fifth flight of Challenger. Flew new “direct” ascent to higher- 
Robert Crippen (LDEF: 21,000) 41 min than-usual altitude for rendezvous with solar telescope satellite 
Richard Scobee 107 rev Solar Maximum Mission (SMM), inoperative since end of 1980. 
George Nelson 313 mi Crew deployed large Long Duration Exposure Facility (_DEF), then 
Terry Hart captured and repaired SMM with full success, setting new EVA 
James van Hoften record of 7h 18 min. 
July 17-19, 1984 Soyuz T12 14,400 Ib 12 days 7h Crew transferred to Salyut 7 on July 18 to join 710 crew. Second 
Vladimir Dzhanibekov 196 rev 14 min space mission of Savitskaya, who also became world’s first female 
Sventlana Savitskaya 221 mi spacewalker (3 h 35 min), performing experiments with electron- 
Igor Volk beam tool for space assembly and repair. Crew returned with T72 
spacecraft. 
Aug. 30-Sept. 5, STS 41-D 250,000 Ib 6 days 56 min First flight of space shuttle Discovery. Cargo: three reimbursable 
1984 Henry W. Hartsfield 96 rev comsats (SBS 4, Telstar 3-C, Leasat 1) successfully deployed. 
Michael Coats 185 mi Also: OAST-1 experiment pallet. Second U.S. woman in space 
Richard Mullane (Resnik). First “passenger” from industry (Walker, of McDonnell 
Stephen Hawley Douglas). 
Judith Resnik 
Charles Walker 
Oct. 5-13, 1984 STS 41-G 250,000 Ib 8 days 5h Sixth flight of Challenger. First seven-person crew, including two 
Robert Crippen 132 rev 24 min women, one Canadian (Garneau), one Australian (Scully-Power). 
Dave Walker 220 mi Payload: Earth Radiation Budget Satellite (ERBS), successfully 
Kathryn Sullivan deployed; OAST-3 experiments. First U.S. woman to walk in space 
Sally Ride (Sullivan). Second landing at Kennedy Space Center. 
David Leestma 
Marc Garneau 
Paul Scully-Power 
1 Ib = 0.45 kg. 1 mi =1.6 km. 1ft = 0.3 m. 1 yd = 0.9 m. 1 fathom = 1.8 m. 1 Ib/in.2 = 6.9 kPa. 


“Beginning with 41-B, shuttle flights were no longer numbered consecutively. In the new system, the first digit indicates the last digit of the current fiscal year (begins Oct. 1 of previous 
calendar year), that is, 4 = 1984. The second digit indicates the launch site: 1 = Kennedy Space Center; 2 = Vandenberg Air Force Base. The letter indicates which serial flight in the fiscal 
year; for example, B = second flight. 
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Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
Nov. 8-16, 1984 STS 51-A 250,000 Ib 7 days 23h Second flight of Discovery. Two reimbursable comsats (Telesat-H, 
Frederick Hauck 126 rev 45 min Leasat/Syncom IV-1) successfully deployed. Two others, Palapa 
Dave Walker 220 mi B-1 and Westar 6 from mission 41-B, successfully retrieved and 
Joseph Allen returned to Earth for repair. Fifth U.S. woman in space (Fisher). 
Anna Fisher Third landing at Kennedy Space Center. 
Dale Gardner 
Jan. 24-27, 1985 STS 51-C = 3 days 1h Third flight of Discovery. First dedicated Department of Defense 
Thomas Mattingly 47 rev 34 min mission, classified secret. Fourth landing at Kennedy Space Center. 
Loren Shriver — 
Gary Payton 
James Buchli 
Ellison Onizuka 
Apr. 12-19, 1985 STS 51-D 250,000 Ib 6 days 23 h Fourth flight of Discovery. Payload of two reimbursable comsats 
Karol Bobko 109 rev 55 min (Telesat-I/Anik, Syncom IV-3) successfully deployed, but one 
Donald Williams 283 mi (Syncom) failed to ignite its upper stage despite heroic attempts by 
Margaret Rhea Seddon crew with remote manipulator system. First flight of a U.S. politician 
Jeffrey Hoffman (Senator Garn, Utah). Second flight of industry representative 
David Griggs Walker. Sixth U.S. woman in space (Seddon). Landing at Kennedy 
Charles Walker Space Center resulted in damaged tires. 
E.J. “Jake” Garn 
Apr. 29-May 6, 1985 STS 51-B/Spacelab 3 250,000 Ib 7 days 8 min Second flight of Spacelab pressurized module. Crew conducted 
Robert Overmyer 110 rev over a dozen major experiments and tested research-animal 
Frederick Gregory 220 mi holding facility with rats and two squirrel monkeys. Mission ac- 
Don Lind complished with great success. Landing of Challenger at Edwards 
Norman Thagard AFB. 
William Thornton 
Lodewijk van den Berg 
Taylor Wang 
June 6-Sept. 26, Soyuz T13 14,400 Ib 112 days/ Crew transferred to Salyut 7 space station on June 8, after testing 
1985 Vladimir Dzhanibekov — Dzhanibekov; new flight control systems in Soyuz spacecraft. Performed exten- 
Viktor Savinykh 200 mi 168 days 3 h sive repair/maintenance of power system on station. Resupplied 
51 min/Savinykh by Progress 24 (launched June 21), Cosmos 1669, a new type of 
free-flying, crewless platform (docked July 21, separated Aug. 29). 
On Sept. 26, Dzhanibekov returned with “visiting” Georgi Grechko 
in Soyuz T13, leaving Savinykh with Vasyutin and Volkov in Salyut 
until Nov. 21. 
June 17-24, 1985 STS 51-G 250,000 Ib 7 days 1h Fifth flight of Discovery, with cargo of three reimbursable comsats 
Daniel Brandenstein 111 rev 40 min (Morelos-A/Mexico; Arabsat 1B/Arabia; Telstar 3D/AT&T). Crew 
John Creighton 240 mi also deployed and later retrieved free-flying instrument platform 
Shannon Lucid SPARTAN 1 (Shuttle Pointed Autonomous Research Tool for As- 
Steven Nagel tronomy). Payload included six scientiflc-technological experi- 
John Fabian ments from U.S., France, and Germany. Two foreign astronauts: 
Patrick Baudrey Prince Al-Saud (Saudi Arabia), Baudry (France). Seventh U.S. 
Sultan Salman Al-Saud woman (Lucid). 
July 29-Aug. 6, 1985 STS 51-F/Spacelab 2 252,800 Ib 7 days 22 h Eighth flight of Challenger, first flight of Spacelab pallets-only 
Charles G. Fullerton 126 rev 45 min version. Lift-off after 1 h 37 min delay due to computer problem. 
Roy D. Bridges, Jr. 197 mi One of three orbiter engines shut down prematurely (after 5 min 
Karl G. Henize 45 s) but not early enough to cause mission abort. Flight extended 
Anthony W. England 1 day tohelp recover from on-board experiment problems. Mission 
F. Story Musgrave accomplished. Landing at Edwards AFB. 
Loren W. Acton 
Aug. 27-Sept. 3, 1985 STS 51-/ 337,000 Ib 7 days 2h Sixth flight of Discovery. Liftoff after two launch delays, one 
Joe H. Engle 111 rev 17 min (Aug. 24) caused by weather, the other (Aug. 25) by backup 
Richard O. Covey 280 mi computer problem. Payload of three reimbursable comsats [ASC 
James D. A. van Hoften 1, AUSSAT 7 (Australia), Leasat (Syncom) IV-4] successfully 
John M. Lounge deployed, the first two on the same day—a first. Crew performed 
William F. Fisher rendezvous with Leasat (Syncom) IV-3, deployed Apr. 13 by STS 
51-D and drifting lifeless, and successfully conducted salvage and 
repair operation in space on Aug. 31 and Sept. 1, redeploying the 
comsat for normal operation. Landing at Edwards AFB. 
Sept. 17-Nov. 21, Soyuz T14 14,400 Ib 9 days/Grechko; Crew transferred to Salyut 7/Soyuz T13 on Sept. 18. Veteran 
1985 Vladimir Vasyutin — 64 days 21 h cosmonaut Grechko returned with Dzhanibekov on Sept. 26 in 
Georgy Grechko 200 mi 52 min/Vasyutin Soyuz T13, making this the first successful space station crew 
Alexander Valkov and Volkov; rotation. Vasyutin, Volkov, and Savinykh returned suddenly on 
168 days 3 h Nov. 21 with Soyuz T14 because of illness of Vasyutin, requiring 
51 min/Savinykh hospitalization. 
1 Ib = 0.45 kg. 1 mi = 1.6 km. 1 ft = 0.3 m. 1yd = 0.9m. 1 fathom = 1.8 m. 1 Ib/in.2 = 6.9 kPa. 
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Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
Oct. 3-7, 1985 STS 51-J = 4 days 1h First flight of fourth shuttle orbiter Atlantis, setting new altitude 
Karol Bobko 63 rev 45 min record (after 41-C). Payload: military (classifled). Landing at 
Ronald J. Grabe 320 mi Edwards AFB. 
Robert Stewart 
William A. Pailes 
David C. Hilmers 
Oct. 30-Nov. 6, 1985 STS 61-A/Spacelab D1 250,000 Ib 7 days 32 min Ninth flight of Challenger with the West German D-1 mission, the 
Henry W. Hartsfield (Spacelab: flrst foreign-chartered Spacelab flight. Crew included three foreign 
Steven R. Nagel 26,000) astronauts: Furrer, Messerschmid (both W. Germany), and Ockels 
Guion Bluford 111 rev (Netherlands). Eighth U.S. woman (Dunbar). First eight-member 
Bonnie J. Dunbar 202 mi crew. Mission concluded on Nov. 6 with great success. Landing 
James F. Buchli at Edwards AFB with test of new nose wheel steering. 
Reinhard Furrer 
Ernst Messerschmid 
Wubbo Ockels 
Nov. 26-Dec. 3, 1985 STS 61-B 261,455 Ib 6 days 22 h Second flight of Atlantis, carrying three commercial comsats: 
Brewster H. Shaw 109 rev 54 min Morelos-B (Mexico), Aussat 2 (Australia), and Satcom KU-2. Ross 
Bryan D. O’Connor 235 mi and Spring conducted major outboard activities demonstrating 
Sherwood C. Spring space-station-type assembly and handling of large structures in 
Mary L. Cleave space (payload designations: EASE/ACCESS). One foreign pay- 
Jerry L. Ross load specialist: Neri (Mexico). Third flight of industry representative 
Rudolfo Neri Vela Walker. Ninth U.S. woman (Cleave). First flight of PAM-D2 upper 
Charles D. Walker stage. Heaviest PAM payload (Satcom). Landing at Edwards AFB 
after full mission accomplishment. 
Jan. 12-18, 1986 STS 61-C 250,000 Ib 6 days 2h 4 min First flight of Columbia after major modifications since its last 
Robert L. Gibson 96 rev 9s flight (STS 9) in Nov. 1983. Originally scheduled for Dec. 18, 
Charles F. Bolden, Jr. 201 mi 1985, launch was delayed seven times due to systems problems 
George D. Nelson and inclement weather. Crew included first Hispanic American in 
Steven A. Hawley space (Chang-Diaz, Costa Rica), a payload specialist from industry 
Franklin E. Chang-Diaz (Cenker, RCA), and the second U.S. congressman in space (Florida 
Robert J. Cenker U.S. Rep. Bill Nelson). Crew deployed advanced Satcom K-1 
C. William Nelson communications satellite, took photographs of Comet Halley, and 
performed numerous on-board experiments. Landing at Edwards 
AFB after two postponements due to bad weather at Kennedy 
Space Center in Florida. Second night landing. 
Jan. 28, 1986 STS 57-L 250,000 Ib 1min 13s Challenger was launched at 11:38 a.m. EST after a night of below- 
Francis R. Scobee — freezing temperatures. Its right solid-rocket booster developed a 
Michael J. Smith — catastrophic leak due to malfunction of a joint seal between two 
Judith A. Resnik motor elements. The shuttle was destroyed 73 s after liftoff, killing 
Ellison S. Onizuka its crew of seven and bringing the United States space program to 
Ronald E. McNair a halt for 32 months. 
Gregory B. Jarvis 
Christa McAuliffe 
Feb. 19, 1986 Mir 235,000 Ib — Crewless launch of new Soviet space station Mir (Peace), similar 
Crewless 210 rev in size and design to Salyut 7, with estimated length of 57 ft. 
— Represents a new-generation station, with six docking ports for 
future buildup with add-on modules and docking of Soyuz and 
Progress logistics fiights. 
March 13-July 16, Soyuz T15 14,400 Ib 125 days Crew docked at Mir on Mar. 15 about 49 h after liftoff and 
1986 Leonid Kizim 2000 rev (est.) transferred to the space station. First resupply flight followed on 
Vladimir Solovyev 210 mi Mar. 19 with launch of crewless Progress 25 which docked to Mir 
on Mar. 21. Progress 26 was launched on Apr. 23, and the tanker- 
transport docked with Mir on Apr. 26. On May 5, Kizim and Solovyev 
transferred from Mir to Salyut 7 using Soyuz T15. Their stay in the 
Salyut 7/Soyuz T15/Cosmos 1686 complex, trailing Mir in orbit by 
15 min, included 3 h 50 min spacewalk on May 28 to demonstrate 
the ability to work with a 15-m-long triangular lattice. A crewless 
Soyuz TM, launched on May 21, docked with Mir on May 23 and 
returned to Earth on May 30—a test of a modifled version of the 
Soyuz with improved computers, rendezvous-docking systems, 
and landing parachute. A second EVA on May 31, 5 h long, also 
demonstrated structural deployment. Progress 26 undocked on 
June 22 and deorbited on June 23. On June 26, the crew returned 
to Mir with Soyuz T75. On July 16, the crew returned in Soyuz T15 
to Earth. Kizim had set a new record of 373 days of cumulative 
timein space by a human. Landing was 34 mi northeast of Arkalyk 
in Kazakhstan. 
1 Ib = 0.45 kg. 1 mi = 1.6 km. 1ft = 0.3 m. 1yd = 0.9m. 1 fathom = 1.8 m. 1 Ib/in.2 = 6.9 kPa. 
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B. Revolutions 

C. Max. distance 
from Earth 


Dates launched 
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Designation 
and crew 


Duration 


Remarks 





Feb. 6—-July 30, 
1987 


Soyuz TM-2 
Yuri Romanenko 
Alexander Laveikin 


14,400 Ib 
Approx. 145 rev 
230 mi 


March 31, 1987 Kvant 


Crewless 


43,000 Ib 


250 mi 


July 21-Dec. 29,1987 Soyuz TM-3 
Alexander Viktorenko 
Alexander P. Alexandrov 


Mohammed Faris 


14,400 Ib 
Approx. 2576 rev 
230 mi 


Dec. 21, 1987-June 17, Soyuz TM-4 
1988 Vladimir Titov 
Musa Manarov 
Anatoliy Levchenko 


14,400 Ib 


230 mi 


June 7-Sept. 7, 1988 Soyuz TM-5 
Anatoly Solovyov 
Viktor Savinykh 


Alexander Alexandrov 


14,400 Ib 
162 rev 
225 mi 


14,400 Ib 
138 rev 
221 mi 


Aug. 29-Dec. 21,1988 Soyuz TM-6 
Vladimir Lyakhov 
Valery Polyakov 
Abdullah Ahad 
Mohmand 


174 days 1h 
22 min/ 
Laveikin; 
8 days 20h 
1 min/Faris 
and Viktorenko 


161 days/ 
Alexandrov; 
326 days/ 
Romanenko; 
8 days/ 
Levchenko 


366 days 22h 
39 min/Titov 
and Manarov; 
7 days 21h 
2 min/ 
Levchenko 


10 days 8h 
5 min (crew) 


8 days 20h 
27 min/Lyakov 
and Mohmand; 
241 days/ 
Polyakov 


First fiight to Mir in 1987 was the crewless Progress 27 
tanker/transport, launched on Jan. 16, and docked to Mir on 
Jan. 18, to provision the station for the subsequent crew visit. 
After liftoff in a new, more advanced Soyuz version, crew docked 
at Mir on Feb. 8 for a long-duration attempt. After an initial zero-g 
adaptation problem experienced by Laveikin, crew settled down to 
conduct materials processing work. On Mar. 5, Progress 28, anew 
tanker/transport, docked with Mirs rear port and was unloaded. 
On Mar. 9, its engines boosted Mir to increase its orbit altitude. 


An astrophysics module, Kvant, launched on Mar. 31, failed to dock 
to the Mir core on Apr. 5. After a second fruitless attempt on Apr. 
9, the crew corrected the problem—a piece of cloth left by the 
ground crew—ina 3 h 40 min EVA on Apr. 12, and docking was 
accomplished. Kvant's propulsion module was jettisoned Apr. 13, 
making the module’s rear docking port accessible. Resupply ship 
Progress 29, launched Apr. 21, docked to this port on Apr. 23. After 
unloading and taking on stored waste, it burned up in reentry on 
May 11. On May 13, Progress 30 was launched with fresh supplies 
and mail. In a second and third EVA, on June 12 and 16, the crew 
installed a third solar power array on the station. 


After launch from Baikonur, the crew, which included the first 
Syrian cosmonaut (Faris), docked with Mir on July 24. On July 29, 
Viktorenko and Faris, accompanied by Laveikin who had shown an 
abnormal electrocardiogram reading, undocked Soyuz TM-2 and 
landed in Kazakhstan early on July 30. Romanenko and Alexandrov 
continued the mission on Mir. Resupply: Progress 31, docked 
Aug. 7, separated Sept. 22; Progress 32, launched Sept. 22, 
docked Sept. 26. On Sept. 30, Romanenko broke the current 
record of 237 days. After separation of Progress 32, resupply ship 
Progress 33 docked on Nov. 23. When Yuri Romanenko returned 
Dec. 29, 1987, in TM-3 with Alexandrov and Anatoliy Levchenko 
(of TM-4), he had set a new record for the longest crewed 
space flight to date-326 days. 


Docked on Dec. 23, after 33 revolutions, to Mir. When, on Dec. 29, 
Levchenko (a shuttle test pilot), Alexandrov, and Romanenko 
returned to Earth in TM-3, landing at 12:16 pm. about 50 mi from 
Arkalyk in Kazakhstan. Romanenko had set a new world record of 
326 days in space. Titov and Manarov went on for a long-duration 
stay in Mir, slightly over 1 year. Resupply ship Progress 34 was 
launched Jan. 21, 1988, and docked early on Jan. 23, Progress 35 
followed on Mar. 24 (undocked May 5), and Progress 36 on May 15. 


Docked. to Mir on June 9. The 10-day mission was the 63d crewed 
flight since Gagarin. Alexandrov was the second Bulgarian (after 
Georgi Ivanov) and the 13th foreign national flown by the Soviets 
since 1978. Countdown and launch were widely televised. Visitor 
crew returned to Earth on June 17 in TM-4, leaving the “fresh” 
TM-5 docked to Mir. On June 18, Titov and Manarov transferred 
the TM-5 from the aft to the forward docking port. During a 5-h 
EVA on June 30, Titov and Manarov did not succeed in replacing a 
faulty detector unit on a British/Dutch x-ray telescope outside Mir, 
because of a broken wrench. Progress 37, the next resupply ship, 
was launched July 19. 


TM-6 was launched for a trip to Mir, with a crew including a 
physician, Polyakov, who was to stay aboard Mir to observe Titov 
and Manarov, and an Afghan cosmonaut, Mohmand. Docking 
occurred on Aug. 31. Since Lyakhov had received special training 
as a rescuer-cosmonaut, he was able to handle TIV-6 alone. After 
undocking from Mir in Soyuz TM-5 of the previous mission, Lyakhov 
and Mohmand prepared for reentry at 3 a.m. but became stranded 
in orbit due to malfunction of a guidance system sensor, followed 
by computer input errors. With oxygen supply threatening to run 
out after 48 h, the twice-failed retroburn was achieved 24 h later 
after preventive measures. TM-5 landed on Sept. 7, 100 mi from 
Dzhezkazgan in Kazakhstan. On Sept. 12, the crewless resupply 
ship Progress 38 docked with Mir after Soyuz TM-6 had been 
moved, on Sept. 8, from the Kvant astrophysics module to the 
airlock of the station. Progress 39 was launched Sept. 9, carrying 
French equipment for an upcoming visit by a French cosmonaut. 





1Ib=045kg. 1mi=1.6km. 1ft=03m. 1yd=0.9m. 1 fathom = 1.8m. 





1 Ib/in.?2 = 6.9 kPa. 
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Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
Aug. 29-Dec. 21, In a 4-h EVA on Oct. 20, Titov and Manarov successfully repaired 
1988 (cont.) a telescope in the Kvant module, finishing an earlier attempt of 
June 30. 
Sept. 29-Oct. 3, 1988 STS 267 253,700 Ib 4 days 1h The United States’ return to space. After a 32-month hiatus, NASA 
Frederick H. Hauck 645 rev 57 min astronauts were back in space on the orbiter Discovery, after a 
Richard O. Covey 188 mi flawless launch at 11:37 a.m. EDT. At 6 h 13 min into the mission 
John M. (Mike) Lounge Nelson deployed the prime payload, NASA's second Tracking and 
David C. Hilmers Data Relay Satellite (TDRS) for transfer to geostationary orbit. 
George D. Nelson Secondary payloads were 11 experiments, including 2 by students. 
It was the seventh flight of Discovery, with over 200 engineering 
changes since the Challenger loss. Touchdown occurred at 
12:37 pm. EDT at Edwards, with a rollout distance of 7451 ft. 
Nov. 15, 1988 Buran 224,000 Ib 3h 25 min First flight of Soviet reusable space shuttle Buran (Blizzard). 
Crewless 2 rev Launched by second flight of Energia launch vehicle at 6 a.m. local 
156 mi time, landed two orbits later on runway at Baikonur, 7.5 mi from 
launch pad. 
Nov. 26, 1988-Apr. 27, Soyuz TM-7 14,400 Ib 25 days/ Second flight of French cosmonaut Chrétien to a Soviet space 
1989 Alexandrovitch Volkov — Chrétien; station. Mission, named “Aragatz,” docked on Mir on Nov. 28. 
Sergei Krikalev 230 mi 152 days/ Chrétien became first French citizen to perform an EVA when he 
Jean-Loup Chrétien Volkov and installed the deployable antenna structure ERA in space on 
Krikalev Dec. 9. On Dec. 21, Chrétien returned in Soyuz TM-6 with the long- 
duration crew, Titov and Manarov, who on this day completed 366 
days on Mir. Volkov and Krikalev stayed in space with Polyakov until 
Apr. 27, 1989, when they returned in TM-7, leaving Mir crewless 
for the first time in 2 years. During its occupied period, the space 
station had been resupplied by a total of 17 Progress flights with 
about 40 tons of goods. 
Dec. 2-6, 1988 STS 27 = 4 days 9 h Launched as second flight after the Challenger accident, Atlantis 
Robert L. Gibson 70(?) rev 6 min 19 s carried a radar imaging reconnaissance satellite on a classified 
Guy S. Gardner — Department of Defense mission into a high-inclination orbit. 
Mike Mullane Returned to Edwards after highly successful mission. 
Jerry L. Ross 
William M. Shepherd 
March 13-18, 1989 STS 29 263,290 Ib 4 days 23 h Discovery, on this third flight after the Challenger disaster, de- 
Michael L. Coats 79 rev 39 min ployed its main payload, the third Tracking and Data Relay Satellite, 
John E. Blaha 206 mi TDRS-D, completing the constellation of NASA’s new communi- 
James F. Buchli cations system. After conducting a number of physical, biological, 
Robert C. Springer medical, and materials experiment, Discovery returned to Edwards 
James P. Bagian AFB. 
May 4-8, 1989 STS 30 217,513 Ib 4 days 56 min Originally scheduled for Apr. 28, the launch of Atlantis was aborted 
David M. Walker 64 rev 38 s at T-31 due to failure of the hydrogen recirculation pump of a 
Ronald J. Grabe 185 mi main engine and rescheduled for May 4. On that day, Atlantis 
Norman E. Thagard lifted off and, 6% h later, placed the Venus radar mapper 
Mary L. Cleave probe Magellan on its trajectory to the planet Venus, the first 
Mark C. Lee interplanetary probe launched by the United States in nearly 
11 years. Atlantis landed at Edwards after a highly successful 
mission. 
Aug. 8-13, 1989 STS 28 = 5days1h56s Fifth flight after the Challenger accident. The extensively refur- 
Brewster H. Shaw, Jr. 80 rev bished Columbia, on its first trip after over 3 years of inactivity, 
Richard N. Richards 192 mi carried a classified Department of Defense payload into a 57° in- 
James C. Adamson clination orbit. Landing at Edwards took place after a successful 
Mark N. Brown mission. 
David C. Leestma 
Sept. 6, 1989-Feb. 19, Soyuz TM-8 14,400 Ib 166 days 7h After a pause of almost 5 months in its crewed program, the Soviet 
1990 Alexander Viktorenko Approx. 2662 rev 58 min Union launched TM-8 to Mir, the 67th crewed flight of the Soviet 
Alexander Serebrov 230 mi Union, carrying Mir’s 20th and 21st occupants. Docking on Mir 


occurred on Sept. 8, executed manually after a last-minute techni- 
cal problem with the automatic rendezvous/docking system of the 
TM-8. The arrival of the crew had been preceded by the docking 
of Progress M1, the first of a new series of crewless resupply craft, 
able to carry 5952 Ib of cargo (882 Ib more than the earlier version), 
and equipped with a recoverable entry capsule (not used on this 
flight). 





1 Ib = 0.45 kg. 


1 mi = 1.6 km. 


1 ft = 0.3 m. 


1 yd = 0.9 m. 


1 fathom = 1.8 m. 


1 Ib/in.2 = 6.9 kPa. 


t After the Challenger accident, shuttle mission designations returned to consecutive numbering. Unavoidable changes in flight assignments, however, resulted in numbered flights not being 


flown in sequence. 
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Remarks 


Launch of Atlantis with the Galileo Jupiter probe as primary payload 
occurred after two launch postponements, one from Oct. 12 due 
to a main engine controller problem, the other from Oct. 17 
due to inclement weather. At 7:15 pm., the 38,500-lb payload 
was successfully deployed; at 8:15 pm. the Inertial Upper Stage 
(IUS) first stage was fired, followed by second-stage ignition at 
8:20. The 2-ton Galileo separated from the IUS at 9:05 with 
deployed booms, was spun up to 2.8 revolutions per min at 
9:30 Pm., and began its 6-year voyage to Jupiter, which includes 
one Venus and two Earth flybys for gravity assist. After conducting 
in-space experiments on material processing, plant life, ozone 
monitoring, ice crystals, and human physiology, Atlantis returned 
to Earth two orbits early due to adverse wind at the landing 
site. 


On the seventh shuttle flight after the Challenger accident, the 
Discovery lifted off in darkness, carrying a classified military 
payload into a low-inclination orbit. After a successful mission, 
the vehicle landed at Edwards. It was the third night launch of the 
shuttle program. 


Kvant 2, the second of four large building blocks of the Soviet 
Mir complex, was launched on a Proton. It docked to Mir on Dec. 
6, after a lengthy delay caused by a multistep trouble-shooting 
program to deploy a stuck solar array and subsequent rendezvous 
problems. After the successful docking. Kvant 2, which carried 
systems and equipment to expand Mir’s operational capability and 
a crewed maneuvering unit for extravehicular activity, generated 
3.5 kW of power from each of its two solar arrays. 


Longest shuttle mission to date. Originally scheduled for Dec. 18, 
1989, but delayed due to unfinished launch pad work and one 
weather-caused slip (on Jan. 8). Launched on its mission to deploy 
the Navy communications satellite Syncom IV-F5 (on Jan. 10) and 
retrieve the Long Duration Exposure Facility (DEF), a 12-sided, 
open-grid aluminum satellite, 30 ft long, 14 ft in diameter, and 10.5 
tons in mass, carrying 57 experiments. LDEF was originally orbited 
by Challenger (STS 41C) on Apr. 6, 1984, but its planned retrieval 
1 year later was delayed by schedule slips and the Challenger 
accident. After Columbia's rendezvous with LDEF on Jan. 12, 
Mission Specialist Dunbar used the Remote Manipulator System 
arm to grapple the research facility, 2093 days and 32,419 orbits 
after LDEF’s launch. Columbia landed at Edwads, delayed 2 
days by ground fog. Heaviest landing weight (by 5 tons) to date. 


Docked with Mir on Feb. 13 to relieve Viktorenko and Serebrov. 
Earlier in February, they had conducted several spacewalks 
(EVA) to test a crewed maneuvering unit named /kar (Icarus): 
Serebrov on four flights on Feb. 1 during a 5-h EVA, Viktorenko 
several times on a 3-h 45-min EVA on Feb. 5. The cosmonauts also 
tested a modified space suit and portable life-support system. The 
Soviet crewed maneuvering unit weighed 440 Ib and was propelled 
by 36 compressed-air thrusters. Solovyev and Balantin became 
the 22d and 23d occupants of Mir. Viktorenko and Serebrov 
returned to Earth on Feb. 19, landing in Soyuz TM-8 near Arkalyk, 
205 mi northeast of Baikonur, in good health after 166 days 
in space, including five spacewalks and dozens of experiment 
programs. 


Launched as ninth flight since Challenger loss, after five delays due 
to head cold of the Commander, bad weather, and a ground com- 
puter glitch. Atlantis deployed a secret intelligence-gathering sat- 
ellite on a Department of Defense mission in a highly inclined orbit 
(62°). Returned to Edwards AFB after completing a fully successful 
mission. 


The long-awaited launch of the Hubble Space Telescope, 
20 years in planning and development, took place after an ear- 
lier launch attempt, on Apr. 10, was scrubbed due to malfunc- 
tion of a valve in one of Discovery’s auxiliary power units. The 








TABLE 1. Crewed space flights 1961-2006 (cont.) 
A. Weight 
B. Revolutions 
Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration 
Oct. 18-23, 1989 STS 34 264,775 Ib 4 days 23h 
Donald E. Williams 79 rev 40 min 
Michael J. McCulley 206 mi 
Shannon W. Lucid 
Franklin R. Chang-Diaz 
Ellen S. Baker 
Nov. 22-27, 1989 SIS 33 = 5 days 7 min 
Frederick D. Gregory 78(?) rev 
John E. Blaha — 
Manley L. (Sonny) Carter 
Kathryn C. Thornton 
F. Story Musgrave 
Nov. 26, 1989 Kvant 2 43,000 Ib — 
Crewless — 
250 mi 
Jan. 9-20, 1990 STS 32 229,848 Ib 10 days 21h 
Daniel C. Brandenstein 172 rev 52s 
James D. Wetherbee 224 mi 
Bonnie J. Dunbar 
Marsha S. Ivins 
G. David Low 
Feb. 11-Aug. 9, Soyuz TM-9 14,400 Ib 179 days 
1990 Anatoliy Solovyev 230 rev 
Alexander Balandin =; 
Feb. 28-Mar. 4, STS 36 — 4 days 10 h 
1990 John O. Creighton 71 rev 19 min 
John H. Casper — 
Michael Mullane 
David C. Hilmers 
Pierre J. Thuot 
Apr. 24-29, 1990 STS 31 259,229 Ib 5 days 1h 
Loren J. Shriver 76 rev 15 min 
Charles F. Bolden 381 mi 
Steven A. Hawley 
1 Ib = 0.45 kg. 1 mi = 1.6 km. 1 f =0.3m. 1 yd = 0.9 m. 1 fathom = 1.8 m. 1 Ib/in.2 = 6.9 kPa. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 





Dates launched 
and recovered 


Designation 
and crew 


A. Weight 

B. Revolutions 

C. Max. distance 
from Earth 


Duration 


Remarks 





Apr. 24-29, 
1990 (cont.) 


May 31, 1990 


Aug. 1—Dec. 10, 
1990 


Oct. 6-10, 1990 


Nov. 15-20, 1990 


Dec. 2-11, 1990 


Bruce McCandless 
Kathryn D. Sullivan 


Kristall 
Crewless 


Soyuz TM-10 
Gennaidy Manakov 
Gennaidy Strekalov 


STS 41 
Richard N. Richards 
Robert D. Cabana 
Bruce E. Meinick 
William M. Shepherd 
Thomas D. Akers 


STS 38 
Richard O. Covey 
Frank L. Culbertson 
Carl J. Meade 
Robert C. Springer 
Charles D. (“Sam”) 
Gemar 


STS 35 
Vance D. Brand 
Guy S. Gardner 
John M. Lounge 
Robert A. R. Parker 
Ronald A. Parise 
Jeffrey A. Hoffman 
Samuel T. Durrance 


43,000 Ib 
250 rev 


14,400 Ib 
Approx. 2096 rev 
235 mi 


256,330 Ib 
65 rev 
206 mi 


79 rev 
130 mi 


267,513 Ib 
143 rev 
220 mi 


131 days 


4 days 2h 
10 min 


4 days 21 h 
54 min 32 s 


8 days 23 h 
6min5s 


12.5-ton spacecraft, 42.5 x 14 ft, was placed into space by 
the manipulator arm on Apr. 25 at 8:45 a.m. EDT and released, 
after deployment of its solar arrays and a checkout, at 3:38 p.m. 
For the remainder of the mission, the crew carried out medical 
and engineering tests in the shuttle middeck. Discovery landed at 
Edwards AFB. 


As third major addition to Mir, the technology module Kristall 
docked to the Soviet space station on June 10. Nearly identical 
to the previous module, Kvant 2, with length of 45 ft, width 
of 14.3 ft, and volume of 2000 ft°, Kristal! forms the second 
of four spokes of the Mir complex. Equipped with furnaces for 
materials experiments in space, it also has a docking port for the 
Buran space shuttle. Docking took two attempts, using the backup 
system to a failed maneuvering jet. On June 11, Kristall was 
moved by the Ljappa pivot arm from forward docking port to 
a lateral port opposite the Kvant 2 module. With core, Kvant 1, 
Kvant 2, Kristall, and Soyuz-TM, Mir now weighed 183,000 Ib. 
Kristall also carried equipment to repair loose insulation panels on 
Soyuz TM-9. Repair took place on July 17, but Solovyev and 
Balandin encountered difficulties in closing the Kvant 2 outer 
airlock door, damaged on exit, and exceeded the life-support 
capability of their spacesuits. Crew used emergency entry method 
(depressurizing Kvant 2), conducted 3-h EVA to repair hatch on 
July 26, and returned to Earth on Aug. 9, one week after arrival of 
TM-10, landing at Arkalyk in Kazakhstan with 286 Ib of material. 


Launched to replace TM-9, it docked with Mir on Aug. 3 to 
continue work of previous crew (which had completed 506 of 520 
planned science experiments), concentrating mostly on materials 
manufacturing with the Kristall module and switching solar panels 
from Kristall to Kvant 1. Crewless Progress M-4 was launched Aug. 
15 and docked Aug. 17, to bring supplies and boost Mir again up to 
248 mi altitude. Progress M-4 was used to check out a new feature, 
a returnable capsule for returning material to Earth from Mir, prior 
to its use on Progress M-5. Manakov and Strekalov returned on 
Dec. 10. 


Replacing the postponed STS 35 (Columbia), Discovery was 
launched with the European-built out-of-the-ecliptic solar probe 
Ulysses plus additional scientific experiments on board. Ulysses 
was deployed during the fifth orbit and launched by its own 
IUS/PAM rocket system. The fastest artificial object, it started its 
5-year mission via a gravity-assist slingshot maneuver around 
Jupiter in Feb. 1992. It passed out of the Sun in summer 1994 and 
spring 1995, respectively. 


On this fifth night launch, Atlantis, with a classified DOD payload, 
lifted off at 6:48 p.m. EST on it seventh flight. The planned landing 
at Edwards on Nov. 19 was canceled due to high winds and look 
place 1 day later at Kennedy at about 4:43 p.m. EST. It was the 
sixth Kennedy landing, 51/5 years after the last one (Discovery, on 
Apr. 5, 1985). 


The first shuttle mission in 7 years dedicated to a single science 
discipline: astrophysics. After 6 months delay due to a faulty 
cooling system valve, hydrogen leaks, and payload electronics 
repair, Columbia’s tenth flight took off with ASTRO 1 (two Spacelab 
pallets with three ultraviolet telescopes on a pointing system, and 
one x-ray telescope) to explore the hottest, albeit invisible parts of 
the universe from the Orbiter cargo bay. The flight featured more 
than 80 h of observations (about 394) of about 135 science targets 
in the universe, starting 23 h after orbit insertion and ending 4 h 
before deorbit burn. Also, an experiment of communicating with 
amateur radio stations within line-of-sight of Columbia in voice and 
data mode (SAREX) was conducted by Parise. During orbit 39 on 
Dec. 4, Columbia and Mir were only 31 mi apart, and Mir was 
sighted twice from the shuttle. STS 35 landed at Edwards one day 
earlier than planned because of bad weather that was forecast for 
later, after a mission troubled by computer, pointing, and plumbing 
problems. 





1 Ib = 0.45 kg. 


1 mi = 1.6 km. 


1 ft = 0.3 m. 


1yd=0.9m. 


1 fathom = 1.8m. 


1 Ib/in.2 = 6.9 kPa. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 





A. Weight 
B. Revolutions 





June 5-14, 1991 


Aug. 2-11, 1991 


Sept. 12-18, 1991 


STS 40 


Bryan D. O’Connor 
Sidney M. Gutierrez 
Tamara E. Jernigan 
M. Rhea Seddon 
James P. Bagian 

F. Drew Gaffney 
Millie Huges-Fulford 


STS 43 


John E. Blaha 
Michael A. Baker 
Shannon W. Lucid 
G. David Low 
James C. Adamson 


STS 48 


John O. Creighton 
Kenneth S. Reightler 
Mark N. Brown 
James F. Buchli 
Charles D. Gemar 


241,175 Ib 
145 rev 
185 mi 


247,965 Ib 
140 rev 
207 mi 


224,141 Ib 
81 rev 
356 mi 


and Manarov 


9 days 2 h 
15 min 


8 days 21 h 
21 min 


5 days 8 h 
27 min 


Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
Dec. 2-May 26, Soyuz TM-11 14,400 Ib 8 days/Akiyama; Lifting off with the first Japanese in space on board, a journalist, 
1990 Viktor Afanasyev Approx. 2740 rev 131 days/ Soyuz TM-11 docked at Mir 2 days later, making up a record num- 
Musa Manarov 230 mi Manakov ber of 12 humans in space at the same time (5 in Mir, 7 in STS 35). 
Toyohiro Akiyama and Strekalov Cosmonauts Manakov, Strekalov, and Akiyama returned on Dec. 
10, in TM-10 with Afanasyev and Manarov taking over on Mir, 
Resupply ship Progress M7, launched Mar. 19, failed twice to 
dock to Mir’s Kvant module. On the second attempt on Mar. 23, 
ground controllers barely avoided a collision between Mir and 
M7. On March 26, the crew relocated Soyuz TM-11 from the main 
port to the Kvant 1 module and docked M7 successfully to that 
port on Mar. 28. 
Apr. 5-11, 1991 STS 37 240,809 Ib 5 days 23 h Atlantis lifted off with the 35,000-lb Gamma Ray Observatory 
Steven Nagel 92 rev 32 min (GRO), the heaviest NASA science satellite ever carried by a 
Kenneth D. Cameron 281 mi shuttle. Equipped with four instruments to observe celestial high- 
Jerry L. Ross energy sources with 10 times the sensitivity of prior experiments, 
Jerome Apt GRO was set free on April 7 after an unplanned emergency EVA 
Linda M. Godwin by Ross and Apt to deploy GRO’s stuck high-gain antenna—the 
first United States spacewalk since Dec. 1985. On Apr. 8, Ross 
and Apt performed a scheduled 6-h EVA to test various crew 
equipment translation aids such as carts on a track, for use on 
the exterior of the space station. 
Apr. 28-May 6, STS 39 236,623 Ib 8 days 7h Discovery made the first unclassified DOD-dedicated shut- 
1991 Michael L. Coats 133 rev 22 min tle mission. The mission was highlighted by around-the-clock 
L. Blaine Hammond, Jr. 162 mi observations of the atmosphere, gas release from subsatellites, 
Gregory J. Harbaugh shuttle engine firings, and the shuttle’s orbital environment in 
Donald R. McMonagle wavelengths ranging from infared to the far-ultraviolet, including 
Guion S. Bluford the aurora australis. 
C. Lacy Veach 
Richard J. Hieb 
May 18-Oct. 10, Soyuz TM-12 14,400 Ib 7 days 19h With the first British space traveler, food technologist Helen 
1991 Anatoly Artsebarsky Approx. 2265 rev 5 min/ Sharman, as part of Britain’s commercial project Juno, Soyuz 
Sergei Krikalev 230 mi Sharman; docked at Mir on May 20, the ninth crew to visit the station. 
Helen Sharman 175 days 2h Earlier, on Apr. 25, the Mir crew, during a 5-h EVA, inspected 
48 min/ the automatic docking system antenna that had nearly caused 
Afanasyev the collision with the Progress M7 automated transport on Mar. 


23. M7 was discarded to burn up in the atmosphere. Sharman, 
Afanasyov, and Manarov returned on May 26 in TM-117, landing 
in Kazakhstan. In the first 3 months of their stay, Artsebarsky and 
Krikalev made six spacewalks totalling a record 32 h, for such 
tasks as repairing a docking antenna and a television camera, 
deploying instruments, and assembling a 46-ft-tall towerlike 
structure called Sofora, to carry reaction jets, on Mir. On Aug. 
16, Progress M8 was separated from Mir, and on Aug. 20 new 
supplies were brought up on M9 in a night launch. Soyuz TM-12 
returned on Oct. 10 with Artsebarsky, Aubakirov, and Viehboeck. 


Columbia carried Spacelab Life Sciences 1 (SLS 1), with three 
medical doctors, one biochemistry researcher, and one physi- 
cist on board, to perform 18 major experiments on the cardio- 
vascular, renal/endocrine, blood, immune, musculoskeletal, and 
neurovestibular systems. Test objects included white rats and 
jellyfish. 


The Atlantis flight put NASA’s fourth Tracking and Data 
Relay Satellite (TDRS-E) into space, boosted successfully to 
a geosynchronous orbit by an Inertial Upper Stage (IUS). 
Other payloads included an atmospheric ultraviolet calibration 
instrument to measure the Earth’s ozone layer, a heat pipe 
experiment, and an optical communications experiment. Atlantis 
landed at Kennedy. 


Discovery carried the giant Upper Atmosphere Research Satellite 
(UARS) into a57*-inclination orbit, the first major flight element of 
NASA's Mission to Planet Earth program. The 14,400-Ib satellite 
carried nine complementary scientific instruments to provide a 
more complete understanding of the chemistry, dynamics, and 
energy input of the upper atmosphere, including the ozone layer, 
plus a radiometer measuring the Sun’s energy output. Other 
on-board work included microgravity research on crystal growth, 
fluids and structures behavior, life sciences, and radiation effects. 








1 Ib = 0.45 kg. 


1 mi = 1.6 km. 


1 ft = 0.3 m. 


1yd =0.9m. 


1 fathom = 1.8 m. 


1 Ib/in.2 = 6.9 kPa. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 





A. Weight 
B. Revolutions 





Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
Oct. 2—Mar. 25, Soyuz TM-13 14,400 Ib 8 days/ Soyuz docked with Mir on Oct. 4. Cosmonauts Artsebarsky, 
1991 Alexander Volkov Approx. 2740 rev Viehboeck Viehboeck (first space traveler from Austria), and Aubakirov (first 
Toktar Aubakirov 230 mi and Kazakh crew member) returned on Oct. 10 in TM-172, leaving 
Franz Viehboeck Aubakirov; Volkov and Krikalev in control of the space station. 
144 days 19 h/ 
Artsebarsky 
Nov. 24-Dec. 1, STS 44 247,087 Ib 6 days 22 h Atlantis, on a DOD-dedicated mission, carried a Defense Support 
1991 Frederick D. Gregory 108 rev 51 min Program (DSP) satellite for detection, from geosynchronous 
Terence T. Henricks 195 mi orbit, of nuclear detonations, and missile and space launches. 
Story Musgrave After its deployment on the first day of the mission, the crew 
Mario Runco, Jr. worked with a variety of secondary payloads, including naked- 
James S. Voss eye Earth observations, radiation measurements, and medical in- 
Thomas J. Hennen vestigations. Due to malfunction of one of three Inertial Measure- 
ment Units (IMUs), the flight had to be shortened by 3 days. 
Jan. 22-30, 1992 STS 42 231,497 Ib 8 days 1h Discovery carried the International Microgravity Laboratory 1 
Ronald J. Grabe 128 rev 16 min (IML 1), dedicated to worldwide research in the behavior of 
Stephen S. Oswald 188 mi materials and life in weightlessness. 225 scientists from 15 
Norman E. Thagard countries, experimenting in 42 research disciplines, cooperated in 
William F. Readdy this fight. Along with IML 1, the payload consisted of 12 Get Away 
David C. Hilmers Special (GAS) containers, the large-format IMAX camera, and two 
Roberta L. Bondar student experiments. The crew included foreign astronauts Bondar 
Ulf D. Merbold (Canada) and Merbold (Germany). 
Mar. 17-Aug. 10, Soyuz TM-14 14,400 Ib 312 days 6h De AT E T e E nace 
1992 Alexander Viktorenko Approx. 2280 rev 1 min/Krikalev; ene eee ee ee aE Onaren aerO EAE ENCE ran 
> : and returned with former Mir occupants Volkov and Krikalev 
eee ot ENCOUN Mar. 25, to land in Soyuz TM-13 in Kazakhstan, leavin 
Klaus-Dietrich Flade 52 min/Volkov; On 8h eo, ayaa y g 
NSh Viktorenko and Kaleri in Mir. In space since May 18, 1991, 
y 
S7 miniFade Krikalev was originally slated to return in late 1991, but the 
schedule was revised, leaving him in orbit for five extra months. 
On Feb. 20-21 he conducted his seventh spacewalk, fetching 
experiments left outside during previous EVAs, and setting a new 
record. 
Mar. 24-Apr. 2, STS 45 222,086 Ib 8 days 22 h The multinational flight of Atlantis, including a Belgian payload spe- 
1992 Charles Bolden 142 rev 9 min 20 s cialist (Frimout), carried the first ATLAS (Atmospheric Laboratory 
Brian Duffy 182 mi for Applications and Science) associated with NASA's Mission to 
C. Michael Foale Planet Earth initiative. The ATLAS payload of 14 instruments re- 
David Leestma mained attached to the Orbiter, most of them mounted on two 
Kathryn Sullivan Spacelab pallets. A Far Ultraviolet Space Telescope (FAUST) was 
Dirk Frimout also employed. 
Byron Lichtenberg 
May 7-16, 1992 STS 49 246,008 Ib 8 days 21h This was the first mission featuring four EVAs, and the maiden 
Daniel C. Brandenstein 140 rev 18 min 36s voyage of the orbiter Endeavour. The main mission was reboost- 
Kevin P. Chilton 181 mi ing the 11.7-ft-diameter, 17.5-ft-high communications satellite 
Richard J. Hieb INTELSAT VI (F-3) launched 2 years earlier by a Titan rocket, 
Bruce E. Melnick but stranded in a useless orbit when the Titan’s second stage 
Pierre J. Thuout failed to separate. After rendezvous with the satellite, Thuot and 
Kathryn C. Thornton Hieb, on two spacewalks on flight days 4 and 5, failed to cap- 
Thomas D. Akers ture the 8960-Ib satellite. Joined by Akers on May 13, however, 
they succeeded in grasping it with their hands, berthing it in the 
shuttle cargo bay with the remote manipulator arm, attaching it 
to a new perigee kick motor, and redeploying it. This first three- 
person EVA in history lasted 8 h 36 min, the longest spacewalk 
ever conducted. The solid-fuel motor was fired remotely from 
the ground on May 14, taking INTELSAT VI to its final desti- 
nation in a geosynchronous orbit. In a fourth EVA on May 14, 
Thornton and Akers evaluated space-station assembly equipment 
and techniques as well as several prototype crew self-rescue 
devices. 
The first extended-duration (EDO) shuttle flight. Columbia, spe- 
June 25-July 9, STS 50 ? 245,902 Ib 13 days 19h cially modified for 2-week missions, carried the first U.S. 
VRE: ee eels aout ae Microgravity Laboratory 1 (USML 1), a Spacelab module 
Kenneth D. Bowersox 185 mi 4 


Bonnie J. Dunbar 
Carl J. Meade 

Ellen S. Baker 
Lawence J. DeLucas 
Eugene H. Trinh 


equipped with 31 experiments for advanced research in zero 
g. The longest shuttle flight to date, it supported investigations 
of the effects of weightlessness on plants, humans, and mate- 
rials. The mission also carried investigations into polymer mem- 
brane manufacturing, the Space Shuttle Amateur Radio Experi- 
ment II (SAREX-II), and numerous other cargo bay, middeck, and 
secondary payloads. 





1 Ib = 0.45 kg. 


1 mi = 1.6 km. 1 ft = 0.3 m. 


1 yd = 0.9m. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 





A. Weight 
B. Revolutions 








1 Ib = 0.45 kg. 


1 mi = 1.6 km. 1 Ib/in.2 = 6.9 kPa. 


1 ft = 0.3m. 


1yd = 0.9 m. 1 fathom = 1.8 m. 


Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
July 27-Feb. 1, Soyuz TM-15 14,400 Ib 146 days/Kaleri The third Russian-French mission since 1982, Soyuz carried French 
1992 Anatoliy Solovyev 2946 rev and Viktorenko; researcher Tognini, to conduct 10 biomedical, fluid and materials 
Sergei Avdeyev 245 mi 14/ Tognini science and technological experiments on board Mir. Tognini 
Michel Tognini returned on Aug. 10 in Soyuz TM-14 along with Viktorenko and 
Kaleri. At touchdown in Kazakhstan, the spherical TM 14 rolled 
over, briefly trapping but not injuring the crew. Solovyev and 
Avdeyev remained in Mir with numerous tasks such as installing 
new gyrodynes and rocket thrusters for attitude control. Progress 
M-13 was launched on Aug. 14, carrying supplies and medical 
gear to Mir. 
July 31-Aug. 8, STS 46 241,797 Ib 7 days 23 h Atlantis successfully deployed (on Aug. 2) the first European 
1992 Loren J. Shriver 126 rev 14 min 12 s Retrievable Carrier (EURECA) platform, carrying seeds, spores, 
Andrew M. Allen 266 mi crystals, and other experimental materials for research in zero g, 
Jeffrey A. Hoffman to be retrieved about 9 months later. Due to data errors, EURECA’s 
Franklin R. Chang-Diaz orbital self-transfer to its final altitude of 313 mi occurred later than 
Marsha S. Ivins planned but was accomplished on Aug. 7, Deployment of the main 
Claude Nicollier payload, the 1142-Ib Italian Tethered Satellite System (TSS 1) on 
Franco Malerba a 12.5-mi-long cable from the Atlantis, however, was terminated 
when a breakdown in the reel mechanism halted the tether after 
only 850 ft. The crew included two foreign astronauts, Malerba 
(Italy) and Nicollier (Switzerland) for ESA. 
Sept. 12-20, 1992 STS 47 232,661 Ib 7 days 22h Endeavour carried the Japan-chartered Spacelab SL-J with 44 
Robert L. Gibson 126 rev 30 min materials and life sciences experiments and nine Get Away Special 
Curtis L. Brown, Jr. 192 mi (GAS) payload canisters. The cargo also included the Israeli Space 
Mark C. Lee Agency Investigation About Hornets (ISAIAH) experiment and 
Jay Apt another flight of the Shuttle Amateur Radio Experiment (SAREX). 
N. Jan Davis The crew included the first Japanese shuttle astronaut (Mohri). 
May C. Jemison 
Mamoru Mohri 
Oct. 22—Nov. 1, STS 52 239,178 Ib 9 days 20h Columbia carried the U.S. Microgravity Payload (USMP 1) 
1992 James D. Wetherbee 158 rev 56 min as main cargo. The crew, with the third Canadian astronaut 
Michael A. Baker 189 mi (MacLean), performed materials and science experiments in micro- 
C. Lacy Veach g with the U.S. Microgravity Payload 2 (USMP 2), deployed the 
William M. Shepherd second Italian/U.S. Laser Geodynamic Satellite (LAGEOS 2), and 
Tamara E. Jernigan conducted other physical experiments including the Canadian 
Steven G. MacLean Experiment #2 (CANEX 2), and space-station-related investiga- 
tions. 
Dec. 2-9, 1992 STS 53 230,731 Ib 7 days 7 h As the last dedicated DOD mission, Discovery carried the satellite 
David M. Walker 116 rev 20 min DOD 1 as primary and two major experiments, the Glow/ 
Robert D. Cabana 237 mi Cryogenic Heat Pipe Experiment (GCP) and Orbital Debris Radar 
Guion S. Bluford Calibration Spheres (ODERACS), as secondary payloads. With 
James S. Voss GCP, remotely commanded special sensors observed orbiter 
Michael Richard Clifford surfaces, contamination events, and airglow, and measured the 
zero-g performance of liquid oxygen heat pipes. ODERACS had to 
be canceled when its container in the cargo bay failed to open. 
Jan. 13-19, 1993 STS 54 248,338 Ib 5 days 23 h Endeavour carried the sixth Tracking and Data Relay Satellite 
John H. Casper 188 rev 39 min (TDRS), its Inertial Upper Stage (IUS), and the Diffuse X-Ray 
Donald R. McMonagle 95 mi Spectrometer (DXS), consisting of two instruments attached to 
Mario Runco, Jr. the sides of the cargo bay. After successful deployment of TDRS- 
Susan J. Helms F, crew activities included several microgravity experiments on 
Gregory J. Harbaugh materials and rodents, and the first in a series of test spacewalks 
leading up to space-station construction. Harbaugh and Runco, 
assisted by Helms from the cabin, tested EVA training procedures 
for the space station and the Hubble repair mission (STS 61). 
Using children’s toys, the crew responded to students from 
various elementary schools asking questions about physics in 
zero g. 
Jan. 24-July 22, Soyuz TM-16 14,400 Ib 189 days/ Equipped with the new androgynous docking system APAS-89, to 
1993 Gennadiy Manakov Approx. 2790 rev Solovyev be tested for use in the newly planned Mir/space shuttle docking 
Alexander Polishchuk 242 mi and Avdeyev mission, Soyuz docked with Mir, for the first time, at the Kristall 


module’s docking port. The previous crew, Solovyev and Avdeyev, 
returned on Feb. 1 in Soyuz TM-15 after 189 days in space. On Feb. 
4, Progress M-15 was undocked from Mir’s aft port and used to 
deploy, by centrifugal force, a 66-ft-diameter solar sail made from 
superthin aluminum foil. The purpose of the test, which reflected 
a visible light beam to Earth, was to assess the force of photons 
from the Sun for propulsive application. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 





A. Weight 
B. Revolutions 





Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
Apr. 8-17, 1993 STS 56 225,597 Ib 9 days 6h Discovery carried ATLAS 2, the second in NASA's series of 
Kenneth D. Cameron 147 rev 8 min Atmospheric Laboratory for Applications and Science missions. 
Stephen S. Oswald 185 mi The remote sensing instruments of ATLAS studied the Sun’s 
Michael Foale energy output and Earth’s middle-atmosphere chemical makeup, 
Kenneth D. Cockrell particularly with regard to ozone depletion. The crew on Apr. 11 
Ellen Ochoa deployed Spartan-201, a free-flying platform carrying instruments 
to study the Sun’s corona and the solar wind, and recovered the 
platform on Apr. 13. 
Apr. 26-May 6, STS 55 244,156 Ib 9 days 23h Columbia carried the Spacelab module on its eleventh flight, 
1993 Steven R. Nagel 158 rev 40 min chartered for the second time by the German space agency 
Terence T. Henricks 187 mi DARA, with two German payload specialists (Walter, Schlegel) 
Jerry L. Ross among its crew. The Spacelab D2 mission was devoted to zero- 
Charles J. Precourt g-relevant investigations in a wide range of science disciplines, 
Bernard A. Harris including materials research; life sciences; technology experiments 
Ulrich Walter in optics, robotics, automation, and medical applications; Earth 
Hans Wihelm Schlegel observations; atmospheric physics; and astronomy, for a total of 
about 90 different projects. Many investigations were aimed at 
developing, testing, and verifying future operations/utilization in 
permanent space-station laboratories. 
June 21-July 1, STS 57 239,319 Ib 9 days 23h Highlights of the Endeavour’s flight were the first flight of the 
1993 Ronald Grabe 158 rev 46 min privately developed middeck module Spacehab and the retrieval 
Brian Duffy 289 mi of the European space platform EURECA, in orbit since its 
David Low deployment from STS 46 on Aug. 1,1992. Spacehab experiments 
Nancy Sherlock included 13 commercial payloads, ranging from drug improvement 
Peter (Jeff) J. K. Wisoff to feeding plants, cell splitting, the first soldering experiment in 
Janice Voss space by United States astronauts, and high-temperature melting 
of metals; and six NASA experiments involving biotechnology, 
human factors, and water recycling for the future space station. 
Other payloads included SHOOT (Superfluid Helium On-Orbit 
Transfer) to demonstrate the resupply of liquid helium containers in 
space. A 5‘-h EVA by Low and Wisoff on June 25 served to latch 
two EURECA antennas and to refine procedures for the planned 
Hubble repair mission. 
July 1-Jan. 14, Soyuz TM-17 14,400 Ib 179 days/ Carrying the fourth French cosmonaut since 1982, Soyuz docked 
1993 Vasili Tsibliyev Approx. 2790 rev Polishchuk and with Mir on July 3. Resupply craft Progress M-18, docked on May 


Alexander Serebrov 
Jean-Pierre Haignere 


242 mi 


Manakov; 21 days/ 
Haignere 


24, was undocked on July 4 and reentered destructively, after 
separating a small ballistic Raduga (“Rainbow”) reentry capsule 
carrying experiments and recordings for a soft landing in Russia. 
During his stay at the station, Haignere conducted 11 major Altair 
experiments, then returned to Earth on July 22 with the previous 
crew, Polishchuk and Manakov, in TM-16. 





Sept. 12-22, 1993 STS 57 250,023 Ib 9 days 20 h 11 min Discovery carried two payloads. An Advanced Communication 
Frank Culbertson 157 rev Technology Satellite (ACTS), a testbed for technology leading to 
William Readdy 296 mi a new generation of comsats, was deployed and injected by a 
Jim Newman Transfer Orbit Stage (TOS) toward its geosynchronous location. 
Dan Bursch The German Orbiting and Retrievable Far and Extreme Ultraviolet 
Carl Walz Spectrometer-Shuttle Pallet Satellite (ORFEUS-SPAS) was set 
free by the robot arm and was retrieved 6 days later, the first of 
a series of such missions. Other mission tasks included a 7-h EVA 
by Walz and Newman on Sept. 16 as part of planning for the Hubble 
repair mission. 

Oct. 18-Nov. 1, STS 58 244,860 Ib 14 days 13 min 40s Columbia carried SLS-2, the second Spacelab dedicated to 
1993 John E. Blaha 224 rev life sciences research. NASA's longest shuttle mission to date 
Richard A. Searfoss 179 mi conducted, in a39° inclination orbit, a series of experiments to gain 
M. Rhea Seddon more knowledge on how the human body adapts to the weightless 
William S. McArthur, Jr. environment of space. The 14 experiments were performed on 
David A. Wolf the crew, which included two medical doctors (Seddon, Wolf), a 
Shannon W. Lucid physicist/biochemist (Lucid), and a veterinarian (Fettman), and on 

Martin J. Fettman laboratory animals (48 rats). 
Dec. 2-13, 1993 STS 61 236,670 Ib 10 days 19h Endeavour made the first in a series of planned visits to the orbiting 
Richard O. Covey 162 rev 58 min 35 s Hubble Space Telescope (HST). The crew included four astronauts 
Kenneth D. Bowersox 370 mi with considerable EVA experience (Hoffman, Musgrave, Thornton, 
Tom Akers Akers) and Swiss astronaut Nicollier (ESA). On Dec. 4, Nicollier 
Jeffrey A. Hoffman grappled the HST with the shuttle’s robot arm, berthing it in the 
F. Story Musgrave cargo bay. In five space walks during the following 5 days, the as- 
Claude Nicollier tronauts installed the Wide Field/Planetary Camera Il and the 
Kathryn Thornton Corrective Optics Space Telescope Axial Replacement (COSTAR) 

1 Ib = 0.45 kg. 1 mi = 1.6 km. 1 ft = 0.3 m. 1 yd = 0.9m. 1 fathom = 1.8 m. 1 Ib/in.2 = 6.9 kPa. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 


Dates launched 
and recovered 


Designation 
and crew 


A. Weight 


B. Revolutions 
C. Max. distance 


from Earth 


Duration 


Remarks 





Dec. 2-13, 
1993 (cont.) 


Jan. 8—July 9, 
1994 


Feb. 3-11, 1994 


Mar. 4-18, 1994 


Apr. 9-20, 1994 


July 1-Nov. 4, 
1994 


July 8-23, 1994 


Sept. 9-20, 1994 


Soyuz TM-18 
Viktor Afanasyev 
Yuri Usachov 
Valery Polyakov 


STS 60 
Charles F. Bolden 
Kenneth S. 

Reightler, Jr. 

N. Jan Davis 
Ronald M. Sega 
Franklin R. Chang-Diaz 
Sergei K. Krikalev 


STS 62 
John H. Casper 
Andrew M. Allen 
Charles D. Gemar 
Marsha S. Ivins 
Pierre J. Thuot 


STS 59 
Sidney M. Gutierrez 
Kevin P. Chilton 
Jerome Apt 
Michael R. Clifford 
Linda M. Godwin 
Thomas D. Jones 


Soyuz TM-19 
Yuri Malenchenko 
Talgat Musabayev 


STS 65 
Robert D. Cabana 
James Donald Halsell 
Richard J. Hieb 
Carl E. Walz 
Leroy Chiao 
Donald A. Thomas 
Chiaki Naito-Mukai 


STS 64 
Richard N. Richards 
L. Blaine Hammond 
Jerry Linenger 
Susan J. Helms 


14,400 Ib 
3072 rev 
242 mi 


233,290 Ib 
130 rev 
220 mi 


245,457 Ib 
223 rev 
188 mi 


237,048 Ib 
181 rev 
139 mi 


14,400 Ib 
1960 rev 
242 mi 


228,640 Ib 
(landing) 

235 rev 

188 mi 


210,916 Ib 
(landing) 

176 rev 

162 mi 


197 days 3 h 2 min/ 
Tsibliyev and 
Serebrov; 

437 days 
17h 59 min/ 
Polyakov 


8 days 7h9 min 


13 days 23h 
17.5 min 


11 days 5 h 50 min 


125 days 20 h 
53 min; 182 days/ 
Afanasyev and 
Usachov 


14 days 17h 
55 min 


10 days 22h 
50 min 


mechanism, both designed to compensate for the erroneous 
spherical aberration of the primary mirror. They also replaced the 
spacecraft’s solar arrays and installed new gyroscopes, additional 
computer memory, and other systems to increase HST reliability. 
Total EVA time accumulated by all crews: 35 h 28 min. HST was 
set free on Dec. 10. 


This was the fifteenth mission to the Mir station. Docking occurred 
on Jan. 10. Polyakov, a physician cosmonaut who had spent 241 
days in space in 1988 (TM-6), intended to conduct the longest 
human space flight in history to date to test the effects of long 
periods of weightlessness on the human body. He returned on 
TM-20 on Mar. 22, 1995, after 14.6 months in space. During the 
mission, three Progress supply ships were to be sent to Mir, which 
was repaired and maintained by the crews. The previous crew, 
Vasily Tsibliyev and Alexander Serebrov, returned to Earth on Jan. 
14 after 6 months, despite a last-minute slight collision of their 
Soyuz TM-17 with Mir. 


This Discovery mission included the first Russian cosmonaut 
(Krikalev). The mission also carried the commercial Spacehab 
facility (on its second flight) with 12 biotechnology and materials 
processing payloads, and the free-flying Wake Shield Facility, a 12- 
ft-diameter disk designed to generate an ultravacuum environment 
in space in its wake, within which to grow thin semiconductor 
crystal films of gallium arsenide. WSF 1 deployment, however, 
did not take place due to problems with a horizon sensor on 
the satellite; crystals were grown with WSF attached to the robot 
arm. The mission also included deployment of the German satellite 
BREMSAT. 


This Columbia mission was dedicated to microgravity research 
with the U.S. Microgravity Payload 2 (USMP 2) and the Office 
of Aeronautics and Space Technology 2 (OAST 2) payloads. The 
four experiments on USMP 2 were designed to study solidification 
techniques, crystal growth, and gas behavior in zero g. OAST 
2 carried five experiments for various other effects of materials 
and environmental phenomena in space. The Orbiter also carried 
special acceleration measurement systems, and the Shuttle Solar 
Backscatter Ultraviolet (SSBUV) instrument. 


Endeavour, in a57° orbit, carried the first Space Radar Laboratory 
(SRL 1) for internationally conducted studies of the Earth’s global 
environment. SRL consisted of the Spaceborne Imaging Radar-C 
(SIR-C) and the X-Band Synthetic Aperture Radar (X-SAR). Other 
payloads included microgravity experiments (CONCAP-IV) and 
medical investigations. 


This mission docked with Mir on July 3, carrying a Kazakhstan 
cosmonaut (Musabayev) besides a Russian. The two crew 
members replaced Afanasyev and Usachev, who returned to Earth 
in Soyuz TM-18 on July 9, after 6 months in space. Valery Polyakov 
continued his orbital stay. Malenchenko and Musabayev returned 
in TM-19 on Nov. 4, 1994 (with Ulf Merbold of TM-20). 


Columbia carried the second mission of the International Micro- 
gravity Laboratory (IML 2) with furnaces and other facilities to study 
the behavior of materials and life in the weightless environment, and 
to produce a variety of material structures, from crystals to metal 
alloys. More than 200 scientists contributed 82 investigations, of 
which 81 were successfully completed in 19 IML 2 facilities, plus 
other middeck experiments. The crew included the first Japanese 
woman in space (Naito-Mukai). 


Discovery was launched for a mission of atmospheric research, 
robotic atmospheric research, robotic operations, and an EVA. 
LITE (Lidar In-Space Technology Experiment) was flown and 
used in space. Operating from the shuttle payload bay, the 
neodymium YAG laser, an optical radar, probed the Earth’s 
atmosphere to collect measurements on clouds and airborne 
dust. Other payloads were the Robot-Operated Processing System 





Carl J. Meade 
Mark C. Lee 
1Ib=045kg. 1ml= 1.6km. 1ft= 0.3m. 


1 yd = 0.9 m. 


1 fathom = 1.8 m. 


1 Ib/in.2 = 6.8 kPa. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 





Dates launched 
and recovered 


Sept. 9-20, 
1994 (cont.) 


Sept. 30-Oct. 11, 
1994 


Oct. 4—Mar. 22, 
1994 


Nov. 3-14, 1994 


Feb. 3-11, 1995 


Mar. 2-18, 1995 


Mar. 14—Sept. 11, 
1995 


Designation 
and crew 


STS 68 
Michael A. Baker 
Terrence W. Wilcutt 
Thomas D. Jones 
Steven L. Smith 
Daniel W. Bursch 


A. Weight 
B. Revolutions 


Peter (Jeff) J. K. Wisoff 


Soyuz TM-20 


Alexander Viktorenko 


Elena Kondakova 
Ulf Merbold 


STS 66 


Donald R. McMonagle 


Curtis L. Brown 
Ellen Ochoa 
Joseph R. Tanner 


Jean-Francois Clervoy 


Scott E. Parazynski 


STS 63 


James D. Wetherbee 


Eileen M. Collins 
Bernard A. Harris 
C. Michael Foale 
Janice Voss 

Vladimir G. Titov 


STS 67 

Stephen S Oswald 
William G. Gregory 
Tamara E. Jernigan 
John M. Grunsfeld 
Wendy B. Lawrence 
Samuel T. Durrance 
Ronald A. Parise 


Soyuz TM-21 
Vladimir Dezhurov 
Gennady Strekalov 
Norman Thagard 


Remarks 


(ROMPS), a Simplified Aid for EVA Rescue (SAFER) backpack 
used experimentally and the astronomy satellite Spartan 201, 
deployed and retrieved after 40 h of free flight. 


Endeavour carried the Space Radar Laboratory into space for the 
second time. An international team used its instruments to study 
how the Earth's global environment is changing. 


This mission docked at Mir on Oct. 6, its crew including 
the third Russian woman (Kondakova) since inception of the 
Soviet/Russian space program, and ESA-astronaut Merbold on his 
third space trip. TM-20 was the first of two ESA Euro-Mir missions, 
and its launch had depended on successful docking of Progress 
M 24, accomplished by Mir commander Malenchenko on Sept. 2 
manually. Malenchenko and Musabayev then performed two EVAs, 
on Sept. 9 and 13, to inspect the (undamaged) orbital station where 
it had been struck by M 24 on Aug. 30 and by Soyuz TM-17 after its 
undocking in January, as well as Mir’s Safora control mast. On Oct. 
11, the crew had to cope with a serious power and attitude control 
failure of the station’s core module. After performing experiments in 
life sciences, materials processing, and technology development, 
though limited by a failure of the CSK 1 materials furnace, Merbold, 
with Malenchenko and Musabayev, returned to TM-179 on Nov. 
14, leaving Kondakova and Viktorenko in space with Polyakov (to 
return on Mar. 22, 1995). 


Atlantis carried the Atmospheric Laboratory for Applications and 
Science (ATLAS) on its third flight for Mission to Planet Earth. The 
crew, with a French astronaut (Clervoy) for ESA, used the remote- 
sensing laboratory for studying the Sun’s energy output, the middle 
atmosphere’s chemical makeup, and the effects of these factors 
on global ozone levels. On Nov. 4, they deployed the German- 
built reusable satellite CRISTA-SPAS, carrying cryogenic infrared 
spectrometers, telescopes, and a high-resolution spectrograph for 
atmosphere research. They recovered it on Nov. 12, using a new 
rendezvous method in preparation for later rendezvous/docking 
flights to the Russian Mir station. 


Discovery lifted off with the second Russian cosmonaut (Titov) 
and the first woman shuttle pilot (Collins) within a narrow launch 
window of 5 min, as required by its mission: rendezvous and fly- 
around of the Russian space station Mir at 51.6° inclination. The 
rendezvous, a dress rehearsal of docking missions to follow, took 
place on Feb. 6, validating new flight techniques required for those 
missions, though no docking was performed as yet. Also on board: 
Spacehab on its third flight and Spartan 204, a satellite carrying 
various science experiments. It was deployed by Titov on Feb. 
7 and retrieved with the robot arm on Feb. 9. Foale and Harris 
performed a 4.5-h EVA on Feb. 9. 


Setting a shuttle duration record of about 16'/ days in orbit, 
Endeavour carried the second flight of the ASTRO observation 
platform with three unique telescopes for ultraviolet astronomy. Its 
tasks included mapping areas of space still largely uncharted in 
ultraviolet and measuring intergalactic gas abundance. Ultraviolet 
spectra were taken of about 300 celestial objects, including Jupiter 
and the Moon. Other experiments concerned large-structure 
control dynamics and materials processing. 


This eighteenth crew visit to Mir carried the first American (Thagard) 
on a Russian spacecraft. Rendezvous and docking took place 
on Mar. 16. With the current Mir crew of 3 cosmonauts and 
the 7 astronauts on STS 67/Endeavour, the number of humans 
simultaneously in space was 13, a record (until the STS 67 
landing on Mar. 18). Mir 17 crew Viktorenko, Kondakova, and 
Polyakov returned on Mar. 22 in TM-20, with a new long-dura- 
tion record of 437 days 17 h 59 min set by Polyakov, and longest 





1 Ib = 0.45 kg. 


1 mi = 1.6 km. 


1 ft = 0.3 m. 


C. Max. distance 
from Earth Duration 
223,040 Ib 11 days 5h 
(landing) 46 min 
182 rev 
139 mi 
14,400 Ib 31 days 11h 
2637 rev 36 min/Merbold; 
247 mi 169 days 5h 
22 min/ 
Viktorenko 
and Kondakova 
209,857 Ib 10 days 22 h 
(landing) 35 min 
174 rev 
193 mi 
211,318 Ib 8 days 6h 
(landing) 28min 15s 
129 rev 
250 mi 
217,683 Ib 16 days 15h 
(landing) 9 min 
262 rev 
220 mi 
14,400 Ib 115 days 9h 
2820 rev 44 min 
246 mi 
1yd=0.9m. 1fathom=1.8m. 1 Ib/in.2 = 6.8 kPa. 








98 Space flight 


TABLE 1. Crewed space flights 1961-2006 (cont.) 





Dates launched 
and recovered 


Designation 
and crew 


A. Weight 
B. Revolutions 


C. Max. distance 


from Earth 


Duration 


Remarks 





Mar. 14-Sept. 11, 
1995 (cont.) 


May 20, 1995 


June 27-July 7, 
1995 


July 13-22, 1995 


Sept. 3-Feb. 29, 
1995 


Sept. 7-18, 1995 


Spektr 
Crewless 


STS 71 

Robert L. (Hoot) Gibson 

Charles J. Percourt 

Ellen S. Baker 

Gregory J. Harbaugh 

Bonnie Dunbar 

Anatoly Y. Solovyev (up) 

Nikolai M. Budarin (up) 

Vladimir Dezhurov 
(down) 

Gennady Strekalov 
(down) 


Norman Thagard (down) 


STS 70 
Terrence (Tom) Henricks 
Kevin Kregel 
Donald Thomas 
Nancy Currie 
Mary Ellen Weber 


Soyuz TM-22 
Yuri Gidzenko 
Sergei Avdeyev 
Thomas Reiter 


STS 69 
David M. Walker 
Kenneth D. Cockrell 
James S. Voss 
Jim Newman 
Michael L. Gernhardt 


43,000 Ib 


257 mi 


214,700 Ib 
(landing) 

153 rev 

246 mi 


195,195 Ib 
(landing) 

142 rev 

206 mi 


14,400 Ib 
2789 rev 
246 mi 


21,971 Ib 
(landing) 

170 rev 

232 mi 


9 days 19 h 
22 min 17 s; 
75 days 
11h19 min/ 
Solovyev and 
Budarin 


8 days 22 h 
21 min 2s 


179 days 


10 days 20 h 
29 min 


space flight by a woman (Kondakova) of 169 days 5 h 22 min. 
After a 3-month stay, Thagard, Dezhurov, and Strekalov were to 
be returned to Earth by the shuttle Atlantis (STS 71). Known as 
Mir 18, Thagard’s mission was part of Phase One of joint activities 
leading to the International Space Station. At the end of his mission, 
Thagard had established a new United States record for time spent 
in space of 115 days 9 h 44 min. 


Russia’s new four-solar-array research module, launched on a 
heavy-lift Proton, carried extensive atmospheric science instru- 
mentation and about 1760 Ib of United States equipment for car- 
diovascular and other biomedical research by astronaut Thagard, 
later by his replacement, Bonnie Dunbar. Spektr docked automat- 
ically to Mir on June 1, requiring five spacewalks by Dezhurov and 
Strekalov between May 12 and June 2 and the repositioning of the 
older module, Kristall. This prepared the station for the arrival and 
docking of the shuttle Atlantis (STS 71). 


Atlantis opened Phase One of the International Space Station 
program as the first of a series of shuttle—Mir linkups, 20 years 
after the joint United States-Russian ASTP mission. Docking took 
place on June 29, forming the largest human-made structure in 
space. Joint scientific (mostly medical) investigations were carried 
out inside the Spacelab module in the Atlantis cargo bay. Solovyev 
and Budarin then took over as new occupants of Mir (Mir 19), 
while the Mir 18 crew of Dezhurov, Strekalov, and Thagard returned 
to Earth with Atlantis. Undocking was on July 4, 4 days 21 h 
35 min after linkup. 


Discovery carried the seventh Tracking and Data Relay Satellite 
(TDRS 7). It was released out of the payload bay and boosted 
into geosynchronous orbit with its Inertial Upper Stage (IUS). 
Other payloads included a Commercial Protein Crystal Growth 
(CPCG) facility, a Bioreactor Demonstrator, and various biological 
experiments. 


EUROMIR’95 docked to Mir on Sept. 5, after Progress M-28 was 
undocked and jettisoned. ESA astronaul Reiter (Germany) served 
as flight engineer, marking the first time for Europe in Russia’s 
space program. On Oct. 20, he (with Avdeyev) installed a European 
experiment on the outside of Spektr. Two cassettes exposed to 
natural and human-made dust and debris were returned by Reiter 
and Gidzenko during another EVA (the third) on Feb. 8, 1996. The 
Mir 19 crew (Solovyev and Budarin) returned in TM-27 on Sept. 
11. On-board activities included experiments in life sciences and 
micro-g materials processing. Most of ESA’s equipment had been 
brought to Mir by Progress M-28 in July and by Spektr. Another 
load arrived on Progress M-29 (launch Oct. 8, docking Oct. 10, 
at aft end of Kvant 7), including the Munich Space Chair, set up 
Oct. 11 in Spektr. In November, the crew hosted the visiting shuttle 
Atlantis/STS 74. The crew returned 6 days after arrival of TM-23 
with the Mir 27 crew. 


Endeavour carried the Spartan 210 free-flyer on its third flight 
and the Wake Shield Facility (WSF 2) on its second flight as main 
payloads, Spartan released on Sept. 8 and retrieved 2 days 
later, observed the solar wind and the Sun’s outer atmosphere 
in conjunction with the Ulysses probe while it passed over the 
Sun’s north polar region. The WSF, deployed by the robot arm 
on Sept. 11 and retrieved on Sept. 14, was partially successful 
in growing thin semiconductor films in the ultravacuum created 
in its wake. Other payloads included the International Extreme 
Ultraviolet Hitchhiker (IEH 1) and the Capillary Pumped Loop 2/Gas 
Bridge Assembly (CAPL 2/GBA) for microgravity studies of fluid 
systems. Voss and Gernhardt on an EVA tested construction tools, 
an arm-sleeve strap-on computer, and suit modifications for space- 
station assembly. 





1 Ib = 0.45 kg. 





1 mi = 1.6 km. 


1 ft = 0.3 m. 


1yd =0.9m. 


1 fathom = 1.8 m. 


1 Ib/in.2 = 6.9 kPa. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 





A. Weight 
B. Revolutions 








Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
Oct. 20-Nov. 5, STS 73 230,158 Ib 15 days 21h Columbia carried the second U.S. Microgravity Laboratory (USML 
1995 Kenneth D. Bowersox (landing) 53 min 2) Spacelab with science and technology experiments in areas such 
Kent Rominger 255 rev as fluid physics, materials science, biotechnology, combustion 
Kathryn Thornton 174 mi science, and commercial space processing technologies. Students 
Catherine Coleman interacted with the crew, discussing and comparing on-board 
Michael Lopez-Alegria microgravity experiments with similar ground-based experiments. 
Fred Leslie 
Albert Sacco 
Nov. 12-20, 1995 STS 74 205,000 8 days 4 h Atlantis carried a Russian-built Docking Module (DM), two new 
Kenneth Cameron (landing) 31 min solar arrays, and supplies for Mir on the second shuttle-Mir docking 
James Halsell 128 rev mission. After attaching the DM to Mir’s Kristal module with the 
Chris Hadfield 185 mi robot arm and linking the shuttle’s airlock to it on Nov. 15, the 
Jerry Ross Atlantis crew, including a Canadian (Hadfield), was welcomed by 
William McArthur Gidzenko, Avdeyev, and Reiter. During 3 days of joint operation, the 
crews transferred the United States biomedical and microgravity 
science samples and data collected by the Mir 18, 19, and 20 
resident crews from the station to the shuttle, and fresh supplies 
to Mir. On Nov. 18, Atlantis undocked, leaving the DM attached to 
the station for future linkups. 
Jan. 11-20, 1996 SIS 72 217,000 Ib 8 days 22 h Endeavour retrieved, on Jan. 13, the Japanese satellite SFU (Space 
Brian Duffy (landing) 1 min Flyer Unit) launched by an H-2 rocket in March 95. On Jan. 14, the 
Brent W. Jett, Jr. 141 rev Japanese crew member (Wakata) released the OAST-Flyer, a free- 
Leroy Chiao 290 mi flying platform with four autonomous experiments. On Jan. 16, the 
Winston E. Scott satellite was retrieved. On Jan. 14, Chiao and Barry performed a 
Koichi Wakata spacewalk to evaluate a portable work platform. A second EVA, 
Daniel T. Barry by Chiao and Scott, took place on Jan. 17 for further testing of 
spacesuit modifications and space station assembly tools and 
techniques. 
Feb. 21-Sept. 2, Soyuz TM-23 14,400 Ib 193 days 19 h The Mir 21 crew docked to the Mir station on Feb. 23, joining 
1996 Yuri Onufrienko 3027 rev 11 min the three crew members of Mir 20, who returned in TM-22 on 
Yuri Usachev 247 mi Feb. 29. In the first month, on-board activities focused on national 
research, followed by the United States science program NASA 2 
when astronaut Shannon Lucid joined the crew on Mar. 23, On May 
21 and 24, Onufrienko and Usachev performed their second and 
third spacewalk to install and unfurl the Mir Cooperative Solar Array 
brought up by Atlantis (STS 74) and a fourth on May 30 to install a 
European earth resources camera on the Priroda module. On June 
13, they installed a truss structure called Rapanato Kvant 1, taking 
the place of the similar Strela. Later, they worked on the French 
research program Cassiopeia when Mir 22 arrived with French 
astronaut Andre-Deshays. Onufrienko and Usachev returned with 
her on Sept. 2. 
Feb. 22-Mar. 9, STS 75 229,031 Ib 15 days 17 h Columbia carried three foreign astronauts among its crew 
1996 Andrew Allen (landing) 39 min (Cheli/ESA-Italy, Nicollier/ESA-Switzerland, Guidoni/ASl-Italy). On 
Scott Horowitz 250 rev Feb. 25, the Tethered Satellite System, on a reflight from STS 46 in 
Franklin Chang-Diaz 185 mi July 1992, was deployed on the end of its thin 12.5-mi-long tether 
Jeffrey Hoffman line. Experiments could not be completed, however, since, with 12 
Claude Nicollier mi of tether deployed, the cable broke close to the shuttle, and the 
Maurizio Cheli satellite was lost in orbit. On-board activities then focused on the 
Umberto Guidoni operation of the third U.S. Microgravity Payload (USMP 3), con- 
tinuing research aimed at improving basic knowledge of materials 
under zero-g conditions. 
Mar. 22-31, 1996 STS 76 246,335 Ib 9 days 5 h Atlantis lifted off for the third linkup with Russia’s Mir station 
Kevin Chilton (landing) 15 min 53 s as part of Phase 1 of the International Space Station program. 
Richard Searfoss 144 rev After docking on Mar. 23 and hatch opening, the two crews 
Ronald Sega 246 mi of eight conducted joint operations and transferred 5000 Ib of 
M. Richard Clifford supplies and water to Mir. The first spacewalk by United States 
Linda Godwin astronauts (Godwin, Clifford) with a shuttle attached to the Russian 
Shannon Lucid station took place on Mar. 27, to mount four experiments to its 
outside for assessing its environment. After concluding science 
and technology experiments in the Spacehab module in the cargo 
bay, Atlantis undocked on Mar. 28, leaving Lucid, the first United 
States astronaut to be ferried to Mir by shuttle, aboard the station 
to join Onufrienko and Usachev for the next 188 days. 
Apr. 23, 1996 Priroda 43,000 Ib — The Priroda module of Mir was launched on a Proton rocket. 
Crewless — Docking occurred on the first attempt on Apr. 26, despite the failure 
246 mi of two of the module's batteries on Apr. 25. On Apr. 27, Priroda 
1lb = 0.45kg. 1mi= 1.6km. 1ft=0.3m. 1yd=0.9m. 1fathom= 1.8m. 1 Ib/in.2 = 6.9 kPa. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 








A. Weight 
B. Revolutions 
Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
Apr. 23, was rotated from an axial to a radial docking port. The module 
1996 (cont.) carries a set of remote-sensing instruments and some microgravity 
experiments. 
May 19-29, 1996 STS 77 254,879 lb 10d 39 min Endeavour carried into orbit the commercial Spacehab module 
John H. Casper (landing) 18s (fourth flight), with nearly 3000 Ib of experiments and support 
Curtis L. Brown 161 rev equipment, an Inflatable Antenna Experiment (IAE) on the free- 
Andrew S. W. Thomas 177 mi flying Spartan 207 platform, and four technology experiments 
Daniel W. Bursch called TEAMS. The crew included a Canadian (Garneau). The 
Spartan was released on May 20, and the 50-ft-diameter IAE 
inflated on its three 92-ft-long struts, the most complex and 
precise inflatable space structure ever. It was separated from 
Spartan later and burned up in the atmosphere. The satellite 
was retrieved on May 21, followed by further experiments which 
included rendezvous with the passive aerodynamically stabilized, 
magnetically damped satellite PAMS/STU (Satellite Test Unit). 
June 20-July 7, STS 78 256,170 Ib 16d 21h Columbia carried the Life and Microgravity Sciences (LMS) 
1996 Terence Henricks (landing) 48 min Spacelab, equipped to focus on both living organisms in the low- 
Kevin Kregel 271 rev gravity environment and the processing of various substances 
Richard M. Linnehan 177 mi when gravity effects are greatly reduced. Crew included a French 
Susan J. Helms and a Canadian payload specialist (Favier, Thirsk). Mission set a 
Charles E. Brady, Jr. shuttle duration record. 
Jean-Jacques Favier 
Robert Brent Thirsk 
Aug. 17, 1996- Soyuz TM-24 14,400 Ib 32d 18h The Mir 22 crew included a French female researcher-cosmonaut 
Mar. 2, 1997 Valeriy Korzun 243 rev 27 min/André- (André-Deshays). The three cosmonauts joined the Mir 21 crew on 
Aleksandr Kaleri = Deshays; board Mir on Aug. 19. Prior to their arrival, on Aug.1, the crewless 
Claudie André-Deshays 196d17h cargo ship Progress M-31 was undocked and deorbited over the 
26 min/Korzun Pacific; on Aug. 2, Progress 32 arrived. It was undocked on Aug. 18 
and Kaleri to free up a docking port, and linked up again with Mir on Sept. 3. 
Sept. 16-26, STS 79 249,352 Ib 10d 3h Atlantis took off for its fourth docking with Russia’s Mir, the sec- 
1996 William F. Readdy (landing) 18 min; ond flight to carry a commercial Spacehab module in its cargo bay, 
Terrence W. Wilcutt 159 rev 188 d 5h/ and the first to carry a double version of it. Main mission purposes: 
Jay Apt 239 mi Lucid to replace Lucid on Mir with Blaha for a 4-month stay; to carry 
Tom Akers 4600 Ib of logistics, including food, clothing, experiment supplies, 
Carl E. Watz and spare equipment, to the space station; and to return 2200 Ib of 
John E. Blaha (up) science samples and hardware to Earth. Five different experiments 
Shannon Lucid (down) were turned off on the shuttle and rapidly transferred to Mir, the 
first such transfer of powered scientific apparatus. The two vehicles 
docked on Sept. 18, and separated on Sept. 23. The mission began 
anew era of permanent space occupancy by U.S. astronauts. Lucid 
set records for longest space flight, both for women and for U.S. 
astronauts (188 days 5 hours). 
Nov. 19-Dec. 7, STS 80 226,954 Ib 17d 15h53min Columbia carried the 7876-lb (3600-kg) reusable German 
1996 Kenneth D. Cockrell (landing) ORFEUS-SPAS II ultraviolet observatory and the 4650-Ib 
Kent V. Rominger 278 rev (2100-kg) Wake Shield Facility (WSF), on its third flight, with its 
Tamara E. Jernigan 220 mi 4750-Ib (2155-kg) carrier system. The ORFEUS-SPAS satellite was 
Thomas D. Jones released on Nov. 19 and recovered with the RMS-robot arm on 
Story Musgrave Dec. 3. The free-flying WFS was deployed on Nov. 22 and retrieved 
on Nov. 25. Both satellites were highly successful in their mis- 
sions. The crew conducted biomedical and genetic experiments 
in weightlessness. Astronaut Musgrave tied NASA astronaut John 
Young's record for most space flights by any human being (6), and 
at age 61, he became the oldest person to date to fly in space. The 
flight set a new duration record for the space shuttle. 
Jan. 12-22, STS 81 249,936 Ib 10d 4h 56min Atlantis on the fifth mission (of 9) to Mir replaced astronaut John 
1997 Michael A. Baker (landing) Blaha with Jerry Linenger as U.S. Mir resident anddelivered various 
Brent W. Jett, Jr. 160 rev supplies. After docking on Jan. 14, the largest transfer of cargo 
Peter (Jeff) Wisoff 245 mi items to date took place: 1400 Ibs (635 kg) of water, 1137 Ib (516 kg) 
John M. Grunsfeld of U.S. science equipment, 2206 Ib (1000 kg) of Russian logistics, 
Marsha S. Ivins and 268 Ib (122 kg) of miscellaneous material. 
Jerry M. Linenger (up) 
John Blaha (down) 
Feb. 10-Aug. 14, Soyuz TM-25 14,400 Ib 19d 16h 35min/ The Mir 23 crew joined the three station occupants, which included 
1997 Vasiliy Tsibliev 2885 rev Ewald; U.S. astronaut Jerry Linenger. On Feb. 23, a small flash fire broke 
Aleksandr Lazutkin 246 mi 184d 22h 6 min/ out in the Kvant module when a lithium perchlorate “candle,” a 
Reinhold Ewald Tsibliev and secondary oxygen-generating system, malfunctioned. The crew 
(Mir ’97) Lazutkin extinguished the flames within 90 seconds, but had to wear 





1lb=0.45kg. 1mi=1.6km. 1ft=0.3m. 1yd=0.9m. 1 fathom=1.8m. 1 Ib/in.2 = 6.9 kPa. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 





Dates launched 
and recovered 


Designation 
and crew 


A. Weight 


B. Revolutions 
C. Max. distance 


Remarks 





Feb. 10-Aug. 14, 


1997 (cont.) 


Febstd 21/1997 


April 4-8, 1997 


May 15-24, 1997 


July 1-17, 1997 


Aug. 5, 1997- 
Feb. 19, 1998 


STS 82 
Kenneth D. Bowersox 
Scott J. Horowitz 
Joseph R. Tanner 
Steven A. Hawley 
Gregory J. Harbaugh 
Mark C. Lee 
Steven L. Smith 


STS 83 
James D. Halsell 
Susan L. Still 
Janice Voss 
Michael L. Gernhardt 
Donald A. Thomas 
Roger K. Crouch 
Gregory T. Linteris 


STS 84 
Charles J. Precourt 
Eileen M. Collins 
C. Michael Foale (up) 
Jerry Linenger (down) 
Carlos |. Noriega 
Edward T. Lu 


Jean-Francois Clervoy 


Elena V. Kondakova 


STS 94 
James D. Halsell 
Susan L. Still 
Janice Voss 
Michael L. Gernhardt 
Donald A. Thomas 
Roger K. Crouch 
Gregory T. Linteris 


Soyuz TM-26 
Anatoly Solovyev 
Pavel Vinogradov 


breathing apparatus and subsequently filtering masks because of 
smoke. A few hours later, operations returned to normal. Ewald 
returned to Earth with Korzun and Kaler; in Soyuz TM-24 on Mar. 2. 
The two Elektron primary oxygen systems (based on electrolysis 
of waste water) having failed previously, only one of the secondary 
systems burning chemicals remained until arrival of the crewless 
resupply ship Progress M-34 on Apr. 8. On June 25, the automated 
Progress M-34 collided with Mir when Tsibliev attempted to link 
it to the Spektr module during a redocking exercise trying out 
the TORU manual guidance system. The collision damaged one 
of Spektr’s four solar arrays, a radiator and its hull, resulting in 
its depressurization—a major emergency. The crew was able to 
disconnect power and data cables and close the airtight hatch 
in time. During the subsequent weeks, on-board operations were 
severely curtailed because of the power shortage and were further 
beset by heart problems of the Mir commander (Tsibliev) and 
various subsystems malfunctions. Tsibliev and Lazutkin returned to 
Earth on Soyuz TM-25, one week after the arrival of the replacement 
crew (Soyuz TM-26, Mir-24). 


Discovery caught up with and retrieved the Hubble Space Tele- 
scope (HST) on Feb. 13 for its second scheduled servicing mission 
since its launch in April 1990. After installing the observatory with 
the manipulator arm on a turntable in the cargo bay, Lee, Smith, 
Harbaugh, and Tanner performed critical maintenance of the HST 
and significantly upgraded its scientific capabilities by exchang- 
ing eight degraded or failed components and replacing two major 
instruments, the Goddard High Resolution Spectrometer (GHRS) 
and the Faint Object Spectrograph (FOS) with the more advanced 
Space Telescope Imaging Spectrograph (STIS) and the Near In- 
frared Camera and Multi-Object Spectrometer (NICMOS), respec- 
tively. The HST was deployed and released in perfect condition on 
Feb. 19. 


Columbia carried the Spacelab module configured as the Micro- 
gravity Science Laboratory-1 (MSL-1) with 33 investigations from 
four space agencies in the areas of crystallography, combustion 
science, materials science, and life sciences. The mission was to 
test some of the hardware, facilities, and procedures to be used on 
the International Space Station (ISS). In its international complex- 
ity, the flight was intended to mirror the future work aboard the ISS, 
bringing together academic, industrial, and governmental partners 
from around the world. One of the important new facilities designed 
to enable research in zero-g was the EXPRESS Rack (EXpedite Pro- 
cessing of Experiments to the Space Station). However, on Apr. 6, 
a malfunction of one of the three power-generating fuel cells of the 
shuttle necessitated shutdown of the unit and early mission termi- 
nation. Columbia had to return on Apr. 8 without accomplishing its 
mission. 


Atlantis successfully conducted the sixth Mir docking mission of 
the /SS/Phase | program, and remained linked to Mir for 5 days, 
unloading 1025 Ib (465 kg) water, 845 Ib (883 kg) U.S. science 
equipment, 2576 Ib (1168 kg) Russian logistics including a much- 
needed Elektron oxygen generator unit and a vacuum cleaner, and 
393 Ib (178 kg) miscellaneous material. The mission also served 
to exchange U.S. astronaut Jerry Linenger, who had spent over 
132 days in Mir, against Michael Foale for another 4 months of 
continuing U.S. presence in space. 


Repeat of mission STS 83 of Apr. 4-8, 1997, with same crew 
and payload. Columbia carried the Microgravity Science Labo 
ratory (MSL-1), a continuation of NASAs_ micro-g research 
program in the fields of materials science, protein crystal growth, 
and physics, with 33 investigations developed by scientists 
from four space agencies. Main focus: combustion and other 
processes important to advanced industrial operations and engine 
development. 


Soyuz TM-26 carried a specially trained crew for repairing the 
damaged space station. The Mir 24 crew worked the next 61/> 
months to make repairs and reestablish proper function to all 
onboard systems, except for the Spektr module. When Solovyev 
and Vinogradov returned to Earth on Feb. 19 Mir was again in 
satisfactory condition for safe human occupancy. 





1 Ib = 0.45 kg. 


1 mi = 1.6 km. 


1 ft = 0.3 m. 


from Earth Duration 

213,312 Ib 9d 23h 36 min 
(landing) 

150 rev 

370 mi 

259,705 Ib 3 d 23 h 33 min 
(landing) 30s 

63 rev 

185 mi 

221,087 Ib 9d 5h 21 min 
(landing) 

145 rev 

247 mi 

259,705 Ib 15d 16h 45 min 
(landing) 

251 rev 

185 mi 

14,400 Ib 197d 15h 35 min/ 

~3093 rev Solovyev and 

244 mi Vinogradov 

1 yd = 0.9 m. 1 fathom = 1.8 m. 


1 Ib/in.2 = 6.9 kPa. 
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Remarks 


Discovery lifted off with the Cryogenic Infrared Spectrometers 
and Telescopes for the Atmosphere-Shuttle Pallet Satellite-2 
(CRISTA-SPAS-2) on its second flight (and the fourth mission 
in a cooperative venture between the German Space Agency 
DARA and NASA). The 7724-lb (3500-kg) satellite was released 
in orbit to measure infrared radiation emitted by the atmosphere; 
on Aug. 16 it was retrieved with the shuttle’s mechanical arm. Other 
payloads included the Japanese Manipulator Flight Demonstration 
(MFD), that is, three separate tests of a robot arm being 
developed by Japan for its module JEM of the /nternational 
Space Station (ISS); the Technology Applications and Science-1 
(TAS-1) package with seven separate experiments for studies of 
atmospheric and topographic physics, solar energy, and testing 
of new thermal control devices; and the International Extreme 
Ultraviolet Hitchhiker-02 (IEH-02) with four experiments to research 
UV radiation from the stars, the Sun, and other sources in the solar 
system. 


Atlantis performed the seventh docking mission with Mir. On 
Oct. 2, Parazynski and Titov performed a 5 h 1 m EVA to retrieve 
four suitcase-sized Mir Environmental Effects Payload (MEEP) 
experiments from the exterior of the docking module where they 
had been attached by STS 76 crew members in March 1976. They 
also left a large (121 Ib or 55 kg) solar array cover cap at the outside 
for planned use in a future EVA as part of Russia’s attempts to repair 
the damaged Spektr module, and evaluated the small jet-backpack 
called SAFER (Simplified Aid for EVA Rescue). 


Columbia lifted off with the U.S. Microgravity Payload-4 (USMP-4), 
the Spartan 201-04 free-flyer, a Collaborative Ukrainian Experiment 
(CUE) for plant space biology research, and several other hitchhiker 
payloads. The launch, for the first time, incorporated a new flight 
maneuver by which the shuttle, at about 6 min into the flight, rolled 
to a “heads up” position, allowing early acquisition of the Tracking 
and Data Relay Satellite System (TDRSS) in geostationary orbit in 
lieu of the Bermuda tracking station and resulting in cost savings. 
On Nov. 21, the Spartan satellite was deployed with the RMS, 
Intended for investigations of physical conditions and processes of 
the solar corona and the hot outer layers of the Sun’s atmosphere, 
the spacecraft did not self-activate and began to tumble on a 
failed attempt by Chawla to regrapple it with the RMS. It was 
manually recaptured on Nov. 24 by Scott and Doi during a 7h 43m 
spacewalk. On Dec. 3, Scott and Doi conducted a second EVA of 
4h 59 m duration to further evaluate equipment and procedures for 
the /SS, including a telescoping (to 17.5 ft) crane for moving large 
Orbital Replacement Units (ORUs), and a prototype free-flying TV 
camera (AERCan/Sprint) for exterior inspections. 


Endeavour completed the eighth docking mission to Mir. The crew 
transferred 774 items of cargo including a new computer and air- 
conditioning equipment. Wolf replaced Thomas (who became the 
seventh U.S. astronaut, NASA 7, to stay on Mir for a long-duration 
mission). Wolf had spent 126 days aboard Mir, bringing the total 
U.S. crew time on Mir to 843 days. Other payloads on STS 89 
included the Spacehab double module with science experiments 
and new research equipment for NASA 7. 


Soyuz TM-27 launched with the new Mir 25 crew and a researcher 
(Eyharts) from France's space agency CNES. After docking to Mir 
on Jan. 31, the space station had five occupants until Feb. 19, when 
the Mir 24 crew (Solovyev and Vinogradov), along with Eyharts, 
departed in Soyuz TM-26. 


Columbia’s prime mission for the Spacelab module (on its 
sixteenth flight) was to conduct microgravity research in the 
human nervous system, including brain, spinal cord, nerves, 
and sensory organs. Acting both as subjects and operators, 
the crew performed 26 experiments organized in eight groups: 
Adult Neuronal Plasticity, Mammalian Development, Aquatic, 
Neurobiology, Autonomic Nervous System, Sensory Motor and 
Performance, Vestibular, and Sleep. Included were studies on a 
variety of species, such as rats, mice, toad fish, zebra fish, crickets, 
and snails. Secondary objectives included measurement of shuttle 
vibration forces, demonstration of the Bioreactor system used for 
cell growth, and three Getaway Special (GAS) payloads. 








TABLE 1. Crewed space flights 1961-2006 (cont.) 
A. Weight 
B. Revolutions 
Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration 

Aug. 7-19, 1997 STS 85 217,910 Ib 11 d 20h 27 min 
Curtis L. Brown (landing) 
Kent V. Rominger 189 rev 
N. Jan Davis 185 mi 
Robert L. Curbeam, Jr. 
Stephen K. Robinson 
Bjami V. Tryggvason 

Sept. 25-Oct. 6,1997 STS 86 251,730 Ib 10d 19h 21 min 
James D. Wetherbee (landing) 
Michael J. Bloomfield 169 rev 
Vladimir G. Titov 243 mi 
Scott E. Parazynski 
Jean-Loup Chrétien 
Wendy B. Lawrence 
David A. Wolf (up) 
C. Michael Foale (down) 

Nov. 19-Dec. 5, 1997 STS 87 226,449 Ib 15d 16h 34 min 
Kevin R. Kregel (landing) 4s 
Steven W. Lindsey 251 rev 
Kalpana Chawla 174 mi 
Winston E. Scott 
Takao Doi 
Leonid K. Kadenyuk 

Jan. 22-31, 1998 STS 89 251,612 Ib 8d 19h 47 min 
Terrence W. Wilcutt (landing) 
Joe F. Edwards, Jr. 138 rev 
James F. Reilly II 247 mi 
Michael P. Anderson 
Bonnie J. Dunbar 
Salizhan S. Sharipov 
Andrew S. Thomas (up) 
David A. Wolf (down) 

Jan. 29-Aug. 25, Soyuz TM-27 14,400 Ib 207 d 51 min/ 

1998 Talgat Musabayev ~3290 rev Musabayev 

Nikolai Budarin 239 mi and 
Leopold Eyharts Budarin 

Apr. 17-May 3, 1998 STS 90 232,500 Ib 15d 21h 50 min 
Richard A. Searfoss (landing) 
Scott D. Altman 255 rev 
Richard M. Linnehan 178 mi 
Kathryn P. Hire 
Dafydd Rhys Williams 
Jay Clark Buckey, Jr. 
James A. Pawelczyk 

1 Ib = 0.45 kg. 1 mi = 1.6 km. 1 ft = 0.3 m. 1 yd = 0.9 m. 1 fathom = 1.8 m. 1 Ib/in.2 = 6.9 kPa. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 





A. Weight 


B. Revolutions 





Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
June 2-12, 1998 STS 91 259,834 Ib 9d 19h 48 min The last of nine docking missions with Mir ended Phase 1 
Charles J. Precourt (landing) of the International Space Station (ISS) program as Discovery 


Dominic L. Pudwill Gorie 154 rev 


11d 19h 41 min/ 
Baturin; 198 d 
15 h 30 min/ 
Padalka 


8d 21h 45min 


11d 19h 18 min 


7d 21h 56 min/ 
Bella; 

187 d 22h 18 min/ 
Afanasyev and 
Haignere; 

379 d 14h 52 min/ 
Avdeev 


9d 19h 13 min 


linked up with the space station on June 4. Payloads flying on 
Discovery included the Alpha Magnetic Spectrometer (AMS) of the 
Department of Energy to study high-energy particles from deep 
space, several experiments of advanced technology, human life 
sciences, /SS risk mitigation, and microgravity physics, four Get- 
Away Specials (GAS) experiments, and two Space Experiment 
Module (SEM) canisters with experiments from U.S. middle school, 
high school, and college students. U.S. astronauts performed 
the majority of science research on Mir while their Russian 
hosts primarily concerned themselves with station upkeep and 
maintenance. This international cooperation provided a wealth of 
new knowledge about space flight, particularly with regard to space 
station design, operation, and maintenance, and was the first of its 
kind on this magnitude. 


The Mir 26 crew docked to Mir, using the manual backup system 
TORU due to prior failure of one of two automatic KURS systems 
on Mir. Baturin returned with the Mir 25 crew on Aug. 25. 


Discovery lifted off with a Japanese woman astronaut on her 
second flight (Mukai), and payload specialist Senator John Glenn, 
at 77 years the oldest human ever to fly into space and also 
the first American to fly into Earth orbit. To take scientific 
advantage of Glenn’s advanced age and his unbroken medical 
history records with NASA over the last 40 years, about 10 of 
the flight’s 83 experiments and payloads investigated questions 
pertaining to geriatrics in space, that is, the phenomena of natural 
aging vs. weightlessness-induced effects closely resembling them. 
The other experiments made up one of the most diverse sets 
of commercial and scientific research investigations addressing 
biotechnology, life sciences, solar physics, and astronomy. The 
payload included the Spartan 201-04 free-flyer on its reflight after 
its failed deployment on mission STS 87. It was deployed on Nov. 1 
and successfully retrieved 2 days later. The Discovery mission was 
the last U.S. flight prior to the new era of the International Space 
Station. 


Funded by the U.S. and built by Russia (Khrunichev State Research 
and Production Center, KhSC, in Moscow), the power and control 
module FGB (Funktionalnyi-grusovoi blok) named Zarya (Dawn) 
was launched on a Proton rocket from Russia’s cosmodrome in 
Kazakhstan as the first component of the International Space 
Station. 


Endeavour lifted off with the connecting module Node 1/Unity, 
the first U.S.-built station element for the /SS. Rendezvous and 
connection with /SS/Zarya occurred on Dec. 6. On Dec. 14, the 
crew deployed two satellites: an Argentinian satellite (SAC-A) on 
Dec. 13 and a U.S. Air Force test satellite (VightySat). Additional 
payloads were an IMAX camera for filming from the cargo bay, 
an educational Space Experiment Module (SEM-07), and other 
experiments. 


The new Mir 27 crew docked on Feb. 22, and began some 100 
experiments, with an incubator with 60 quail eggs hatching a few 
days after docking, to study effects of space mission on embryonic 
and postembryonic developments. Fledglings returned to Earth 
(only two survived) with Padalka and Bella in Soyuz TM-28 on 
Feb. 28. On July 18, the Progress M-42 cargo drone docked to 
deliver a backup computer intended to provide motion control 
redundancy during a forthcoming period of approx. 6 months 
of crewless (mothballed) operation of Mir, for the first time in 
10 years. 


The first visit in history to the International Space Station occurred 
when Discovery docked flawlessly to the Unity/Zarya complex 
on May 29. The crew began a spacewalk to transfer two cranes 
(one Russian, one U.S.-made) from the shuttle’s cargo bay to the 
station’s outside, installed two new portable foot restraints, and 
attached three bags with tools and handrails for future assembly 
operations. The shuttle’s hatch to the ISS mating adapter PMA 
2 was opened on May 29 for leak and pressurization tests. On 
May 30 the crew members began transferring supplies, equipment, 





Franklin R. Chang-Diaz 237 mi 
Wendy B. Lawrence 
Janet Lynn Kavandi 
Valery V. Ryumin 
Andrew Thomas (down) 
Aug. 13, 1998- Soyuz TM-28 14,400 Ib 
Feb. 28, 1999 Gennady Padaika ~3109 rev 
Sergei Avdeev 225 mi 
Yuri Baturin 
Oct. 29-Nov. 7, 1998 SiS95 227,783 lb 
Curt Brown, Jr. (landing) 
Steven W. Lindsey 134 rev 
Scott Parazynski 348 mi 
Stephen Robinson 
Pedro Duque 
Chiaki Mukai 
John Glenn 
Nov. 20, 1998 FGB/Zarya 44,088 Ib 
Crewless — 
240 mi 
December 4-15, 1998 STS 88 263,927 Ib 
Robert D. Cabana (orbit) 
Frederick W. Sturckow 185 rev 
Jerry L. Ross 243 mi 
Nancy J. Currie 
James H. Newman 
Sergei K. Krikalev 
Feb. 20-Aug. 27, 1999 Soyuz TM-29 14,400 Ib 
Viktor Afanasyev ~2950 rev 
Jean-Pierre Haignere 227 mi 
Ivan Bella 
May 27-June 6, 1999 STS 96 262,035 Ib 
Kent V. Rominger (orbit) 
Rick D. Husband 153 rev 
Tamara E. Jernigan 234 mi 
Ellen Ochoa 
Daniel T. Barry 
Julie Payette (Canada) 
Valery Tokarev (Russia) 
1 Ib = 0.45 kg. 1 mi = 1.6 km. 1 ft = 0.3 m. 1yd = 0.9m. 


1 fathom = 1.8 m. 


1 Ib/in.? = 6.9 kPa. 








104 


Space flight 





TABLE 1. Crewed space flights 1961-2006 (cont.) 





A. Weight 
B. Revolutions 











Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
May 27-June 6, and water. They also replaced 18 recharge controller units for the six 
1999 (cont.) Zarya batteries, installed mufflers over noisy FGB fans, and replaced 
a power distribution unit and transceiver of the Node communication 
system. At work conclusion, the crew had transferred a total of 
3567 Ib (1618 kg) material to the /SS, including 686 Ib (311 kg) water. 
After releasing the student satellite Starshine on June 5, Discovery 
returned. 
July 23-27, 1999 STS 93 219,980 Ib 4d 22h 49 min Columbia launched, with the Chandra X-Ray Observatory (AXAF), 
Eileen M Collins (landing) Svis the third in NASA's series of Great Observatories. The observatory 
Jeffrey S. Ashby 177 rev was successfully deployed and subsequently boosted itself to a 
Catherine G. Coleman 79 mi final orbit of 86,992 by 6034 mi (139,188 by 9655 km), with a period 
Steven A. Hawley of 63 h 28 m. Secondary objectives included firing of Columbia’s 
Michel Tognini jet thrusters at various times during the flight to help an Air Force 
satellite gather data on the characteristics of jet plumes in space. 
Also, the Southwest Ultraviolet Imaging System (SWUIS) collected 
data on ultraviolet light originating from a variety of planetary 
bodies. 
Dec. 19-27, 1999 STS 103 211,129 Ib 7d 23h 11 min Discovery conducted the third maintenance/repair mission to the 
Curtis L. Brown (landing) Hubble Space Telescope (HST). The mission, originally planned for 
Scott J. Kelly 119 rev 2000, became necessary when the HST had to be put in “safe” 
Steven L. Smith 367 mi (dormant) mode with only two of its six gyroscopes functioning. 
Jean-Francois Clervoy The crew installed six new gyros and six voltage/temperature 
John M. Grunsfeld improvement kits; replaced the main computer, the Fine Guidance 
Michael Foale Sensor, and a radio transmitter; and installed a new digital recorder 
Claude Nicollier as well as new insulation on two equipment bay doors. 
Feb. 11-22, 2000 STS 99 225,669 Ib 11 d 5 h 38 min Shuttle Radar Topography Mission (SRTM), with primary objective 
Kevin R. Kregel (landing) to acquire a high-resolution topographic map of Earth’s land 
Dominic L. P. Gorie 181 rev mass between 60°N and 56’S, and to test new technologies for 
Gerhard P. J. Thiele 146 mi deployment of large rigid structures and measurement of their 
Janet L. Kavandi distortions to extremely high precision. The 87 cube-shaped “bays” 
Janice Voss of the carbon fiber-reinforced plastic, stainless steel, alpha titanium, 
Mamoru Mohri and Invar radar mast carried active radar antennas for C-band and 
X-band wavelengths. SRTM will produce topographic maps of Earth 
30 times as precise as the best global maps to date. 
Apr. 4—June 16, Soyuz TM-30 14,400 Ib 72 d 19 h 43 min First crew return to Mir after the 14-year-old station had floated 
2000 Sergei Zalyotin 1139 rev for 223 days without occupants in “mothballed” condition. Prior to 
Alexander 210 mi their flight, the crewless Progress M1-1 (first of an upgraded version 
Kaleri with double tankage volume) docked automatically with the station 
on Feb. 3. A second Progress flight to Mir occurred on Apr. 27. 
The crew had the task of returning the station to safe operating 
condition. After 2 weeks of tests, they succeeded in sealing an 
air leak in a hatch between the core module’s transfer compartment 
and the airless Spektr module. On May 12, an EVA was conducted 
to test new techniques of leak repair. The Mir 28 crew returned to 
Earth in TM-30, leaving the renovated station sealed up and again 
on automatic control. 
May 19-29, 2000 STS 101 225,504 Ib 9d20h9min9s_ Theprimary objectives of the second crewed visit to the International 
James D. Halsell (landing) Space Station (ISS) were to remove and replace FGB/Zarya 
Scott J. Horowitz 239 rev hardware that had failed, such as battery systems, and to reboost 
Mary Ellen Weber 155 mi the /SS to an altitude consistent with the projected Service 
Jeffrey N. Williams Module/Zvezda launch and docking. Atlantis docked with /SS on 
James S. Voss May 21, and Williams and Voss reinstalled a loose U.S. crane, 
Susan J. Helms assembled and installed a Russian “Strela” cargo crane, replaced 
Yuri V. Usachev a communications antenna outside the Node Unity, and installed 
eight handrails on the outside wall of Unity. Repair and replacement 
activities extended over the following 4 days. In three separate 
maneuvers, /SS altitude was increased 28.7 mi (45.9 km) to 237.7 x 
229.4 mi (380.3 X 367.0 km). The crew transferred over 3300 Ib 
(1500 kg) of gear from Atlantis to ISS and returned other cargo to 
the shuttle. 
July 12, 2000 Service Module (SM) 42,000 Ib = The Service Module (SM), named Zvezda (Star), the first Russian- 
Zvezda — funded and -owned element of the ISS, was launched on a Proton 
Crewless E rocket from Baikonur Cosmodrome in Kazakhstan. The docking of 
Zvezda with the /SS on July 25, executed very gently at 8 in./s 
(0.2 m/s) by the FGB/Zarya as active partner, marked the arrival of 
the first human living quarters for the new outpost. The new /SS 
stack spanned 119 ft (36 m) in length, and its mass had grown to 
almost 60 tons. 
1 Ib = 0.45 kg. 1 mi = 1.6 km. 1 ft = 0.3 m. 1 yd = 0.9 m. 1 fathom = 1.8 m. 1 Ib/in.2 = 6.9 kPa. 
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Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
Sept. 8-20, 2000 STS 106 221,271 Ib 11d19h11min Atlantis flew the third logistics/outfitting flight to the space station. 
Terrence W. Wilcutt (landing) 2s Lu and Malenchenko conducted a space walk to prepare the ISS 
Scott D. Altman 185 rev exterior for future operations. During a week of docked operations 
Daniel C. Burbank 200 mi the crew worked as movers, cleaners, plumbers, electricians, and 
Edward T. Lu cable installers, stowing more than 6600 Ib (3 metric tons) of 
Richard A. Mastracchio supplies and filling the disposable Progress M1-3 ship (which had 
Yuri |. Malenchenko docked at Zvezda’s aft port on August 8) with trash and discarded 
Boris V. Morukov materials. 
Oct. 11-22, 2000 STS 92 204,455 Ib 12d21h43min Discovery brought the first truss structure segment (called Z1, 
Brian Duffy (landing) for its zenith port location on Node Unity), and the third docking 
Pamela A. Melroy 202 rev adapter (PMA-3, pressurized mating adapter) to the space station. 
Leroy Chiao 200 mi The Z1 truss was installed on the Node, with necessary electrical 
William S. McArthur connections being hooked up and other deployment activities being 
Peter J. K. Wisoff accomplished during a space walk on Oct. 15. PMA-3 was 
Michael E. Lopez-Alegria installed on the Node’s nadir port during a second space walk. 
Koichi Wakata Two additional space walks completed the assembly tasks for this 
mission. 
Oct. 31, 2000-Mar. Soyuz TM-31, 2R/1S 14,400 Ib 140d 23h38 min The first resident crew (Expedition One) was launched to the 
20, 2001 William Shepherd — International Space Station. After liftoff in Baikonur, the Soyuz-U 
Yuri Gidzenko 236 mi carrier injected the Soyuz TM-31 (ISS mission 2R) into orbit. Docking 
Sergei Krikalev occurred on Nov. 2. The second crewless tanker ship Progress M1-4 
(2P) arrived in November—followed by STS 97 in December withthe 
P6 photovoltaic power module; STS 98 (5A) with the U.S. Laboratory 
Destiny in February 2001; a third crewless logistics flight, Progress 
M-244 (8P), in the same month; and the Lab outfitting mission 
5A.1/STS 102 in March. 
Nov. 30—Dec. 11, STS 97 197,879 Ib 10d 19h 58 min Endeavour flew Mission 4A to the /SS, the first shuttle visit to 
2000 Brent W. Jett (landing) the station with an /SS crew awaiting it. Garneau used the remote 
Michael J. Bloomfield 170 rev manipulator system (RMS) to “park” the 17-ton ITS (integrated 
Joseph R. Tanner 234 mi truss segment) P6 above the cargo bay to allow for its temperature 
Marc Garneau adjustment. Then Garneau attached P6, attached to the Z1 ITS, on 
Carlos |. Noriega top of the Node, assisted by Noriega and Tanner on an EVA. Jett 
deployed the two solar array wings, jointly spanning 240 ft (73 m). 
Deployment of the second wing was delayed one day when the 
tensioning of the two array blankets of the first wing malfunctioned. 
Two additional EVAs were conducted, the latter to correct the 
starboard wing’s tension and to install an electrical potential sensor. 
The P6 provided the /SS with up to 62 kW of power. 
Feb. 7-20, 2001 STS 98 254,694 Ib 12d 21h 20min Atlantis/5A carried U.S. Laboratory module Destiny to the /nterna- 


Mar. 8-21, 2001 


Kenneth D. Cockrell 
Mark L. Polansky 
Robert L. Curbeam 
Marsha S. Ivins 
Thomas D. Jones 


STS 102 


James D. Wetherbee 
James M. Kelly 
Andrew S. W. Thomas 
Paul W. Richards 
James S. Voss (up) 
Susan J. Helms (up) 
Yury V. Usachev (up) 


William Shepherd (down) 


Sergei Krikalev (down) 


Yuri P. Gidzenko (down) 


(orbiter at liftoff) 
201 rev 
239 mi 


198,507 Ib 
(orbiter at liftoff) 

201 rev 

242 mi 


12 d 19 h 49 min 
166d6h 

41 min/Usachev, 
Voss, Helms 


tional Space Station. The 28 x 14 ft (8.5 x 4.3 m) Lab was installed 
at the Node by Ivins using the (RMS) robot arm, assisted by space 
walkers Curbeam and Jones. Two more EVAs followed as the /SS 
crew completed setup and activation of critical and noncritical Lab 
systems. 


Discovery/5A.1, the eighth shuttle mission to the /SS, carried 
resupply and systems/science racks for outfitting the U.S. Lab 
Destiny, all packed in Leonardo, the first of three Italian-built 
multipurpose logistics modules (MPLMs) used as “moving vans” 
During the docked period, the MPLM was transferred to the 
ISS Node's nadir port, unloaded, filled with trash and discarded 
equipment, and returned to the shuttle. In two space walks, crew 
members worked outside to support /SS assembly. In the first crew 
rotation, Usachev, Voss, and Helms became the new station crew, 
replacing Shepherd, Krikalev, and Gidzenko who, after 141 days in 
space, returned to Earth. 





Apr. 19-May 1, STS 100 228,188 Ib (orbiter 11d21h30min Endeavour on Mission ISS-6A carried the 3960-Ib (1800-kg) 
2001 Kent V. Rominger at liftoff) Canada-built Space Station Remote Manipulator System (SSRMS), 
Jeffrey S. Ashby 186 rev the second Italian-built MPLM Raffaello (total weight loaded: 
Chris A. Hadfield 253 mi 9000 Ib or 4100 kg), and three non-U.S. crew members. The mission 
John L. Phillips featured two EVAs by Parazynski and Hadfield, one to support 
Scott E. Parazynski transfer and deployment of the SSRMS Canadarm2 from the 
Umberto Guidoni shuttle payload bay to the /SS Lab module and install a UHF 
Yuri V. Lonchakov antenna, the second for external connections and removal of an ECS 
(Early Communications System) antenna from the Node. After the 
MPLM was berthed at the Node, the crew transferred two EXPRESS 
payload racks and individual components from it to the /SS and 

loaded it with return cargo. Endeavour undocked on April 29. 

1 Ib = 0.45 kg. 1 mi = 1.6 km. 1ft = 0.3 m. 1 yd = 0.9 m. 1 fathom = 1.8 m. 1 Ib/in.2 = 6.9 kPa. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 
A. Weight 
B. Revolutions 
Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
Apr. 28-May 6, Soyuz TM-32, 2S 14,400 Ib 7 d 22h 4 min The first “taxi” flight to the /SS brought a fresh Soyuz returnvehicle 
2001 Talgat A. Musabaev 123 rev for the /SS crew. The flight also carried the first commercial space 
Yuri M. Baturin 252 mi tourist, U.S. businessman Dennis Tito. The Soyuz capsule docked 
Dennis Tito at the FGB nadir port on Apr. 30. On May 5, the three visitors climbed 
into the old Soyuz TM-31, docked at the Service Module aft end, 
and returned to Earth, landing in Kazakhstan. 
July 12-24, 2001 STS 104 258,222 Ib (orbiter 12 d 18 h 35 min Atlantis, on mission ISS-7A, carried the 6.5-ton Joint Airlock 
Steven W. Lindsey at liftoff) Quest, which was installed at the starboard berthing port of the 
Charles O. Hobaugh 201 rev Node, completing Phase 2 of the /SS program. Also installed were 
Michael L. Gernhardt 250 mi four high-pressure tanks with oxygen and nitrogen. These activities 
Janet L. Kavandi were supported by three space walks by Gernhardt and Reilly, the 
James F. Reilly last one from the new airlock. 
Aug. 10-22, 2001 STS 105 257,748 Ib 11d21h13min; Discovery delivered the Expedition 3 crew (Culbertson, Dezhurov, 
Scott J. Horowitz (orbiter at liftoff) 128 d 20 h 45 Tyurin) to the International Space Station and picked up the 
Frederick W. Sturckow 185 rev min/Culbertson, second station crew (Usachev, Voss, Helms) on the second crew 
Patrick G. Forrester 250 mi Dezhurov, Tyurin rotation mission. As ISS assembly mission 7A.1, Discovery also 
Daniel T. Barry brought equipment and supplies, carried in the MPLM (multipurpose 
Frank L. Culbertson (up) logistics module) Leonardo on its second flight, which was docked 
Vladimir N. Dezhurov (up) temporarily at the Node for unloading and reloading with return 
Mikhail Tyurin (up) cargo. Barry and Forrester conducted two space walks to install 
Yury V. Usachev (down) an ammonia servicer, materials experiment carriers, handrails, and 
James S. Voss (down) cables. 
Susan J. Helms (down) 
Sept. 14, 2001 4R (DC-1) 14,500 Ib — The Russian Docking Compartment (DC-1) Pirs was launched on a 
Crewless = Soyuz-U rocket from Baikonur/Kazakhstan at 7:35 p.m. EDT. The 
250 mi Stikovochnly Otsek No. 1 (SO1) linked up with the /SS on Sept. 
16. The new 16-ft-long (4.88-m) module, with a mass of 8000 Ib 
(3.6 metric tons), is an additional docking port for future Soyuz and 
Progress vehicles arriving at the station, plus an added stowage area 
and an airlock for EVAs in Russian Ortan-M spacesuits. DC-1 was 
launched attached to the cargo ship Progress M-SO1, a Soyuz- 
derived instrumentation and propulsion section of about 3000 Ib 
(1.36 metric tons), which was jettisoned on Sept. 26. /SS mass now 
totaled 303,500 Ib (137.67 metric tons). 
Oct. 21-30, 2001 Soyuz TM-33, 3S 14,400 Ib 9d 19 h 0 min Second /SS “taxi” flight, bringing a fresh Soyuz crew return vehicle 
Viktor Afanasyev ~153 rev (CRV) to the station, good for another 200 days. Haigneré was fly- 
Konstantin Kozeyev 257 mi ing commercially. Crew assignment included the French Andromedé 
Claudie Haigneré (Andromeda) science mission. Afanasyev, Kozeyev, and Haigneré 
(France) departed in the old Soyuz TM-32 and landed in Kazakhstan under 
automatic control. 
Dec. 5-17, 2001 STS 108 ~257,000 Ib Td 19h On first utilization fiight (UF-1) of the /SS program, Endeavour carried 
Dominic L. Gorie (orbiter at liftoff) 36 min; 196 d MPLM Raffaello with approximately 3 tons (2.7 metric tons) of cargo 
Mark E. Kelly 186 rev 20 h 39 min/ plus the fourth expeditionary crew (Onufrienko, Bursch, and Walz) to 
Linda M. Godwin 334 mi Onufrienko, rotate with the Expedition 3 crew (Culbertson, Dezhurov, and Tyurin). 
Daniel M. Tani Walz, Bursch The MPLM was transferred to the Node nadir port with the shuttle 
Yuri Onufrienko (up) RMS. Tani and Godwin conducted an EVA on Dec. 10. After cargo 
Carl E. Walz (up) unloading and its replacement with discarded equipment and waste, 
Daniel W. Bursch (up) Raffaello was returned to the shuttle cargo bay. Before its undocking, 
Frank Culbertson (down) Endeavour conducted three reboosts of the station, with a total mean 
Vladimir Dezhurov (down) altitude increase of 7.6 n.mi. (14 km). 
Mikhail Tyurin (down) 
Mar. 1-12, 2002 STS 109 257,917 Ib 10d 22h Fourth “service call” to the Hubble Space Telescope (HST) was per- 
Scott D. Altman (orbiter at liftoff) 10 min formed by Columbia after undergoing more than 100 improvements 
Duane G. Carey 165 rev since its last fiight (STS 93). During flve space walks, HST received 
John M. Grunsfeld 359 mi new, more durable solar arrays (SA) with their diode boxes (DBAs), 
Nancy J. Currie a large gyroscopic Reaction Wheel Assembly (RWA) for pointing, a 
Richard M. Linnehan new power control unit (PCU), a new advanced camera for surveys 
James H. Newman (ACS), and an experimental cryocooler to restore the dormant Near 
Michael J. Massimino Infrared Camera and Multi-Object Spectrometer (NICMOS) to oper- 
ation. After reboost on Mar. 8 by about 4 mi (6.4 km), HST was 
deployed from Columbia on Mar. 9. 
Apr. 8-19, 2002 STS 110 257,079 lb 10d19h Atlantis flew mission 8A to the /SS, carrying the SO truss and the 
Michael J. Bloomfleld (orbiter at liftoff) 43 min Mobile Transporter (MT). Four EVAs were conducted to install the 
Stephen N. Frick 171 rev SO and the MT. On Apr. 14 the MT rail cart traversed a distance of 
Rex J. Walheim 250 mi 72 ft (22 m) during this first operation of a “railroad in space.” This 
Ellen Ochoa was also the first flight during which the SSRMS robot arm of the 
Lee M.E. Morin ISS was used to maneuver spacewalkers around the station, and 
Jerry L. Ross the flrst where all EVAs of a shuttle crew were performed from the 
Steven L. Smith station's own airlock, Quest. 
1 Ib = 0.45 kg. 1 mi = 1.6 km. 1 ft = 0.3 m. 1 yd = 0.9 m. 1 fathom = 1.8 m. 1 Ib/in.2 = 6.9 kPa. 
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Dates launched Designation C. Max. distance 
and recovered and crew from Earth Duration Remarks 
Apr. 25—-May 4, Soyuz TM-34 14,400 Ib 9d21h25 min Third “taxi” fiight to deliver a fresh CRV to /SS. Shuttleworth was 
2002 Yuri Gidzenko ~154 rev the second “space tourist.” A science program, sponsored by Italy 
Roberto Vittori (Italy) 256 mi and South Africa, included 11 experiments. Gidzenko, Vittori, and 
Mark Shuttleworth Shuttleworth returned in Soyuz TM-33, which had been docked to 
(South Africa) the /SS since October 2001. 
June 5-19, 2002 SIS 11 256,889 Ib 13d21h Endeavour, on ISS Mssion UF-2, carried the flfth /SS resident crew 
Kenneth D. Cockrell (orbiter at liftoff) 35 min; 184d (Korzun, Whitsun, and Treschev), returning with the fourth station 
Paul S. Lockhart 217 rev 22h 14min crew (Onufrienko, Bursch, and Walz). Its cargo comprised MPLM 
Franklin Chang-Diaz 240 mi 23 s/Korzun, Leonardo on its third fiight, loaded with 5600 Ib (2500 kg) of equip- 
Philippe Perrin (France) Whitson, ment and consumables; the Mobile Base System (MBS); a replace- 
Valery Korzun (up) Treschev ment wrist roll (WR) joint for the SSRMS; meteoroid/orbital debris 
Peggy Whitson (up) protection shields (ODPs) for the Zvezda Service Module; and new 
Sergei Treschev (up) science payloads. During three spacewalks, astronauts installed the 
Yuri Onufrienko (down) MBS on the MT (mobile transporter), temporarily stowed the ODPs, 
Carl E. Walz (down) and replaced the SSRMS WR joint. 
Daniel W. Bursch (down) 
Oct. 7-18, 2002 SIS 112 256,917 Ib 10d19h Atlantis flew Mission 9A to the ISS. On Oct. 10, its main payload, the 
Jeffrey S. Ashby (orbiter at liftoff) 58 min 14.5-ton (13.1-metric-ton) S1 truss element, was transferred with the 
Pamela A. Melroy 172 rev Canadarm2 from the shuttle cargo bay to the /SS and flrmly attached 
David A. Wolf 252 mi to the SO center truss segment atop the Lab module. The installa- 
Sandra H. Magnus tion for the first time featured the SSRMS operating from its new 
Piers J. Sellers MBS. The crew also transferred about 1000 Ib (450 kg) of supplies, 
‘chi equipment and other cargo to the station. Three spacewalks were 
ee performed to connect the S1 and SO trusses with electrical, flber- 
optical, data, and coolant umbilicals, and to install external systems. 
Oct. 29-Nov. 9, Soyuz TMA-1, 5S 15,920 Ib 10d20h Fourth taxi flight to the /SS, to replace the previous CRV, Soyuz 
2002 Sergei Zalyotin ~170 rev 53 min TM-34. TMA-1 was the first of a new version of the Soyuz, with 
Yuri Lonchakov 260 mi modifications financed by NASA for improved safety and widened 
Frank De Winne crewmember size range. A science program sponsored by Belgium 
(ESA, Belgium) and the Russian space agency Rosaviakosmos (RSA) included 23 
experiments. The crew returned on the “old” Soyuz TM-34. 
Nov. 23-Dec. 7, STS 113 256,747 Ib 14d 6h 47 min; Mission [SS-11A on Endeavour carried the Expedition 6 crew (Bow- 
2002 Jim Wetherbee (orbiter at liftoff) 161d1h ersox, Budarin, and Pettit), the fourth truss element, P1 (of 11 total), 
Paul Lockhart 215 rev 14 min 38 s/ and other equipment and resupply. The 14.5-ton (13.1-metric-ton) 
Michael Lopez-Alegria 247 mi Bowersox, P1 was transferred and installed on the S0 truss portside, and three 
John Herrington Budarin, Petit spacewalks were conducted to connect the P1 and install other gear. 
Kenneth Bowersox (up) After Endeavour undocked on Dec. 2, it launched two tethered mini- 
Nikolai Budarin (up) satellites called MEPSI (microelectromechanical systems-based Pi- 
Donald Pettit (up) cosat Inspector), and returned with the Expedition 5 crew (Korzun, 
Valery Korzun (down) Whitson, and Treschev). 
Peggy Whitson (down) 
Sergei Treschev (down) 
Jan. 16-Feb. 1, STS 107 263,706 Ib 15d 22h 21 min Columbia flew on a research mission with 32 payloads and 59 sepa- 
2003 Rick D. Husband (orbiter at liftoff) rate investigations. Both in the shuttle middeck and in the Spacehab 
William C. McCool 255 rev Research Double Module (RDM), on its first flight in the cargo bay, 
David M. Brown 177 mi the crew worked on a mixed complement of competitively selected 
Kalpana Chawla and commercially sponsored research in the space, life, and physi- 
Michael P. Anderson cal sciences. Columbia was lost with its crew during reentry on Feb. 
Laurel B. Clark 1, 16 min before planned landing at Kennedy Space Center, when 
llan Ramon (Israel) it violently disintegrated over Texas. Debris from the orbiter fell in 
Texas and other states. The Columbia Accident Investigation Board 
(CAIB) later concluded that one of the left wing's leading edge RCC 
(reinforced carbon-carbon) elements or an associated filler strip of 
the heat shield had been compromised during ascent to orbit by a 
piece of debris, possibly a suitcase-sized chunk of foam insulation 
blown off the external tank by the supersonic air stream, hitting the 
leading edge and rendering the wing unable to withstand reentry 
heating longer than about 8 min after entry interface. 
Apr. 25-Oct. 27, Soyuz TMA-2, 6S 15,920 Ib 184d 22h The first crew rotation flight by a Soyuz carried Expedition 7, a 
2003 Yuri Malenchenko ~2889 rev 47 min two-man “caretaker” crew, to stretch out /SS consumables during 
Ed Lu 256 mi the shuttle stand-down. Soyuz TMA-2 docked to the /SS on April 
28, replacing the previous CRV, Soyuz TMA-1. The Expedition 6 
crewmembers (Bowersox, Budarin, and Pettit) returned to Earth in 
Soyuz TMA-1, experiencing an unexpected switch of their onboard 
computer to the backup reentry mode of pure ballistic descent, 
which almost doubled their peak deceleration (~9 g), and missed 
the primary landing site by an “undershoot” of ~300 mi (480 km), 
landing on May 3. Recovery forces reached the landing site after a 
delay of 4.5 h, finding the crew in good health outside the capsule. 
1 Ib = 0.45 kg. 1 mi = 1.6 km. 1 ft = 0.3 m. 1 yd = 0.9 m. 1 fathom = 1.8 m. 1 Ib/in.2 = 6.9 kPa. 
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Oct. 15-16, Shenzhou 5 17,000 Ib 21 h 23 min Orbital launch of the first Chinese “taikonaut” on the new human- 

2003 (“Divine Vessel 5”) 14 rev rated Long March (Chang Zheng, CZ) 2F rocket. Landing was about 
Yang Liwei 213 mi 600 mi (1000 km) east of the launch area, the Jinquan Satellite 
Launch Center in the Gobi Desert. 

Oct. 18, Soyuz TMA-3, 7S 15,920 Ib 9 d, 21 h 3 min/ Second /SS crew rotation flight by a Soyuz carried Expedition 
2003-Apr. 29, Michael Foale ~3053 rev Duque; 194d 8 (Foale and Kaleri), the second “caretaker” crew, and visiting 
2004 Alexander Kaleri 249 mi 18 h 35 min/ crewmember Duque. On Oct. 27, the previous CRV, Soyuz TMA-2, 

Pedro Duque Foale and undocked from the of the FGB/Zarya module nadir port, where it had 
(ESA, Spain) Kaleri stayed for 182 days, and landed in Kazakhstan with Malenchenko, 
Lu, and Duque. 
Apr. 18-Oct. 24, Soyuz TMA-4, 8S 15,920 Ib 10d20h52min/ Third crew rotation flight by a Soyuz carried Expedition 9 crew 
2004 Gennady Padalka ~2951 rev Kuipers; 187d (Padalka and Fincke) plus visiting cosmonaut/researcher Kuipers. 
Michael Fincke 241 mi 21h17 min/ TMA-4 docked to the /SS on Apr. 21, replacing the previous CRV, 
André Kuipers (ESA, Padalka and Soyuz TMA-3, which, when it undocked on Apr. 29, had reached an 
Netherlands) Fincke in-space time of 194 days. In it, Expedition 8 crewmembers Foale 
and Kaleri plus Kuipers landed in Kazakhstan. 

Oct. 13, Soyuz TMA-5, 9S 15,920 Ib 9 d 21 h 30 min/ Fourth /SS crew rotation flight by a Soyuz carried Expedition 10 
2004-Apr. 24, Leroy Chiao ~3031 rev Shargin; 192d (Chiao and Sharipov), the fourth “caretaker” crew, plus visiting cos- 
2005 Salizhan Sharipov 241 mi 19 h 35 min/ monaut/researcher Shargin. On Oct. 16, TMA-3 docked to the /SS 

Yuri Shargin (Russia) Chiao and at the DC-1 docking compartment's nadir (downward)-pointing port. 
Sharpov On Oct. 23, the previous CRV, Soyuz TMA-4, undocked from the FGB 
nadir port, where it had stayed for 186 days (187 days in space), and 

landed in Kazakhstan with Padalka, Fincke, and Shargin. 

Apr. 14—Oct. 10, Soyuz TMA-6, 10S 15,920 Ib 9d21h22min/ _ Fifth crew rotation flight by a Soyuz carried Expedition 11 crew 
2005 Sergei Krikalev ~2818 rev Vittori; 179 d (Krikalev and Phillips) plus visiting crewmember Vittori. TMA-6 

John Phillips 235 mi 23 min/ docked to the /SS on Apr. 16, replacing the previous CRV, Soyuz 

Roberto Vittori (ESA, Krikalev and TMA-5, which, when it undocked on Apr. 24, had reached an in- 

Italy) Phillips space time of 192 days. In it, Chiao, Sharipov, and Vittori landed in 
Kazakhstan. 

July 26-Aug. 9, STS 114 267,825 Ib 13d21h Return-to-flight mission of Discovery. Prior to final approach to the 

2005 Eileen M. Collins (orbiter at liftoff) 32 min 22s ISS, Discovery performed an R-bar or rendezvous pitch maneuver 
James M. Kelly 219 rev (RPM) at ~600 ft (180 m) distance under the /SS, a 360° backflip 
Andrew S. W. Thomas 218 mi to allow digital imagery of its thermal protection system from the 
Charles J. Camarda ISS. About 15,000 Ib (6800 kg) of cargo was transferred from the 
Wendy B. Lawrence shuttle's flight container, the MPLM Raffaelo, to the /SS and approx- 
Stephen K. Robinson imately 8600 Ib (3900 kg) from the station to the shuttle for return 
Soichi Noguchi (Japan) to Earth. Crewmembers performed three EVAs, tested new safety 
equipment and repair procedures, and carried out a first-of-its kind 
spacewalking heat shield repair. 

Sept. 30, Soyuz TMA-7, 11S 15,920 Ib 9d 21h 14 min/ Sixth /SS crew rotation flight by a Soyuz carried Expedition 12 crew 
2005-Apr. 8, William McArthur ~2992 rev Olsen; 189 d (McArthur, Tokarev) and the third “space tourist” (Olsen). TMA-7 
2006 Valery Tokarev 229 mi 19 h 53 min/ docked to the /SS on Oct. 3. On Oct. 10, Soyuz TMA-6 undocked 

Gregory Olsen McArthur, from the FGB nadir port, where it had stayed for 177 days, and 
(United States) Tokarev landed in Kazakhstan with Krikalev, Phillips, and Olsen. 

Oct. 12-17, Shenzhou 6 17,000 Ib 115 h 32 min Second orbital flight of a crewed spacecraft on China's Long March 

2005 Fei Junlong 75 rev 2F rocket, this time carrying two “taikonauts.” Both crewmembers 
Nie Haisheng 209 mi left their seats, floated in space, and at one time took off their space 
suits. 

Mar. 29-Sept. Soyuz TMA-8, 12S 15,920 mi 9d 22h 18 min./ Seventh /SS crew rotation flight by a Soyuz carried Expedition 13 
28, 2006 Pavel V. Vinogradov ~2885 rev Pontes; 182d crewmembers Vinogradov and Williams, and visiting crewmember 

Jeffrey Williams 229 mi 22 h 43 min/ Pontes. On Apr. 8 the previous CRV, Soyuz TMA-7, undocked from 
Marcos Cezar Vinogradov the /SS and returned to Kazhakstan, carrying McArthur, Tokarev, and 
Pontes (Brazil) and Williams Pontes. 

July 4-17, 2006 STS 121 266,962 Ib 12d18h Discovery, on Mission ULF1.1 to the /SS, flew the first of two 
Steven Lindsey (orbiter at liftoff) 37 min; 171d test flights after return to flight. The joint crews transferred the 
Mark Kelly 202 rev 3h 54 min/ fully loaded MPLM Leonardo, conducted three EVAs, reberthed the 
Stephanie Wilson 220 mi Reiter loaded MPLM in the shuttle cargo bay, performed external inspec- 
Michael Fossum tions of the orbiter heat shielding, and completed cargo transfers. 
Piers Sellers Reiter remained on the /SS when Discovery returned to Earth, joining 
Lisa Nowak Expedition 13. 
Thomas Reiter 
(ESA, Germany) [up] 

1 Ib = 0.45 kg. 1 mi = 1.6 km. 1 ft = 0.3 m. 1 yd = 0.9 m. 1 fathom = 1.8 m. 1 Ib/in.2 = 6.9 kPa. 
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TABLE 1. Crewed space flights 1961-2006 (cont.) 


Dates launched 
and recovered 


A. Weight 
B. Revolutions 


Remarks 





Sept. 9-21, 2006 


Sept. 18, 2006 — 


Dec. 9-22, 2006 


Designation C. Max. distance 
and crew from Earth Duration 

STS 115 269, 840 Ib 11d19h6 min 
Brent Jett (orbiter at liftoff) 

Chris Ferguson 187 rev 
Dan Burbank 217 mi 
Heidemarie 
Stefanyshyn-Piper 
Joe Tanner 
Steve MacLean (Canada) 

Soyuz TMA-9, 13S 15,920 Ib 10d21h4min 
Michael Lopez-Allegria — 17 s/Ansari 
Mikhail Tyurin 216 mi 
Anousheh Ansari 

STS 116 265,466 Ib 12d 20h 
Mark Polansky (orbiter at liftoff) 45 min 
William (Bill) Oefelein 202 rev 
Nicholas Patrick 222 mi 


Robert (Bob) Curbeam 
Joan Higginbotham 
Christer Fuglesang 
(ESA, Sweden) 

Sunita Williams (up) 
Thomas Reiter (down) 


Atlantis, on Mission 12A to the /SS, flew Shuttle Flight Test 2. Prior 
to final approach, Atlantis performed an RPM, similar to that of STS 
114. Preliminary views of the thermal protection system did not indi- 
cate any signs of damage. Crews installed the P3/P4 truss structure 
with the second set of solar arrays (of four), deployed the two solar 
array wings and a huge radiator panel from the P4 truss, conducted 
three EVAs, and resupplied the station. 


This flight carried new Expedition 14 crewmembers Lopez-Alegria 
and Tyurin, replacing Expedition 13 crewmembers Vinogradov and 
Williams, plus Iranian-American “taxi cosmonaut” Ansari, the first 
female spaceflight participant to visit the space station, flying under 
contract with the Russian Federal Space Agency. Expedition 13 
crewmember Reiter remained on the station until Dec. 20, joining 
Expedition 14. On Sept. 28, Soyuz TMA-8 undocked from the /SS 
FGB nadir port and landed in Kazakhstan with Vinogradov, Williams, 
and Ansari. Return and landing of Soyuz TMA-9 was scheduled for 
Mar. 19, 2007, which would give Lopez-Alegria and Tyurin 182 d in 
space. 


Discovery, on Mission 12A.1 to the /SS, delivered and installed 
the P5 truss segment (serving as spacer at the end of the P4 
truss), reconfigured the electrical power and thermal control sys- 
tems to their final configurations and activated them, and transferred 
extra oxygen for EVA prebreathing and nitrogen to storage tanks 
on the outside of the airlock. Astronauts conducted four EVAs, in- 
cluding one that was not originally planned to support continuing 
attempts by ground controllers to retract the 4B solar array of the 
P6 truss. Williams replaced Reiter as a member of Expedition 14 on 
the /SS. 





1 Ib = 0.45 kg. 


1 mi = 1.6 km. 


1ft = 0.3 m. 1 yd = 0.9 m. 1 fathom = 1.8 m. 


1 Ib/in.2 = 6.9 kPa. 
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reduce g loads and heat pulse, especially for higher- 
energy missions such as return from high altitudes 
and lunar or planetary trips, advanced reentry vehi- 
cles fly a lifting reentry. Some lift can even be ob- 
tained from a basically ballistic capsule. A basic fea- 
ture of the Soviet Soyuz and the American Gemini 
and Apollo spacecraft, as well as the United States 
space shuttle and future crewed vehicles, is that the 
return trajectory is extended by applying to the vehi- 
cle an aerodynamic crosswind or lifting force L that 
is created by offsetting the center of gravity from 
the center of aerodynamic pressure of the spacecraft 
docated at the centerline for symmetrical bodies) in 
order to let the airflow hit the vehicle asymmetrically 
at an angle of attack. The direction and magnitude of 
this force is controlled by signals from an on-board 
guidance computer. Although often only a small frac- 
tion of the drag force D (L/D for Apollo-type bodies 
is typically 1/; to 1/5), this lift is sufficient to mod- 
ulate the trajectory for landing point control and 
reduction of heating (not total, or integrated, heat 
load) and deceleration load peaks (Fig. 1). 
See AEROTHERMODYNAMICS; ATMOSPHERIC ENTRY; 
SPACE SHUTTLE. 

Subsystems. In addition to the thermal protection 
system and reentry and landing systems, basic fea- 
tures of crewed spacecraft include the following 
subsystems: (1) crew equipment (couches and re- 
straints, operational aids, clothing, food and water, 
hygiene equipment, medical supplies, and survival 
equipment); (2) displays and controls; (3) electri- 
cal power system; (4) environmental control and life 
support system; (5) reaction control system (for at- 
titude maneuvers); (6) on-board propulsion system 
(for velocity-change maneuvers); (7) telecommuni- 
cations system (on-board voice and data/telemetry 
radio-frequency equipment and antennas); (8) guid- 
ance and navigation system, which is interrelated 
with (9) the stabilization and control system (iner- 
tial sensors, optical instruments, and computer); 
(10) mechanical systems (escape, docking, land- 





Apollo Gemini 

Thickness at 
stagnation point 1.7 in. (4.3 cm) 1 in. (2.5 cm) 
Max. heat rate 

Btu/(ft2)(s) 620 120 

J/(m2)(s) 7.04 x 106 1.36 x106 
Heat load 

Btu/ft2 36,000 12,100 

J/m2 4.09 x 108 1.37108 
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Fig. 1. Apollo and Gemini reentry conditions; 1 ft = 0.3 m; 
1 nmi = 1.852 km. (NASA) 


ing); and (11) extravehicular activity (EVA) systems, 
including space — suits, life support umbilicals 
or backpacks, mobility aids, restraints, and hand- 
holds. See SPACE NAVIGATION AND GUIDANCE; SPACE 
TECHNOLOGY. 

Environmental controls. The main system that distin- 
guishes crewed spacecraft from other spacecraft is 
the environmental control and life support system. 
It provides the crew with a suitably controlled atmo- 
sphere. Because all atmospheric supplies must be 
carried into space, it is essential to recirculate and 
purify the spacecraft atmosphere to keep the total 
weight of the vehicle within reasonable limits. In 
future years, partially or completely closed life sup- 
port systems will even recover oxygen and water 
from the atmosphere for reuse, reducing resupply 
requirements. 

Soviet and, today, Russian spacecraft have used 
a nitrogen-oxygen environment at normal sea-level 
pressure. United States spacecraft have used pure 
oxygen at a pressure of 5 Ib/in.? (34.5 kilopascals) 
during space flight. It provides the lightest, simplest 
system possible, and was adopted for this reason. 
Although some medical disadvantages were feared, 
they did not materialize. However, the technique re- 
quires complete control of oxidizable (inflammable) 
materials on board. For the extended mission dura- 
tion of post-Apollo-type vehicles, such as Skylab, the 
space shuttle and the International Space Station, 
NASA has used a more Earth-like atmosphere. See 
SPACE BIOLOGY. 

Supplies. All supplies for the entire flight must be 
carried in the spacecraft, unless special supply mis- 
sions are to be flown, for example, to a station in 
Earth orbit. An Earth-like environment is required 
because of human dependence on certain gaseous 
constituents for breathing. There must be provision 
for controlling the metabolically produced carbon 
dioxide (CO2) in the spacecraft atmosphere; control- 
ling water vapor and maintaining a comfortable hu- 
midity; reducing odor and removing toxic products; 
maintaining a comfortable thermal environment, at- 
mospheric pressure, sound-pressure level, and radia- 
tion level; meeting drinking water and nutritional re- 
quirements; disposing of waste; supporting the crew 
during emergencies; and remedying weightlessness. 

Space suits. Space suits or pressure suits are mo- 
bile spacecraft or chambers that house the astro- 
nauts and protect them from the hostile environment 
of space. They provide atmosphere for breathing, 
pressurization, and thermal control; protect astro- 
nauts from heat, cold, glare, radiation, and microme- 
teorites; contain a communication link and hygiene 
equipment; and must have adequate mobility. The 
suit is worn during launch, docking, and other criti- 
cal flight phases. During noncritical flight phases, the 
space suit is removed for greater comfort. This prac- 
tice has also been adopted by Soviet cosmonauts. 

The use of two gases at atmospheric pressure on 
board, while excellent for intravehicular activities 
(IVA), becomes somewhat disadvantageous when 
extravehicular activity is required, for two princi- 
pal reasons. First, adequate mobility in a space suit 


pressurized to over 3-4 Ib/in.2 (21-28 kPa) has 
proved extremely difficult. Second, at lower suit 
pressures, suitable for mobility, only pure oxygen 
can be used for breathing. In this case, because of 
the lower gas density, cooling may have to be pro- 
vided to the astronaut with water, circulating in tubes 
sewn into an undergarment. 

Power. The various life-support, communications, 
and other systems in a spacecraft depend upon ad- 
equate power. For very short missions conventional 
battery supplies may be used, but for longer-duration 
missions battery supplies must be constantly 
recharged from some other source such as a solar 
or nuclear supply. Fuel cells, which generate elec- 
tric power from the chemical combination of suit- 
able reactants such as hydrogen and oxygen, have 
proved effective in Apollo, Skylab, and the shuttle. 
Solar power is particularly useful within the orbit 
of Mars. Nuclear power may be used anywhere in 
space, thus removing the requirement to stay close 
to the Sun, but requires special provision for shield- 
ing spacecraft occupants and equipment from the 
radiations. See FUEL CELL; NUCLEAR BATTERY; SOLAR 
POWER; SPACE POWER SYSTEMS. 

Communications. Radio and television communica- 
tions channels permit a two-way exchange of infor- 
mation and instructions that are important for both 
normal flight and emergencies. Through telemeter- 
ing links, the operations center monitors the health 
and operation of the spacecraft and its crew, and re- 
ceives various kinds of engineering, operational, and 
research data. 

A space flight communications system requires 
suitable transmitters, receivers, display units, re- 
corders, power supplies, and antennas. The anten- 
nas impose challenging design requirements. The 
ground antennas must be able to track the spacecraft, 
and should be large enough to minimize the size and 
complexity of the spacecraft antenna. For near-Earth 
orbits the spacecraft or its antenna must be suffi- 
ciently maneuverable to keep the antenna pointed at 
the receiving station on the ground; for deep-space 
missions rapid maneuverability is not required, but 
accurate pointing is necessary. See SPACE COMMUNI- 
CATIONS; SPACECRAFT GROUND INSTRUMENTATION. 

Reliability. Because of the potential hazards of space 
flight to personnel, crewed spacecraft must meet 
stringent requirements of safety and hardware re- 
liability. Reliability is associated with the probabil- 
ity that the spacecraft systems will operate properly 
for the required length of time and under the spec- 
ified conditions. To assure survival and return in all 
foreseeable emergencies, the design of a crew-rated 
spacecraft includes standby systems (double or triple 
redundancy) and allows for launch escape, alterna- 
tive and degraded modes of operation, contingency 
plans, emergency procedures, and abort trajectories 
to provide maximum probability of mission success. 
The priority order of this reliability requirement is 
(1) crew safety, (2) minimum achievable mission ful- 
fillment, (3) mission data return, and (4) minimal 
degradation. See RELIABILITY (ENGINEERING). 

Crew recovery. For launch escape, the Mercury used 
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Fig. 2. Mercury spacecraft, showing approximate dimensions. 1 m = 3.3 ft. (NASA) 


a solid rocket mounted on a tower (Fig. 2) to pro- 
vide the necessary propulsive force to escape froma 
malfunctioning launch vehicle. This device was jet- 
tisoned after the critical flight phase. The Gemini 
and early Soviet spacecraft had ejection seats with 
parachutes. These were used on some of the early 
Vostok flights to recover the crew on land, but in 
Gemini they were intended only for launch escape. 
Because a vehicle launched from the Russian sites 
passes over land during the extremely critical flight 
phase immediately after launch, the Russians provide 
for landing on land to effect a safe recovery from an 
abort in this region. It is also used as the prime re- 
covery method after reentry. However, Mercury and 
Gemini were designed for launch over the Atlantic 
Ocean, eastward from Cape Canaveral, Florida, so 
that a water-landing capability was required. Also, 
because much of the orbital track was over water, 
a water-landing technique would be necessary to 
cope with the majority of possible random abort 
situations. The same logic applied to the lunar mis- 
sions of Apollo. Although the United States technique 
of water recovery has been highly developed and 
has proven eminently satisfactory for non-reusable 
spacecraft, more advanced spacecraft, such as the 
space shuttle, are designed for landing on runways 
much like an airplane. 

In-flight maintenance. The mission reliability of space- 
craft can be increased by providing for inflight main- 
tenance and repairs by the crew. The decision to use 
in-flight repair on crewed spacecraft, however, must 
take into consideration the effect on the overall prob- 
ability of mission success and flight crew safety, and 
the associated weight penalties which result from 
this reliability improvement technique. 

Emergencies. Both equipment and operational pro- 
cedures are carefully planned and designed to mini- 
mize harmful results from emergencies. On the oper- 
ational side, procedures are worked out in advance 
for a wide range of imaginable occurrences. On the 
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equipment side, ample use is made of redundancies, 
so that if one piece of equipment fails an alternate 
means of performing that equipment’s functions is 
available. The value of redundancy was evident in the 
Apollo 13 emergency. When an explosion occurred 
in one of the Apollo fuel cell oxygen tanks, substan- 
tially disabling the spacecraft, the ability of the at- 
tached lunar module to provide oxygen, power, ori- 
entation, and propulsion made it possible to return 
the astronauts safely to the ground after circumnavi- 
gating the Moon. 

Preventive measures. AS an example of the extreme 
complexity of crewed space flight, the Saturn 5 
launch vehicle of the United States, with the Apollo 
spacecraft combined, had between 5 and 6 million 
functional parts, miles of wiring, and thousands of 
connections. To assure the functional reliability re- 
quired for the lunar landing of humans, preventive 
measures had to include conservative design (with 
a wide margin of safety), stringent technical and ad- 
ministrative controls, and extensive premission sim- 
ulation and testing, far beyond the required safety 
level, under various mission conditions. Without 
high-speed digital computers, a successful lunar 
landing program would not have been possible and 
the space shuttle could not fly. See HUMAN-MACHINE 
SYSTEMS; SERVOMECHANISM; SIMULATION. 

Physiological stresses. Experience gained from both 
the United States and Russia’s crewed space flight 
programs has provided valuable physiological data. 
Apollo, Skylab, Salyut, and Mir were especially fruit- 
ful. Voice, heart rate, respiratory rate, blood pres- 
sure, and body temperature are monitored during pe- 
riods of dynamic stress (for example, launch, reentry, 
and extravehicular activity). Extensive ground stud- 
ies and laboratory tests have been performed. 

The stress imposed by long exposure to weight- 
lessness is still a major question. Astronauts adapt 
fairly quickly to the sensation of weightlessness, but 
the complete role played by the inner ear system in 
this adaptation has not been determined. 

Cardiovascular deconditioning was detected post- 
flight during the Gemini program. Conditioning 
equipment was developed subsequently to be used 
in space to measure the amount of deconditioning 
and to act as a countermeasure to weightlessness. 
The most significant type of this equipment is a 
lower-body negative pressure device which exposes 
the lower half of the body to reduced pressure; en- 
gorgement of the veins in the legs and abdominal 
viscera results, thereby counteracting blood pooling 
in the chest cavity and increased pressure in the left 
atrium at 0 g. This effect is believed to prevent the 
loss in blood volume otherwise produced as engorge- 
ment of the chest veins stimulates diuresis. 

Mineral balance is also markedly affected by 
weightlessness. Loss of the bone calcium secreted 
in urine and feces is observed, calling for counter- 
measures. Similar loss of nitrogen indicates loss of 
muscle tissue, which is shown clearly by circumfer- 
ential measurement of the astronauts’ legs. These ef- 
fects are considered serious, and although Skylab, 
Salyut, and Mir have shown that humans can stay in 


space for appreciable periods of time, it is the con- 
sidered opinion of those experts who have reviewed 
the data that for crewed missions lasting much longer 
than a year the causes of mineral loss have to be bet- 
ter understood and methods developed to prevent 
and control these losses. Since similar mineral losses 
are observed during prolonged bed rest, bed-rest sub- 
jects are being used to study this problem further. 
Biomedical monitoring throughout a space flight is 
considered important to the safety and effectiveness 
of the crew. Although life support and environmen- 
tal parameters (for example, total pressure, oxygen 
partial pressure, temperature, and radiation level) 
can be expected to provide the earliest indication 
of non-nominal conditions, their significance and ur- 
gency for corrective action usually require correla- 
tion with biomedical data. The medical evaluation 
of potential microbial and chemical contamination 
aboard the spacecraft requires microbiological, pul- 
monary, and metabolic studies. Basic hematological 
and biochemical measurements are being obtained at 
frequent intervals during the initial portion of a space 
flight to obtain a database. See AEROSPACE MEDICINE. 


Soviet/Russian Programs 


In the former Soviet Union, ideas and concepts of 
crewed space flight date back at least to the last 
century, to K. E. Tsiolkovskiy, now considered one 
of the pioneers of space flight. After the first Sput- 
nik launch on October 4, 1957, developments of 
crewed space flight capability followed in quick 
succession, leading from the first-generation Vostok 
(East) to the second-generation Voskhod (Ascent) 
and to the third-generation Soyuz (Union) space- 
craft. Originally engaged in an aggressive program 
to land a cosmonaut on the Moon before the United 
States lunar orbit mission of Apollo 8 in December 
1968, the Soviet Union redirected its aims, after four 
test failures of its N1 heavy-lift launcher, toward the 
development of permanent human presence in Earth 
orbit. 

Vostok. The Vostok (Fig. 3), a single-seater for 
short-duration missions and ballistic reentry from 
Earth orbits, consisted of a near-spherical cabin 
covered entirely with thermal protection system ma- 
terial, having three small viewports and external 
radio antennas. It contained a life-support system, 
radios, simple instrumentation, and an ejection seat. 
The 7-ft (2-m) sphere of the cabin was attached to 
a service module resembling two truncated cones 
base to base, with a ring of gas pressure bottles on 
the upper cone close to the cabin. The module con- 
tained chemical batteries, orientation rockets, the 
main retropropulsion system, and additional support 
equipment. 

In all, six Vostok cosmonauts were launched from 
1961 to 1963, including the first woman in space, 
V. V. Tereshkova (Vostok 6). The launch vehicle was 
a two-stage rocket with four additional strap-on 
booster units surrounding the first stage, forming 
a pyramidlike assemblage. All five propulsion units, 
feeding into 20 rocket nozzles, were started on the 
launch pad; later, the four strap-on units fell away, 





Fig. 3. Vostok, a first-generation Soviet spacecraft. (NASA) 


leaving the core unit to continue burning for a time. 
After its burnout, the single-engine second stage 
took over, inserting the payload into the target orbit. 
Payload capability to orbit was about 10,400 Ib 
(4750 kg). 

Voskhod. The second-generation Voskhod, essen- 
tially a greatly modified Vostok (for example, the 
heavy ejection-seat system was removed), was a 
short-duration multiperson craft and was designed 
to permit extravehicular activity or spacewalking by 
one of the crew. The first three-member crew flew 
on Voskhod 1, while on the double-seater Voskhbod 2 
the place of the third seat was used for an expand- 
able airlock to permit A. A. Leonov on March 18, 
1965, to egress into space without evacuating the 
main cabin of air, making him the first human to per- 
form extravehicular activity. The two Voskhbod space- 
craft were launched in 1964-1965 by an improved 
version of the Vostok launch vehicle, fitted with a 
new upper stage, that had the capability to lift the 
heavier Voskbods and—upon further uprating—the 
subsequent Soyuz vehicles. 

Soyuz. The Soyuz design is much heavier, larger, 
and more advanced in its orbital systems, permit- 
ting extended orbital stay times. It consists of three 
main sections or modules: a descent vehicle, an or- 
bital module, and an instrument-assembly module 
(Fig. 4), with a total habitable volume of about 
350 ft? (10 m>). The three elements are joined 
together, with the descent vehicle in the middle. 
Shortly before atmospheric reentry, the two outer 


modules are jettisoned. The spacecraft has a total 
length of 24.5 ft (7.5 m), and the span of its solar 
panel wings is 27.5 ft (8.4 m). 

Descent vehicle. The descent vehicle is intended for 
crew location during launch, orbital flight, descent, 
and landing. Roughly bell-shaped, it is covered with 
an ablative thermal protection system and designed 
for lifting reentry, permitting a controlled descent 
with the blunt end, at an angle of attack, facing for- 
ward. There are two portholes on its side, and one 
opening with a protruding periscope sighting device 
for the navigation system. All the control organs and 
most of the displays are located in this module. 

Orbital module. The forward orbital module, about 
7 ft (2 m) in diameter, is used by the crew as scien- 
tific laboratory and living quarters for orbital work, 
rest, and recreation. Consisting of two hemispheres 
joined by a cylindrical insert, it contains a “sofa” 
and folding table, a “sideboard,” a control panel 
with instrumentation and equipment of the main 
life support system, and scientific and photographic 
equipment. A docking system is located at the front 
end of the orbital module. Originally of the probe- 
and-socket design, like the United States system on 
Apollo, for the American-Soviet Apollo-Soyuz Test 
Project (ASTP), this system was replaced by the pe- 
ripheral “international” (androgynous) docking sys- 
tem designed by NASA and Soviet engineers. 

Launch vehicle. The Soyuz launch vehicle (Fig. 5), 
capable of placing 15,000 Ib (6800 kg) in orbit, has 
a total height of 161.7 ft (49.3 m), with 33.8 ft 
(10.3 m) maximum diameter. Its first stage, assisted 
by the four strap-on units, has a total vacuum thrust 
of 504 tons (4.47 x 10° newtons), generated by five 
engines with a total of 20 nozzles. In addition, there 
are 12 steering nozzles. The side units have a length 
of 62 ft (19 m), the central unit 92 ft (28 m). The sec- 
ond stage is 26 ft (8 m) in length and 8.5 ft (2.6 m) 
in diameter; its four-nozzle engine creates 30 tons 
(2.7 x 10° N) of thrust. Total lift-off weight of the 
Soyuz is about 300 metric tons. See ROCKET PROPUL- 
SION; ROCKET STAGING. 

Flight history. Since its first, ill-fated flight in April 
1967, resulting in the death of cosmonaut V. M. 
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Fig. 4. Soyuz, a third-generation Soviet spacecraft. 1m = 3.3 ft. 
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Fig. 5. Soyuz launch vehicle SL-4. (NASA) 























Komarov because of failure of the parachute land- 
ing system, the Soyuz spacecraft has served as the 
mainstay of the Soviet/Russian crewed space pro- 
gram in many functions, for example, as a develop- 
ment test bed of autonomous navigation, guidance, 
maneuvering, and rendezvous procedures; research 
laboratory for biomedical and other scientific and 
technical studies; and logistics vehicle for the Salyut 
space stations. 

Soyuz spacecraft carried originally up to three cos- 
monauts. However, when an accidental explosive 
decompression of the descent cabin during reentry 
caused the death (by pulmonary embolisms) of the 
Soyuz 11 crew—G. T. Dobrovolskiy, V. N. Volkov, 
and V. I. Patsayev—on June 30, 1971, the wearing 
of pressure suits during launch, ascent to orbit, de- 
scent to Earth, and other critical maneuvers was rein- 
troduced for safety. The third seat was removed to 
make room for the necessary additional “suit loop” 
of the life-support system and the emergency air sup- 
ply system, and the Soyuz thereafter carried only two 
cosmonauts. An improved version, the Soyuz T (for 
“Transport”) was introduced in 1980. A new version, 
the Soyuz-TM, with extended mission duration and 
new subsystems, replaced this vehicle in February 
1987, after a crewless test flight in May 1986. (The 
third seat was reintroduced with Soyuz TM-24 in 
August 1996.) In October 2002, the Soyzuz TM was, 
in turn, replaced by the Soyuz TMA, with modifi- 
cations financed by NASA for improved safety and 
widened crewmember size range. 

By December 31, 2006, there had been 94 crewed 
Soyuz missions (up to and including TMA-9) with 
229 cosmonauts (some of them on more than one 
flight, and including foreign guest cosmonauts) and 
two aborted launch attempts (Soyuz 18A and an at- 
tempt on September 26, 1983). Thirty-two of these 
flights docked with, and transferred crews to, Salyut 
space stations, 49 with the Mir space station, and 13 
with the International Space Station USS). The first 


mission to the JSS carried the first resident crew and 
the second carried the first “space tourist,” Dennis 
Tito. The Soyuz TMA craft have served as lifeboats, 
providing assured crew return capability to the JSS, 
and from April 2003 through September 2006 they 
shouldered the burden of providing crew rotation 
for the station, after the loss of the shuttle Columbia 
temporarily brought shuttle operations to a stand- 
still. 

Salyut. These orbital stations were 20 m (5 ft) 
long and 4 m (13 ft) wide, and weighed 19 tons. Be- 
tween 1971 and 1981, five of them operated in low 
Earth orbit successively for some time. By the end of 
that period, the most successful of them, Salyut 6, 
had seen seven host (long-stay) crews and nine 
visiting crews, including eight non-Soviet cosmo- 
nauts (from Czechoslovakia, Poland, East Germany, 
Hungary, Vietnam, Bulgaria, Cuba, Mongolia, and 
Romania). Also during that period, Salyut 6 was re- 
supplied by 12 Progress crewless automatic cargo 
carriers. 

Salyut 6 was the first of a “second generation” of 
Salyut space stations. Unlike its predecessors, it had 
two docking ports, instead of only one. This enabled 
it to receive visiting crews and resupply ships. To- 
gether with other new on-board systems, this feature 
was the key to missions of considerably extended du- 
ration. 

Its successor, Salyut 7, was launched on April 19, 
1982. By the end of 1985, Salyut 7 had seen five long- 
stay host crews and four visiting crews, which in- 
cluded the first cosmonauts from India and France as 
well as two flights of Svetlana Savitskaya, the second 
woman in space and the first female spacewalker. 
After serious systems problems arose in 1985, the 
crew of Soyuz T13 performed extensive repair and 
replacement work on the station and saved it from 
becoming inoperative. With the flights of Soyuz T13 
and T14, Salyut 7 saw the first successful space sta- 
tion crew rotation, although the mission of the T14 
crew had to be abruptly terminated because of ill- 
ness of one of the cosmonauts. The crew of Soyuz 
T15 conducted the first transfer between two space 
stations when they visited Salyut 7 in May and June 
1986 from the new permanent orbital facility, Mir. 

Mir. On February 19, 1986, the Soviet Union 
launched the core vehicle in its new Mir (Peace) 
space station complex series. This complex repre- 
sented a new-generation space station which evolved 
from the Salyut and Kosmos series vehicles. The 
core, an advanced version of Salyut, had six dock- 
ing ports, a large number of portholes, and was 43 ft 
(13.13 m) long and 13.6 ft (4.15 m) wide, with a mass 
of 21 metric tons (23 tons). It consisted of four sec- 
tions: transfer section [8.2 ft (2.2 m) diameter], work 
section [10.3 ft 3.15 m)], intermediate section, and 
equipment section [both 13.6 ft (4.15 m)], the lat- 
ter containing two rocket engines of 66 Ibf (294 N) 
thrust each. Its two winglike solar panels were larger 
than those of Salyut 7: 818 ft? (76 m?) versus 441 ft? 
(41 m?), with a wingspan of 97.5 ft (29.73 m). 

The core could be expanded by individual labora- 
tory modules launched separately and connected to 


its various docking ports. These modules were usu- 
ally dedicated to different scientific and technical dis- 
ciplines or functions. These included technological 
production with a shop, astrophysics, biological re- 
search, and medical. Launch of the first block, the ser- 
vice module Kvant 2, took place on November 26, 
1989. The second block, Kristall, launched on 
May 31, 1990, became the third major addition to 
Mir. Nearly identical to Kvant 2, this building block 
was equipped with furnaces for materials experi- 
ments in space and a docking port for the origi- 
nally planned Buran shuttle, later abandoned. The 
third module, Spektr, was launched on May 20, 1995; 
and the fourth module, Priroda, was launched on 
April 23, 1996. 

After launch, Mir’s laboratory modules were 
docked initially to the axial position on the multiport 
docking fixture on the forward end of the station. 
They then were transferred by means of their own 
articulated robot arm to one of the four outward- 
facing ports on the docking fixture of the core mod- 
ule, forming a spokelike array around the core. 

The station complex had about 23 kW power, 
generated by solar-cell arrays, and about 10,600 ft? 
G00 m°) of pressurized (habitable) volume. Maxi- 
mum crew size, for relatively short durations, was 
six persons, limited by the number of triple-crewed 
Soyuz-TM crew transport-escape vehicles that could 
be docked to the station. With the launch of Mir, the 
Soviet Union had had eight different stations in Earth 
orbit, including one failed attempt (Salyut 2 in April 
1973). 

The first Mir crew, Leonid Kizim and Vladimir 
Solovyev, was launched on board Soyuz T15 on 
March 13, 1986. Docking and transfer to the station 
took place about 49 h later on March 15. More crews 
followed in succession, keeping the station continu- 
ously occupied until April 1989, when the crew of 
Soyuz TM-7 left Mir for a crewless period until the ar- 
rival of TM-8 in September 1989. By the time Mir was 
decommissioned and deorbited on March 23, 2001, 
having been in operation for 15 years, the station had 
been visited by 39 crews, totaling 88 cosmonauts, 
including seven times by the space shuttle Atlantis 
and once each by Endeavour and Discovery. Dur- 
ing this time, Mir occupants set new marks three 
times for the longest human stay in space: 326 days 
by Yuri Romanenko in 1987, followed by 366 days 
by Vladimir Titov and Musa Manarov in 1988, and 
437 days by Valery Polyakov in 1944-1995. In its 
later years, Mir required increasing maintenance and 
repairs by its crews, particularly after two serious 
emergencies in 1997: an on-board fire, caused by 
a backup oxygen generator device, and a collision 
of the automated Progress M-34 freighter with Mir’s 
Spektr module during a docking exercise. When U.S. 
support of Mir ended at the conclusion of the joint 
Shuttle/Mir program (ISS Phase 1), further crewed 
operations of the space station were increasingly 
difficult and impractical for the Russian Space 
Agency. The last-best-one crew—cosmonauts Sergei 
Avdeev and Viktor Afanasyev, and French guest 
cosmonaut Jean-Pierre Haigneré—returned to Earth 


on August 27, 1999, in Soyuz TM-29. 

Most of Mir’s on-board systems were turned off on 
September 8. The station entered, for the first time in 
10 years, a period of crewless dormancy, and prepa- 
rations began for the possibility of a final decision by 
its Russian owners to terminate it with a controlled 
reentry into the Earth’s atmosphere in early spring 
2000. Mir gained another few months of life with 
the arrival of the final crew, Sergei Zalyotin and 
Alexander Kaleri in Soyuz TM-30, who prepared it 
for a flight extension. However, a deal sought by a 
commercial firm, MirCorp, fell through, and Mir’s 
end came on March 23, 2001, when it was deorbited 
and splashed into the South Pacific Ocean. See SPACE 
STATION. 

Energia. With the successful first test flight, from 
Baikonur, of the powerful expendable heavy-lift 
launcher Energia on May 15, 1987, evenifits payload 
did not reach orbit, the Soviet Union gained a tremen- 
dous new capability in launching large and heavy 
satellites, space station components, interplanetary 
spacecraft, and even the new Soviet space shuttle. 
The launcher has two stages, in parallel arrangement, 
and the payload is mounted not on top but on the side 
of the second core stage. Four liquid-fueled boost- 
ers (kerosine and liquid oxygen) make up the first 
stage. They are “strapped” to the cryogenic core 
stage, which is powered by four engines of 148 met- 
ric tons (1.45 meganewtons) thrust each (liquid hy- 
drogen and liquid oxygen). With a total thrust at 
liftoff of about 3000 metric tons (30 MN) and a liftoff 
mass of 2650 tons (2400 metric tons), a height of 
only 197 ft (0 m), and a largest diameter of 65.6 ft 
(20 m), the Energia can carry 110 tons (100 metric 
tons) to Earth orbit or inject 20 tons (18 metric tons) 
to geostationary orbit, 35 tons (32 metric tons) to the 
Moon, and up to 31 tons (28 metric tons) to Venus 
and Mars. 

With its second, and last, flight on November 15, 
1988, the Energia launched the Soviet space shuttle 
Buran. 

Buran. Described as the first of a fleet of Soviet 
shuttles, the space glider Buran (Blizzard) accom- 
plished its first orbital test flight on November 15, 
1988. The test mission was flown automated, that 
is, without crew, and lasted 3 h 25 min. The flight, 
while concluding with a return to the intended 
landing site, was only partially successful. It did not 
lead to further development efforts of this vehicle. 
The delta-winged Buran appeared to copy the major 
features of the United States shuttle, without, how- 
ever, its own main propulsion system (which was 
provided by the expendable Energia core stage). 
The main mission of the Buran program reportedly 
was to support the Soviet space station Mir and its 
successors. After the collapse of the Soviet Union, 
the Energia/Buran programs were terminated. 


European Program 


European efforts in space have long had the basic 
goal of achieving European space autonomy: the ca- 
pability of Europe to develop, launch, and operate 
its own satellites and space vehicles independent 
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of the space systems of other nations, including the 
United States. This goal has been accomplished for 
only crewless launches of commercial and scientific 
satellites. In order to conduct human space activities, 
European astronauts have made use of the United 
States space shuttle and the Soviet/Russian Soyuz 
launcher. 

European Space Agency (ESA). The multinational 
European space program is the responsibility of the 
European Space Agency, representing 13 member na- 
tions (Austria, Belgium, Denmark, France, Germany, 
Ireland, Italy, Netherlands, Norway, Spain, Sweden, 
Switzerland, and the United Kingdom, with Finland 
as associate member and Canada a cooperative na- 
tion). The ESA conducts operations at field installa- 
tions in the Netherlands, Germany, and Italy; track- 
ing stations in a number of countries; and its launch 
center at Kourou, French Guyana. 

Ariane 5. The growing weight of geostationary 
satellites and the economic advantage of launch- 
ing two satellites on one carrier led the European 
Space Agency to develop a more powerful version 
of the successful commercial expendable launcher 
Ariane. Ariane 5 is a 2'/,-stage launch vehicle, with 
a core stage (propelled by a cryogenic engine), two 
reusable solid boosters, similar to those of the United 
States space shuttle, and an upper stage. Devel- 
oped by the French space agency [Centre Nationale 
d’Etudes Spatiales (CNES)] for the European Space 
Agency, Ariane 5 is able to launch payloads of up to 
23 tons (21 metric tons) to low Earth orbit, and is 
also used for launching geostationary satellites. 

The ECA (enhanced capability) version of the 
Ariane 5, designed to lift 10 tons (9 metric tons) 
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to geostationary transfer orbit, enough for two big 
communications satellites at once, uses a new cryo- 
genic upper stage, an improved Vulcain 2 main stage 
engine, and solid boosters loaded with 10% more 
propellant. By December 31, 2006, the Ariane 5-G 
(generic) had flown 24 flights and the Ariane 5-ECA 
had flown 7 (after initial failures of both vehicles, in 
June 1996 and December 2002, respectively). 


United States Programs 


During the first decade after its inception in 1961, 
the United States crewed space program was con- 
ducted in three major phases—Mercury, Gemini, and 
Apollo. Each crewed flight led to increased knowl- 
edge of the systems and techniques needed to oper- 
ate successfully in space, and each phase represented 
a significant advancement. 

Development. Many differences in the three 
crewed United States spacecraft are readily appar- 
ent, such as size and weight (Fig. 6). The major 
differences, however, are in the complexity and 
refinement of subsystems. Apollo’s requirement for 
hardware “maturity” was significantly higher than for 
the earlier programs. Each subsystem became pro- 
gressively more complex, with many more demands 
made upon it and a correspondingly greater capa- 
bility. For example, only Apollo had its own guid- 
ance and navigation system. Another good exam- 
ple of increased system complexity was electrical 
power. For the Mercury craft, it was supplied by six 
batteries; Gemini required seven batteries and two 
chemical fuel-cell power plants. For Apollo, electri- 
cal power was supplied by five batteries and three 
fuel cells. 
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Fig. 6. Comparison of United States spacecraft and launch vehicle configurations through 1972. (NASA) 


Physically, the three systems do not appear greatly 
different, but in terms of operational demand they 
differed considerably. The Mercury electrical sys- 
tem had to supply power to sustain the 3000-lb 
(1360-kg) spacecraft and its single astronaut for 
1!/⁄ days. In Gemini, sufficient power had to be pro- 
vided to operate a 7000-Ib (3175-kg) craft contain- 
ing two astronauts for as long as 2 weeks, while the 
Apollo system was designed to support a 100,000-Ib 
(45-metric-ton) spacecraft carrying three persons for 
up to 2 weeks. 

Perhaps the greatest difference between Apollo 
and the earlier spacecraft was in reliability. Every- 
thing had to work and keep working no matter 
what the circumstances. Unlike the previous crewed 
space programs, in which the crew could return to 
Earth almost within minutes if an emergency arose, 
the Apollo crew needed as much as 6 days to get back 
from a position outbound toward the Moon from the 
Earth’s vicinity, requiring lunar circumnavigation to 
return safely to Earth. Such a situation occurred in 
the flight of Apollo 13 in 1970 (Table 1). 

Mercury. After Alan Shepard’s first ride on 
Mercury 3 atop a Redstone rocket, John Glenn in 
1962 became the first American to orbit the Earth. 
His vehicle, a Mercury, was in its basic characteris- 
tics similar to the Soviet Vostok, but it weighed only 
about a third as much, as necessitated by the smaller 
missiles of the United States at that time (the Red- 
stone and Atlas). 

The one-person Mercury capsules (Fig. 3) used 
ballistic reentry and were designed to answer the 
basic questions about humans in space: how they 
were affected by weightlessness, how they with- 
stood the gravitational forces of boost and entry, 
how well they could perform in space. The Mercury 
flights proved that humans not only could survive 
in space, but could also greatly increase humanity’s 
knowledge of the universe. Between 1961 and 1963, 
a total of six astronauts flew on Mercury spacecraft, 
the last flight (Gordon Cooper’s Faith 7) lasting a 
record 34 h. See WEIGHTLESSNESS. 

Gemini. The second United States step into space 
was the Gemini Program. With the two-person 
Gemini capsule (Fig. 7), for the first time a crewed 
spacecraft had been given operational maneuverabil- 
ity in space. Its reentry module flew a lifting reentry 
trajectory for precise landing point control (Fig. 1). 
In addition, its design permitted extravehicular 
activity by one of the crew. The goal of the Gem- 
ini missions was to find out how humans could ma- 
neuver themselves and their craft, and to increase 
knowledge about such things as celestial mechanics, 
space navigation, and rendezvous and docking, as re- 
quired later for the Apollo Program. All these extra 
capabilities required that the weight of the space- 
craft be increased by 130% over the Mercury. This 
in turn called for a more powerful launch vehicle, 
the Titan 2, which had its origin as an Air Force 
ICBM. 

Gemini has a record of 10 successful crewed 
flights between 1965 and 1966, with 20 astronauts 
and a total mission duration of 969 h 52 min in space. 
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Fig. 7. Gemini spacecraft. (NASA) 


It set many records, including the longest duration 
(almost 14 days, Gemini 7), the first rendezvous by 
two maneuverable spacecraft (Gemini 6 and 7), and 
the first docking (Gemini 8). Moreover, on June 3, 
1965, Gemini 4 crew member Edward White per- 
formed the first spacewalk by an American astro- 
naut, drifting and maneuvering in space for 21 min 
at the end of a 25-ft (7.5-m) umbilical/tether. On later 
Gemini missions, EVAs were performed by E. A. 
Cernan, M. Collins, R. E Gordon, and E. E. Aldrin, 
totaling 13 h 23 min, including open-hatch or stand- 
up EVAs (SEVAs). 

Apollo. The third-generation spacecraft, Apollo, 
was 82 ft (25 m) tall and had five distinct parts: 
the command module (CM), the service module 
(SM), the lunar module (LM), the launch escape sys- 
tem (LES), and the spacecraft/lunar module adapter 
(SLA). The three modules made up the basic 
spacecraft; the LES and SLA were jettisoned early in 
the mission after they had outlived their function. 

Command module. The command module served as 
the control center for the spacecraft and provided 
210 ft? (6 m>) of living and working quarters for 
the three-member crew for the entire flight, except 
for the period when two persons entered the lunar 
module for the descent to the Moon and return. The 
command module was the only part of the spacecraft 
that returned to Earth, flying a lifting trajectory with 
computer-steered maneuvers (Fig. 1). 

Resembling a cone of almost 10-ft (3-m) height 
and 12.7-ft (3.9-m) width, the command module con- 
sisted of two shells separated by a layer of insulation: 
an inner crew compartment (pressure vessel) and an 
outer heat shield. 

Service module. Attached to the command module 
at its heat shield end was the service module, a 
22.8-ft-tall (7-m), 12.7-ft-wide 3.9-cm) cylinder hous- 
ing the electrical power subsystem, most of the re- 
action control engines, part of the environmental 
control subsystem, and the service propulsion sub- 
system, including the main rocket engine for inser- 
tion into orbit around the Moon, for return from the 
Moon, and for course corrections. 

Lunar module. The lunar module (Fig. 8) carried two 
astronauts from the orbiting command/service mod- 


Space flight 


117 


118 


Space flight 


S-band steerable 
antenna 

















radar antenna 


alignment optical 
telescope 


S-band inflight t 
antenna (2) 


forward 
entrance hatch A 


Fig. 8. Apollo lunar module. (NASA) 


ule (CSM) down to the surface of the Moon, provided 
a base of operations there, and returned the two 
astronauts to a rendezvous with the command ser- 
vice module in orbit. It consisted of two compo- 
nents: the ascent stage (on top) and the descent 
stage. The latter had a descent engine and propel- 
lant tanks, landing gear assembly, a section for sci- 
entific equipment for use on the Moon, another sec- 
tion to transport a collapsed wheeled vehicle on the 
last three Apollo flights (the lunar rover), and extra 
oxygen, water, and helium tanks. The ascent stage 
contained the crew compartment with about 160 ft* 
(4.6 m*) of habitable volume, the ascent engine with 
propellant tanks, and all lunar module controls. Its 
subsystems were essentially the same kinds as those 
of the command module and service module. The 
lunar module measured about 23 ft (7 m) in height 
degs extended), weighed about 32,500 lb (14.6 met- 
ric tons), and was constructed of aluminum with ti- 
tanium fittings. 

Flight history. Including Earth-orbital flights by 
Apollo 7 and 9, a total of 11 crewed missions were 
launched in the Apollo Program between 1968 and 
1972 (Table 1). Of these, nine crews (27 astronauts) 
went to the Moon, with six landings (Fig. 9). Twelve 
astronauts walked on the Moon, accumulating al- 
most 300 h of lunar stay time and 166 person- 
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hours of surface exploration. They traversed almost 
60 mi (97 km) and brought about 850 Ib (385 kg) of 
rock and soil samples back to Earth. Approximately 
33,000 lunar photographs and 20,000 reels of mag- 
netic tapes of geophysical data were collected, caus- 
ing a major revolution in planetary science by provid- 
ing the first deep understanding of a celestial body 
other than Earth. See MOON. 

Lunar roving vehicle. On the last three lunar landings, 
Apollo 15-17, lunar exploration was supported by 
the lunar roving vehicle (LRV; Fig. 10). This vehi- 
cle weighed 460 Ib (209 kg) on Earth (77 lb or 35 kg 
Moon weight) and 1600 lb (725 kg) fully loaded. Mea- 
suring 122 in. G m) in length and 6 ft (2 m) in width, 
it could carry two astronauts in their space suits and 
backpack life support units, scientific equipment, 
and lunar samples. Each wheel was driven by its 
own separate traction drive assembly consisting of 
a '/-horsepower (0.2-kW) dc brush-type electric 
motor, harmonic drive reduction gear, and brake as- 
sembly. Power was provided by two nonrecharge- 
able 36-V silver-zinc batteries, each containing 
23 cells, having a capacity of 115 ampere-hours 
and weighing 59 lb (27 kg). Both rear and front 
wheels were steered by a hand-operated control stick 
through two separate electric motors on the axles. 
Turning radius was 122 in. 3.1 m). The lunar roving 
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Fig. 9. Astronaut Edwin Aldrin, Jr., stands next to the passive seismometer experiment on the surface of the Moon. On July 
20, 1969, Aldrin and Neil Armstrong became the first humans to land on the Moon. A television camera, American flag, laser 


reflector, and lunar module Eagle are behind Aldrin. (NASA) 


vehicle had a range of 36 mi (58 km), traveled at a 
maximum speed of 8.5 mi/h (13 km/h), and negoti- 
ated slopes as steep as 25. During its transport to the 
Moon, it was stowed in folded and collapsed condi- 
tion in the lunar module descent stage in a volume 
measuring 4.4 x 4.4 x 5.4 ft (1.3 x 1.3 x 1.6 m). 
After touchdown, semiautomatic deployment was 
accomplished by the astronauts’ pulling nylon tapes 
and releasing cables. By using the lunar roving vehi- 
cle, the astronauts were able to spend more time on 
the Moon’s surface during exploration. 

Extravehicular activity. Extravehicular activities be- 
came of paramount importance in Apollo. In addi- 
tion to the 12 astronauts who walked on the Moon, 
four crew members performed in-flight EVAs, with a 
total of 3 h 46 min. Counting the nine space exits by 
Gemini astronauts, a total of 41 individual EVAs had 
been accumulated by the end of the Apollo Program. 
To these, Skylab astronauts later added 19 more 
(10 occasions, Table 2). 

Launch vehicle. Prior to Apollo, the United States’ 
launch vehicle inventory included the Atlas-Agena 
B with 5000 Ib (2250 kg) Earth-orbital payload capa- 
bility, and the Titan 2 with 7800 lb (3500 kg). Clearly, 
a more powerful booster was required to lift the 
Apollo spacecraft to Earth orbit and thence to the 
Moon. 

At the Army Ballistic Missile Agency (ABMA) in 
1958, a team of engineers under Wernher von Braun 
set out to prove that vastly more powerful space 





Fig. 10. Lunar roving vehicle used in the Apollo 15 mission. (NASA) 
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TABLE 2. Skylab mission summary 
Crewed periods First Second Third Total 
Launch 5/25/73 7/28/73 11/16/73 
9:00 a.m. EDT 7:10a.m. EDT 9:01 a.m. EDT 
Splashdown 6/22/73 9/25/73 2/8/74 
9:49 a.m. EDT 6:19 p.m. EDT 11:17a.m. EDT 
Duration (day:h:min) 28:0:49 59:01:9 84:01:16 171:13:14 
Revolutions 404 858 1214 2476 
Distance, 10° mi (10° km) 11.5 (18.5) 24.5 (39.4) 34.5 (65.5) 70.5 (113.5) 
SEVA* 0:37 (6/25/73) 
EVA‘ 1 duration (h:min) 3:30 (6/7/73) 6:29 (8/6/73) 6:33 (11/22/73) 
EVA 2 duration 1:44 (6/19/73) 4:30 (8/24/73) 7:01 (12/25/73) 
EVA 3 duration 2:45 (9/22/73) 3:28 (12/29/73) 
EVA 4 duration 5:19 (2/3/74) 

Total EVA 5:51 13:44 22:21 41:56 
Solar observatory photos 30,242 77,600 75,000 182,842 
Earth resources photos 8,886 14,400 17,000 40,286 

“Standup (in spacecraft hatch) extravehicular activity. 

*Extravehicular activity (completely outside of spacecraft). 








rockets could be built from existing hardware by 
clustering engines and tanks. The project evolved 
into the Saturn Program of NASA. There were three 
types of launch vehicles in the Saturn family: the 
Saturn 1 which had a perfect record of ten suc- 
cessful flights, the Saturn 1B with eight successful 
flights, and the Saturn 5, which became the Apollo 
lunar launch vehicle and had the capability to lift 
250,000 Ib (113 metric tons) into low Earth orbit 
and to send 100,000 Ib (45 metric tons) to the 


Moon. The development of the Saturn 5 proceeded 
through three phases, with the two preceding 
launch vehicles as stepping stones. Saturn 1 used 
primarily modified existing equipment. Saturn 1B 
used a modified first stage of the Saturn 1 with 
uprated engines and a new second stage and instru- 
ment unit. Saturn 5 introduced new first and second 
stages, departing from the clustering approach, and 
used as its third stage and instrument unit the Saturn 


1B’s second stage and instrument unit (Table 3). 








TABLE 3. Saturn launch vehicles* 








Payload Engines, Propellants, Stage 
Launch vehicle Length, Weight, to low Diameter, number weight and, thrust,* 
and stage ft 103 Ib orbit, Ib ft and type type’ 10° Ib 
Saturn 1, block | 162.5 1120 (gross) No orbital 
(with payload) capability 
1st stage, S-1 81.6 103 (dry) = 21.4 8 H-1’s 875,000 Ib, RP-1/LOX 1504 s.l. 
2d stage, dummy 44 142 (ballast) 18.3 
Saturn 1, block II 187.5 1128 (gross) 24,000 
(with payload) 
1st stage, S-1 80.3 105.7 (dry) 21.4 8 H-1’s 875,000 Ib, RP-1/LOX 1504 s.l. 
2d stage, S-4 41.4 13.9 (dry) 18.3 6 RL10A-3’s 100,000 Ib, LH2/LOX 90 v. 
Instrument unit 2.8 3.1 21.4 
Saturn 1B 224 1313 (gross) 40,000 
(with spacecraft) 
1st stage, S-1B 80.3 84.4 (dry) 21.4 8 H-1’s 885,000 Ib, RP-1/LOX 1650 s.l. 
2d stage, S-4B 58.4 25 (dry) 2A 1J-2 230,000 Ib, LH2/LOX 200 v. 
Instrument unit 3 4.3 2i 
Saturn 5 363 6200 (gross) 270,000 
(with spacecraft) 
1st stage, S-1C 138 303 (dry) 33 5 F-1’s 4,578,000 Ib, RP-1/LOX 7600 s.l. 
2d stage, S-2 81.5 95 (dry) 33 5 J-2’s 942,000 Ib, LH2/LOX 1160 v. 
3d stage, S-4B 58.13 33.6 (dry) ane 1 J-2 232,000 Ib, LH2/LOX 230 v. 
Instrument unit 3 4.3 21.7 





141 ft = 0.3 m. 1 Ib = 0.45 kg. 1 Ibf = 4.4 newtons. 
TRP-1 = type of kerosine, LOX = liquid oxygen (—297°F, — 183°C), LH» = liquid hydiogen (—422.9°F,—252.8°C). 
łs.l. = thrust at sea level, v. = thrust in vacuum. 
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Fig. 11. Space station Skylab in orbit. The sunshade rigged 
by crew repair is visible. (NASA) 


In addition to its lunar mission in the Apollo pro- 
gram, the Saturn 5, in a two-stage version, served 
also—in its last and thirteenth flight—to launch the 
first United States space station, Skylab. 


Skylab 


The experimental space station Skylab (Fig. 11) was 
the largest object ever placed in space, and the first 
crewed project in the U.S. Space Program with the 
specific purpose of developing the utility of space 
flight in order to expand and enhance humanity’s 
well-being on Earth. To that end, Skylab’s equipment 
included Earth resources remote sensing instrument 
and the first crewed solar telescopes in space. A total 
of three crews of three astronauts each carried out 
experiments and observations on Skylab in a num- 
ber of different areas, among them observations of 
the Earth, solar astronomy, stellar astronomy, space 
physics, geophysics, biomedical studies, zero-gravity, 
biological studies, zero-gravity technology studies, 
and spacecraft environment. Originally begun in the 
early 1960s, under the names Apollo Extension Sup- 
port Study and Apollo Applications Program, Skylab 
emerged as a cluster of five major components: 
orbital workshop (OWS), airlock module (AM), multi- 
ple docking adapter (MDA), Apollo telescope mount 
(ATM), and command/service module (CSM). The 
CSM, adapted from the Apollo Program, served as 
crew carrier, launched by Saturn 1B’s. 

Skylab was launched crewless by Saturn 5 on May 


14, 1973 (Skylab 1), and suffered a structural fail- 
ure of its combined meteoroid/thermal shield dur- 
ing the first minute of flight which eliminated most 
of its electrical power supply. Subsequently visited 
by three astronaut crews for periods of 28, 59, and 
84 days (Tables 1 and 2), the space station under- 
went repair in orbit, a first for the space program, 
which proved that humans are eminently capable 
of performing heavy emergency repair, service, and 
maintenance operations in space, thereby setting the 
stage for the future of orbital operations by space 
shuttle crews. 

After the third crew had returned to Earth on 
February 8, 1974, the United States first space station 
remained dormant for 5 years while NASA worked on 
plans to save Skylab from orbital decay and reentry 
for future missions. The intention was given up in 
December 1978 when it became clear that the esti- 
mated decay rate of the orbit was approaching faster 
than anticipated and the likely date of space shuttle 
rescue mission was moving ahead. Under active con- 
trol by NASA, Skylab finally returned to Earth on July 
11, 1979, impacting in the Indian Ocean, with a few 
pieces landing in Australia. 


Apollo-Soyuz Test Project 


The last flight of an Apollo spacecraft occurred as 
the United States part in the world’s first interna- 
tional crewed space flight, the Apollo-Soyuz Test 
Project on July 15-24, 1975. During that time, three 
American astronauts and two Soviet cosmonauts 
linked up in space for a total joint period of 47 h 
17 min (Fig. 12). 

The ASTP mission was planned to accomplish 
spacecraft rendezvous, docking, undocking, and 
crew transfer between spacecraft of disparate techni- 
cal and management philosophies, the Soviet Soyuz 
and the United States Apollo, as well as interaction of 
spacecrafts crews. The successfully conducted mis- 
sion tested compatible rendezvous and docking sys- 
tems being developed for future United States and 
Soviet crewed spacecraft and space stations; in ad- 
dition, the crews practiced docking and undock- 
ing, conducted intravehicular crew transfer, and per- 
formed 31 scientific experiments and engineering 
investigations. 

Docking. The new compatible rendezvous and 
docking system provided the basis for a standardized 
international system for docking of crewed space- 
craft. Thus, it enhanced the safety of crewed flights 
in space and provided the opportunity for conduct- 
ing subsequent joint missions and experiments. 

To enable Apollo and Soyuz to dock, NASA de- 
veloped and constructed a third element, the dock- 
ing module (DM), that also served both crews as an 
airlock and transfer tunnel between the spacecraft 
(Fig. 13). It was 10 ft (3.2 m) long and 4.6 ft (1.4 m) 
wide, and weighed 4660 Ib (2097 kg). It held two 
suited crew members and was attached to the for- 
ward end of Apollo. On the Soyuz side, it used the 
compatible docking system designed by NASA and 
Soviet engineers (Fig. 14). This docking system was 
of the androgynous type, which uses the principle of 
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Fig. 12. An artist’s concept of the first international “handshake” in space, by the American and Soviet crews of the 
Apollo-Soyuz Test Project mission, July 17, 1975. (NASA) 
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Fig. 13. Apollo and Soyuz spacecraft in docked configuration during the Apollo-Soyuz Test Project mission with docking 
module serving as connecting link and airlock. 1 m = 3.3 ft. (NASA) 





Fig. 14. Universal docking system, as used on Apollo- 
Soyuz Test Project (American version), suspended for 
testing. (NASA) 


reverse symmetry to allow the mechanical linkage of 
otherwise dissimilar spaceships. Docking is usually 
performed by one (active) spacecraft on the other 
(passive) partner. 

Since the Soyuz internal atmosphere consisted 
of nitrogen-oxygen at sea-level pressure as opposed 
to Apollo’s pure oxygen environment at 5 Ib/in.? 
(34.5 kPa), the docking module was required to serve 
as an air lock. To reduce lengthy waiting periods for 
denitrogenation, the Soyuz pressure was reduced to 
about 10 Ib/in.? (69 kPa) before docking. During the 
mission itself, four transfers between spacecraft were 
accomplished by crew members of both nations. 

Spacecraft modifications. To fit the mission needs, 
several modifications were made to both spacecraft. 
For Apollo, they included an increased number of 
propellant tanks for the reaction control system, 
added equipment to operate the docking module and 
the American-Soviet rendezvous and docking sys- 
tems, and provisions for scientific and technical ex- 
periments. Among the major modifications of Soyuz 
were the new docking mechanism, additional com- 
munications equipment to accommodate the United 
States ultrahigh frequency of 296 MHz, a transponder 
(a combined receiver-transmitter that beams a sig- 
nal when triggered by another radio signal) to allow 
Apollo to calculate distance during rendezvous, and 
alignment devices to aid Apollo in performing its ac- 
tive docking role. 

Accomplishments. In addition to the rendezvous 
and docking test, the ASTP demonstrated that pre- 
cision launch of complex vehicles from widely sepa- 
rated sites to meet the trajectory alignment require- 
ments for a joint space mission was feasible, and that 
hardware, operating, and language differences can 
be surmounted for joint international space efforts, 
with enough planning and training in advance. 


Space Transportation System 


After the end of the joint American-Soviet space mis- 
sion, the United States crewed space flight program 
had reached the end of a pioneering era of capa- 
bility development. Attention began to focus on the 
routine application of newly acquired know-how sys- 
tems and experience, specifically in the form of the 
emerging Space Transportation System (STS), with 
the ability to transport inexpensively a variety of 
useful payloads to orbit, as the mainstay and “work 
horse” of the United States space program in the 
1980s and 1990s. Its design is aimed at reducing the 
cost and increasing the effectiveness of using space 
for commercial, scientific, and defense needs. 

The two major components of the Space Trans- 
portation System are the space shuttle and the Space- 
lab. This system is complemented by a number of 
upper stages depending on the mission beyond Earth 
orbit, for example, the Payload Assist Module (PAM), 
the Transfer Orbit Stage (TOS), and the Inertial 
Upper Stage. 

Space shuttle. The space shuttle (Fig. 15), as a 
reusable system, has the capability to conduct space 
missions in response to national and worldwide 
needs and the flexibility to respond to policy, 
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Fig. 15. Space shuttle on its crawler/transporter/launch platform. (NASA) 
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Fig. 16. Space shuttle vehicle. 1m = 3.3 ft. (NASA) 


discovery, innovation, and newly emerging needs. 
The shuttle provides extended capability in the areas 
of (1) easy access to and from low Earth orbit for 
nonastronaut passengers, (2) multipurpose reusable 
transportation, (3) standard equipment for a variety 
of spacecraft, (4) standard equipment for a variety 
of missions, (5) modularized subsystems for easy re- 
furbishment, (6) refurbishable spacecraft/payloads 
in orbit, and (7) a technology base for more flexi- 
ble future missions. The space shuttle flight system 
(Fig. 16) has three major elements: the orbiter, an 
external tank (ET) containing the liquid propellants 
to be used by the orbiter main engines for ascent, 
and two solid-propellant rocket boosters (SRBs) 
“strapped on” to the external tank. The orbiter and 
solid rocket booster casings are reusable; the ex- 
ternal tank is expended on each launch. The total 
weight of the system at launch is 4,406,000 Ib 
(1,998,500 kg). 

Orbiter and external tank. The orbiter contains the 
crew and payloads. In size and weight (150,000 Ib 
or 68,000 kg when empty of consumables) compa- 
rable to a DC-9 jet transport, it delivers payloads of 
65,000 Ib (29,500 kg) with lengths to 80 ft (18 m) and 
widths to 15 ft (4.5 m). The crew compartment ac- 
commodates seven crew members and passengers 
for some missions, but as many as ten persons in 
emergency operations. 

The three main propulsion rocket engines in the 
aft fuselage of the orbiter, with a vacuum thrust of 
470,000 Ibf (2,090,000 N) each, use liquid hydro- 
gen and liquid oxygen from the external tank, which 
is jettisoned shortly before orbit insertion after ap- 
proximately 8 min of burn time. The orbiter also 
carries 46 auxiliary engines with a total vacuum 
thrust of 45,200 Ibf (201,000 N). The external tank 
has a length of 153.7 ft (46.85 m) and a diameter 
of 331.0 in. (8.41 m). Its weight is 1,619,000 Ib 
(734,400 kg) at launch and 69,000 Ib (31,300 kg) 
when empty of fuel. 

Solid rocket boosters. The two solid rocket boosters 
burn in parallel with the main engines of the orbiter 
to provide initial ascent thrust. Primary elements of 


each solid rocket booster are the motor, including 
casing, propellant, igniter and nozzle, forward and 
aft structures, separation and recovery systems, and 
thrust vector control subsystems. Each solid rocket 
booster has a length of 149.1 ft (45.45 m), a diam- 
eter of 146 in. @.71 m), a weight of 1,265,000 Ib 
(573,800 kg) at launch and 182,000 Ib (82,600 kg) 
after the fuel has been expended, and a thrust of 
2,700,000 Ibf (12,000,000 N) at launch. The solid 
propellant grain is shaped to reduce thrust some 
time after liftoff to prevent overstressing the vehi- 
cle during the period of maximum dynamic pres- 
sure. The solid rocket boosters separate after about 
2 min of flight from the external tank at an altitude of 
30 mi (50 km) by pyrotechnic devices and eight small 
rockets; they descend on a system of parachutes and 
splash in the ocean about 170 mi (274 km) from 
the launch site—Kennedy Space Center in Florida or 
Vandenberg Air Force Base in California—to be re- 
trieved for reuse. 

Testing and flight. The first shuttle orbiter, named En- 
terprise, was rolled out at Palmdale, California, on 
September 17, 1976. Used for ground tests and ap- 
proach and landing flights tests during 1977 and 
1978, the Enterprise was superseded by the first 
space mission orbiter, Columbia, for the first orbital 
flight on April 12-14, 1981, piloted by astronauts 
John Young and Robert Crippen (Fig. 17). Three 
additional shuttles—Challenger, Discovery, and 
Atlantis—were added subsequently, so that by the 
end of 1985 the total shuttle fleet consisted of four 
vehicles. Planning for the construction of a possible 
fifth orbiter was held in abeyance pending a better 
understanding of future launch-frequency require- 
ments. 

By mid-January 1986, 24 shuttle missions to low 
Earth orbits has been conducted (up to and includ- 
ing 61-C), all of them successfully, although a num- 
ber of problems had to be overcome in the course 
of the program (Table 1). Highlights of these flights 
included the retrieval of two errant communications 
satellites (on STS 51-A), the in-space repair of two 
satellites (on STS 41-C and STS 51-1), the first un- 
tethered extravehicular activity of a human with 
an autonomous maneuvering system (on STS 41-B; 
Fig. 18), assembly of two large space structures 
(STS 61-B), and three flights of the space laboratory 
Spacelab (STS 9, STS 51-B, and STS 61-A). 

With the launch of the twenty-fifth shuttle mission, 
51-L, on January 28, 1986, however, tragedy struck 
the American space program. At approximately 
11:40 a.m. EST, 73 s after liftoff, the flight of Chal- 
lenger, on its tenth mission, abruptly ended in an ex- 
plosion triggered by a leak in the right solid rocket 
booster, killing the seven-member crew. 

The loss of Challenger reduced NASA’s shuttle 
fleet by one-fourth. An independent commission 
to investigate the causes of the 5/-Z accident was 
established. Continuation of the United States 
crewed space program was suspended pending a 
thorough reassessment of flight safety issues and 
implementation of necessary improvements. The re- 
covery period lasted 32 months (compared to 18 


Fig. 17. Launch of the space shuttle Columbia on its first 
orbital flight, April 12, 1981. (NASA) 


months after the Apollo 7 fire in 1967 which killed 
three astronauts). During this time, all shuttle sys- 
tems, operations, and its entire management were 
subjected to meticulous reviews and modifications, 
overviewed by outside expert committees. The solid 
booster joints, whose inherent design weaknesses 
had caused the explosion of the Challenger, were 
redesigned with 73 changes and tested thoroughly. 
The orbiter itself, in the process, underwent 228 
modifications, not counting 100 software changes. 
The main propulsion system (63 changes) and even 
the external tank were improved, along with the 
assembly and launch facilities, as well as all pre- 
flight procedures. In July 1987, NASA ordered a new 
orbiter as Challenger’s replacement, called Endeav- 
our. 

Shuttle flight operations resumed on September 
29, 1988, with the long-awaited launch of Discov- 
ery on the mission STS 26 to place a Tracking and 
Data Relay Satellite (TDRS) in orbit. Subsequent 
flights followed with increasing confidence in the 
new systems and operations. At the end of 1989, 
after seven highly successful missions since the Chal- 
lenger accident, NASA was ready to resume full-scale 
space shuttle operations, signaling a total recovery 
from the accident. 





Highlights of post-Challenger missions included 
the launch of the interplanetary research probes 
Magellan to Venus (on STS 30) and Galileo to Jupiter 
[on STS 34, the retrieval of the 10.5-metric-ton 
(11.5-ton) research satellite Long Duration Exposure 
Facility (LDEF) [on STS 32], the placement in orbit of 
the Hubble Space Telescope (on STS 31), the launch 
of the European solar probe Ulysses (on STS 41), 
and the intricate repair and maintenance missions to 
the Hubble telescope (on STS 61, 82, 103, and 109). 
On STS 60, in February 1994 the first Russian cosmo- 
naut, Sergei Krikalev, participated as a crew member, 
and in March 1995 Endeavour, on mission STS 67, es- 
tablished a long-duration record for the space shuttle 
program of 16 days 15 h 9 min. On STS 77, in June 
1995, 20 years after the Apollo-Soyuz Test Project, 
Atlantis carried out the first of a series of dockings 
with the Russian Mir space station (Fig. 19), form- 
ing the largest human-made structure in space so far. 
Joint activities were carried out and crew members 
exchanged between the two spacecraft before sep- 
aration. Eight more dockings between space shut- 
tles and Mir were conducted in 1995-1998 as Phase 
1 of the JSS program, with seven U.S. astronauts 
stationed for extended durations on the Russian 
facility. Since 2000, most of the shuttle missions have 
been in support of the assembly and operation of the 
International Space Station. 

On February 1, 2003, after completing a success- 
ful research mission (STS 107), Columbia was lost 
with its crew during reentry. The Columbia Accident 
Investigation Board, chaired by Admiral (ret.) Harold 
W. Gehman, Jr., concluded that, unbeknown to crew 





Fig. 18. Astronaut Bruce McCandless performing first free 
(untethered) extravehicular activity with an autonomous 
maneuvering system during shuttle flight STS 41-B on 
February 7, 1984. 
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Fig. 19. Space shuttle Atlantis docked to the Kristall module on the Russian space station Mir, photographed by cosmonant 
Nikelai Budarin on July 4, 1995. (NASA) 


and ground, one of the left wing’s leading-edge re- 
inforced carbon-carbon (RCC) elements had been 
punctured during ascent to orbit by a “suitcase-sized” 
chunk of foam insulation, blown off the external tank 
by the supersonic airstream, hitting the leading edge 
and rendering the wing unable to withstand reentry 
longer than about 8 min after entry interface. Shuttle 
operations were halted as NASA and its contractors 
labored on intensive return-to-flight (RTF) efforts. 
The space shuttle returned to the skies on July 26, 
2005, with the liftoff of Discovery on mission STS 
114 to the JSS. The flight was successful but engi- 
neers were concerned about the shedding of some 
insulating foam material off the external tank. Prior 
to its final approach to the station, Discovery per- 
formed a maneuver to allow digital imagery of its 


thermal protection system from the JSS, and during 
docking crewmembers performed a first-of-its-kind 
spacewalking heat shield repair. With more work re- 
quired on redesign of the foam-based external tank 
insulation, new sensors for detailed damage inspec- 
tion, and a boom to allow astronauts to inspect the 
vehicle externally during flight, the second RTF test 
flight was delayed for nearly a year. However, with 
test flights in July and September 2006 and an addi- 
tional flight in December, the shuttle fleet resumed 
its operations. 

As of December 31, 2006, the four U. S. shuttles, 
in over 24 years of operation, have flown a total of 
117 missions, carrying 597 men and 101 women into 
space (including the lost crews of seven on STS 5 1- 
L/Challenger and STS 107/Columbia). Participating 


in these flights were 81 crew members from 16 for- 
eign nations. See SPACE SHUTTLE. 

Spacelab. The Spacelab was a major adjunct of 
the shuttle. Developed and funded by member na- 
tions of the European Space Agency, the large pres- 
surized Spacelab module with an external equip- 
ment pallet was designed to be the most important 
payload carrier during the space shuttle era. With 
its large transport capacity Gin weight, volume, and 
power supply), the Spacelab was intended to sup- 
port a wide spectrum of missions in science, appli- 
cations, and technology by providing versatile and 
economical laboratory and observation facilities in 
space for many users. Another objective was to re- 
duce significantly the time from experiment concept 
to experiment result, compared with previous space 
practice, and also to reduce the cost of space ex- 
perimentation. It allowed the direct participation of 
qualified men and women scientists and engineers 
to operate their own equipment in orbit. Primary 
uses of the Spacelab were identified in the areas of 
process engineering and materials research, biomed- 
ical research, Earth observations and meteorology, 
communication and navigation, space flight technol- 
ogy, astrophysics, astronomy, solar physics, energy 
research, atmospheric and space physics, and basic 
chemical and physical sciences. 

The Spacelab design consisted of three elements: 
(1) a pressurized module which could be assem- 
bled from modular sections in two sizes—a short 
version for missions requiring only minimal supple- 
mental subsystems in a “shirt-sleeve” environment, 
and a longer version for more extensive experiments; 
(2) a pallet or instrument mounting platform to sup- 
port sensors and antennas requiring vacuum and 
uninterrupted space exposure; and (3) support 
equipment such as air locks, viewports, storage 
vaults, and pointing systems. Depending on assem- 
bled size, the Spacelab was designed to have a total 
pressurized volume of 880-3500 ft? (25-100 mô). 
In some missions, a pallet-only mode could be flown. 
Spacelab’s average power use was designed to be in 
the 3-6-kW range. Orbiting the Earth at 115-445 mi 
(185-555 km) altitude and 28-104° inclination on 
missions lasting 7-30 days, the Spacelab was de- 
signed to remain firmly attached inside the open 
shuttle cargo bay. A connecting tunnel was designed 
to provide access to and from the shuttle orbiter 
crew compartment. The laboratory was designed to 
provide working space for up to four payload spe- 
cialists, not counting the commander, pilot, and mis- 
sion specialist (flight engineer) or the space shuttle 
itself. 

The first flight of the European-built Spacelab, 
SI-1, took place November 28-December 8, 1983, 
on board the Columbia as the ninth shuttle mission. 
Its six-member crew included the first foreign astro- 
naut in the United States space program, the German 
mission specialist Ulf Merbold. SZ-17 and subsequent 
Spacelab flights, notably the first foreign chartered 
mission, the German D-/, but also the Japanese SI-/ 
on STS 47 and the second German Spacelab mission, 
D-2 on STS 55, proved the concept of the crewed 
space laboratory and confirmed the unique worth 


of humans in space to operate, repair, maintain, and 
troubleshoot scientific equipment and conduct com- 
plex research programs. Spacelab flew 16 missions 
before it was decommissioned in 1998. See SPACE 
BIOLOGY; SPACE PROCESSING. 


International Space Station 


Interest in the development of a permanent crewed 
platform in Earth orbit dates back to the very begin- 
nings of human space flight. While the practical real- 
ization of the concept was accomplished and proven 
by the Soviet Salyut/Mir and the American Skylab 
programs, the real breakthrough happened on 
January 29, 1998, when representatives of 16 nations 
(United States, Russia, Canada, Japan, the 11 nations 
of the European Space Agency (ESA), and Brazil) 
signed a partnership agreement for the joint devel- 
opment and operation of an International Space 
Station USS). With so many nations sharing in this 
unique megaprogram, the JSS has become a focal 
point of international understanding, cooperation, 
and collaboration, and thereby a strong catalyst for 
peace enhancement and conflict reduction. 

The goal of the JSS program is to establish a per- 
manent platform for humans to live and work in 
space in support of science and technology research, 
business, education, and exploration. Its objectives 
are to provide a world-class laboratory complex 
uniquely located in a microgravity and vacuum envi- 
ronment where long-term scientific research can be 
carried out to help fight diseases on Earth, unmask 
fundamental processes leading to new manufactur- 
ing processes and products to benefit life on Earth, 
and observe and understand the Earth’s environment 
and the universe. The station is expected to create 
new opportunities for the private sector for business 
development and entrepreneurial ventures. As a “vir- 
tual classroom” in space, the ISS will inspire and sup- 
port educators and students worldwide. As a testbed 
for studies of long-term effects of space on humans 
and for the development of new space-flight tech- 
nologies, it will also become an essential gateway to 
future human space exploration. 

When completed, the International Space Sta- 
tion will have a mass of about 1,040,000 Ib (470 met- 
ric tons). It will be 356 ft (109 m) across and 290 ft 
(88 m) long, with almost an acre (0.4 hectare) of 
solar panels to provide up to 110 kilowatts of power 
to six state-of-the-art laboratories. 

1998. The on-orbit assembly of the JSS began 
with the launch of the U.S.-owned, Russian-built 
Functional Cargo Block (FGB) Zarya (“Dawn”) on 
November 20, 1998, from the Baikonur Cosmo- 
drome in Kazakhstan on a Proton rocket. The sec- 
ond element, the U.S.-built Node (connecting mod- 
ule) Unity followed on the space shuttle Endeavour, 
launched on December 4, 1998, from the Kennedy 
Space Center, Florida. The shuttle crew attached 
Unity and Zarya during a 12-day mission, beginning 
the station’s orbital construction which will extend 
over 12 years until its completion in 2010. 

1999. Accomplishments during 1999 included the 
first crewed logistics-supply flight of a space shuttle 
to the early assembly in orbit in May and June; testing 
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of hardware and software for station elements to be 
launched in 2000 and 2001; successful implementa- 
tion of a newly developed Multi-Element Integrated 
Test (MEIT), which functionally connects the first 
several U.S. elements of the station on the ground 
in a series of integrated tests; the completion of the 
Mission Control Center and the Payload Operations 
Integration Center; and delivery of the U.S. labora- 
tory module Destiny and the airlock module to the 
launch site in Florida in preparation for MEIT, along 
with other components. Interface testing between 
the U.S. Mission Control Center in Houston and the 
Russian Mission Control Center in Moscow was suc- 
cessfully completed. 

2000. Among the key events in 2000 was the July 
12 launch of the service module Zvezda (Star), the 
first Russian-funded and owned element of the sta- 
tion, which linked up with the station on July 25. The 
docking, executed very gently at 0.6 ft/s (0.2 m/s) by 
Zarya as the active partner, marked the arrival of the 
first human living quarters and an initial command 
post for the station. Zvezda is also equipped with the 
propulsion system that will maintain the attitude and 
orbit of the space station. The station now spanned 
119 ft G6 m) in length, and its mass had grown to 
almost 60 tons (54 metric tons). On August 8 the 
automated Progress MI-3 docked with the station, 
carrying fuel for Zvezda’s propellant tanks, as well 
as equipment and supplies. 


On October 31, a Soyuz-U carrier lifted off from 
Baikonur and placed in orbit Soyuz TM-31, carry- 
ing the first resident crew for the space station. The 
Expedition One crew consisted of American Com- 
mander William Shepherd and veteran Russian cos- 
monauts Yuri Gidzenko and Sergei Krikalev. On 
November 2, the Soyuz craft docked with the sta- 
tion and the crew entered, to begin what is intended 
to be a new era of permanent human habitation in 
space. The Expedition One crew remained on the 
station for 136 days, until March 18, 2001. 

An Endeavour assembly mission on November 
30-December 11 installed two solar array wings, 
jointly spanning 240 ft (73 m), at the space station 
(Fig. 20). The array is now providing the station with 
up to 62 kW of power. 

2001. /SS assembly continued in 2001, notably 
with the installation of the United States Laboratory 
module Destiny on February 10. The 28 x 14 ft 
(8.5 x 4.3 m) laboratory was transported on Atlan- 
tis. The Canada-built Space Station Remote Manip- 
ulator System (SSRMS) and the U.S. airlock module 
Quest also arrived on space shuttles, and the Rus- 
sian Docking Compartment (DC-1) Pirs was carried 
on a Russian Soyuz-U rocket, Quest provided station 
residents with their own “door” to space for extrave- 
hicular activities (ŒŒVAs), and DC-1 gave the station 
a second airlock for spacewalks in Russian Orlan-M 
spacesuits. 





Fig. 20. International Space Station, photographed from the space shuttle Discovery on December 19, 2006. (NASA) 


In the first crew rotation, the Expedition Two 
crew, consisting of Russian commander Yuri 
Ussachev and American flight engineers James Voss 
and Susank Helms, replaced Expedition One in 
March. In August, Expedition Two was replaced by 
the Expedition Three crew, consisting of American 
commander Frank Culbertson and Russian cosmo- 
nauts Vladimir Dezhurov and Mikhail Tyurin. The 
crews in both rotations were transported on the 
space shuttle Discovery. Successive expedition 
crews have served on the JSS during stays of four 
to six months. 

2002. In April 2002, the first of eleven truss el- 
ements, SO (Starboard Zero), was attached on top 
of Destiny, becoming the centerpiece of the 109-m- 
long (356-ft) truss for carrying the solar cell arrays 
of the station. In June, a new station crew (Expe- 
dition 5) delivered the Mobile Base System (MBS), 
to provide mobility for the SSRMS. The second truss 
segment, S1, arrived in October and was attached 
to SO on the starboard side. Its counterpart on port, 
P1, followed in November and was also successfully 
mounted. 

2003-2005. Early in 2003, further progress in ISS 
assembly was brought to a halt by the standdown 
of the space shuttles after the Columbia loss. Crew 
rotation flights could be accomplished only by Rus- 
sian Soyuz vehicles. As an immediate consequence 
of the unavoidable reduction in resupply missions 
to the station, which now could be supported only 
by Russian crewless automated Progress cargo ships, 
station crew size was reduced from three to a two- 
person “caretaker” crew per expedition, except for 
brief stays by visiting researchers or commercial 
“tourists” arriving and departing on the third seat of 
Soyuz spacecraft. Operations procedures had to be 
revised accordingly, and such vital areas as onboard 
systems maintenance and spares provision had to be 
replanned carefully to continue crewed occupancy 
and a viable science research program on board de- 
spite the sudden constriction in logistics. 

2006. With the return to flight of the shuttle, ISS 
assembly resumed. In July the shuttle carried a third 
astronaut (Thomas Reiter) to the station to join Ex- 
pedition 13. In September, the P3/P4 truss structure 
was installed with a second set of solar arrays (of 
four), and the two solar array wings and a huge ra- 
diator panel were deployed from the P4 truss. In 
December, the P5 truss segment, serving as a spacer 
at the end of the P4 truss, was installed, and the 
electrical power and thermal control systems were 
reconfigured to their final configurations and acti- 
vated. 

As currently planned, the station will require a 
total of 80 assembly flights. Interspersed among 
these missions will be logistics missions by the shut- 
tle, Russian Soyuz flights to launch crews, and mul- 
tiple Progress tanker flights for refueling and supply- 
ing the growing structure in orbit. 


Vision for Space Exploration 


On January 14, 2004, U.S. President Bush announced 
the Vision for Space Exploration, a long-term plan 


for returning astronauts to the Moon to prepare for 
voyages to Mars and other planets in the solar system. 
The plan called for completion of the International 
Space Station and retirement of the space shuttle by 
2010, and the development of new spacecraft and 
launch systems. 

Project Constellation. To support this initiative, 
NASA is developing a family of spacecraft, launch- 
ers, and associated hardware, collectively known as 
Project Constellation. 

The Orion spacecraft will be shaped like an Apollo 
capsule but will be significantly larger. It will have a 
crew module shaped similarly to the Apollo com- 
mand module, capable of holding four to six crew 
members, and a cylindrical service module, contain- 
ing the spacecraft’s propulsion systems and supplies. 
It will be reusable for up to 10 flights, allowing the 
construction of a fleet of such spacecraft. Plans call 
for it to be launched into low-earth orbit, either 
for solo flights or for missions to the International 
Space Station, using the proposed Ares I rocket, 
which would consist of a single solid rocket booster 
derived from those used in the space shuttle system, 
and a liquid-fueled second stage. 

Lunar flights would involve the use of a larger 
launch vehicle, the proposed Ares V, which would 
use five RS-68 engines (more powerful and less ex- 
pensive than the space shuttle main engines) and a 
pair of five-segment solid rocket boosters. (Alterna- 
tive plans have also been considered which would re- 
place the Ares I and V rockets with a single launcher, 
the Ares IV.) The Ares V would carry a combination 
of the Earth Departure Stage (EDS), an enlarged ver- 
sion of the S-IVB upper stage of the Saturn V rocket, 
and the Lunar Surface Access Module (LSAM). The 
Ares V would fly for about 8 minutes before separat- 
ing from the EDS/LSAM combination, which the EDS 
would then place into low-Earth orbit. The crewed 
Orion spacecraft would be launched separately on 
the Ares I at least two to four weeks later, and dock 
with the EDS/LSAM combination. After the systems 
were configured for lunar flight, the EDS would in- 
ject the combination into a translunar orbit and then 
be jettisoned. 

Like the Apollo lunar module, the LSAM would 
consist of an ascent stage, which would house the 
four-person crew, and a descent stage, which would 
have the landing legs, most of the crew’s oxygen and 
water, and scientific equipment. Unlike the Apollo 
lunar module, the LSAM’s engines would place the 
Orion/LSAM combination in lunar orbit. The entire 
crew would transfer to the LSAM, which would un- 
dock from Orion and descend to the lunar surface. 
After completing operations there, the crew would 
enter the LSAM’s ascent stage and lift off, powered 
by a single engine and using the descent stage as a 
launch pad. After docking with and transferring to 
the Orion spacecraft in lunar orbit, the crew would 
jettison the LSAM and fire the Orion’s single engine 
for the return journey to Earth. 

Lunar base. In December 2006, NASA announced 
plans to eventually construct a solar-powered lunar 
base located near one of the poles of the Moon, 
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rather than landing at several sites, as was done 
in the Apollo missions. An incremental buildup is 
envisioned, with astronauts making several seven- 
day visits to the Moon until their power supplies, 
rovers, and living quarters are operational. These vis- 
its would be followed by 180-day missions to prepare 
for journeys to Mars. Robotic precursor missions 
would provide landing site reconnaissance, natural 
resource assays, and risk reduction for the human 
landers. A decision on a site will probably not be 
made until after collecting data from several such 
missions. The Lunar Reconnaissance Orbiter (LRO) 
is scheduled to launch in 2008 to scout the lunar 
surface and search for resources. The Lunar Crater 
Observation and Sensing Satellite is to launch with 
the LRO and then crash into the lunar surface 
to search for water ice. 
Jesco von Puttkamer; Jonathan Weil 
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Space navigation and guidance 


The determination of the position and velocity of a 
space probe with respect to a target body or a known 
reference body such as the Earth (navigation) and, 
based upon this determination, the application of 
propulsive maneuvers to alter the subsequent path 
of the probe (guidance). 

Navigation. If the position and velocity of a space 
probe, plus some secondary quantities such as the 
positions and masses of any perturbing celestial bod- 
ies, can be determined at a given instant, then the 
future path of that vehicle can be predicted for all 
time by the application of the laws of celestial me- 
chanics. A simple way of understanding this is that, 
an instant later, the new position of the space probe 
will be the original position incremented by the ve- 
locity times the time duration. Similarly, the new ve- 
locity will be the original velocity incremented by the 
acceleration acting on the probe times the time du- 
ration. Additionally, the acceleration at the new time 
will be slightly different since gravitational acceler- 
ations acting on the probe depend on the position. 
Other accelerations may also affect the probe, such 
as thrust due to engine firings, solar radiation pres- 
sure, drag, and so forth. Thus, at the future time, the 


probe will arrive having a new position and a new 
velocity; these coordinates can be used as the basis 
for predicting an additional short interval into the 
future. By taking sufficiently short steps, a very ac- 
curate path of the future trajectory can be calculated. 
See CELESTIAL MECHANICS. 

Space navigation can be viewed as determining 
the current position and velocity of the probe and 
then using that determination as the basis of pre- 
dicting future motion. This determination is made 
by taking a series of measurements relating to the 
probe’s location and motion and combining these 
measurements in such a manner as to make the most 
accurate estimate of the probe’s current position and 
velocity, taking into account possible small errors or 
inaccuracies in the measurements themselves. One 
of the most powerful (and accurate) measurements 
which can be made is the relative velocity between 
an Earth tracking station and the space probe it- 
self. This is accomplished by broadcasting to the 
probe an electromagnetic signal which consists of 
a single tone having a stable frequency. The probe 
will receive this signal shifted in frequency in ex- 
act proportion to the relative station-probe velocity 
and then immediately will rebroadcast it back to the 
Earth. This frequency shift is known as the Doppler 
effect. It is also possible to measure the station- 
probe distance (or range) using electromagnetic sig- 
nals. Modern systems can make these measurements 
to an accuracy of about 0.02 mm/s (0.001 in./s) 
for the velocity and 1 m G ft) for the range at dis- 
tances extending to the outer limits of the solar sys- 
tem. An additional, very powerful measurement is 
known as delta-differential-one-way-range (denoted 
A-DOR or delta-DOR). It is performed by simultane- 
ously observing both a space probe and a nearby 
extragalactic radio source (quasar) from two track- 
ing stations. These four simultaneous measurements 
are differenced to form an extremely accurate es- 
timate of the angular distance between the probe 
and quasar as seen in the sky. The determination of 
the position and velocity of the probe relative to the 
Earth and hence the Sun can be made by observing 
these quantities over a period of time. See DOPPLER 
EFFECT. 

The probe’s orbit relative to the Earth, thus deter- 
mined, is most useful when the target body’s position 
relative to the Earth is also accurately known. This 
is true for lunar missions and missions to the plan- 
ets. For missions to the satellites of the outer planets 
Qupiter and beyond) or Pluto, for a mission to a body 
such as a comet or asteroid, the position of the tar- 
get may be sufficiently uncertain as to make a strictly 
Earth-relative navigation scheme inadequate. Here it 
is necessary to make measurements that directly in- 
volve the target. Typical of these is to obtain optical 
measurements of the locations of the target relative 
to a star as seen from the probe itself using an on- 
board camera. Again, a series of measurements of the 
angular distance between the target and a known star 
as the probe approaches can be used to determine 
the probe’s position and velocity, this time relative 
to the target. 


Guidance. In its simplest form, guidance consists 
of comparing the predicted future motion of the 
probe against that which is desired, and if these are 
sufficiently different, executing a propulsive maneu- 
ver to modify that future position. Such maneuvers 
are called trajectory correction maneuvers (TCMs). 
Typically, the probe will contain a number of small 
rocket motors which can be fired in any desired di- 
rection by first rotating the spacecraft away from 
its cruise orientation and then holding the new at- 
titude fixed while the rocket motor is firing. The 
cruise attitude is achieved by orienting the space- 
craft relative to celestial references. Typically, the roll 
axis of the probe is oriented toward the Sun. A roll 
of the probe until a predetermined star enters the 
field of view of a fixed star sensor is then sufficient 
to fix the probe’s complete attitude in space. The 
reorienting of the probe for the desired rocket fir- 
ing is accomplished by gyroscopes within the probe 
which measure and command the desired turns from 
the celestial reference orientation. An alternative, for 
both probes cruising in a fixed orientation as well as 
probes that are designed to spin about a primary 
axis, is to fire multiple thrusters oriented in differ- 
ent directions either in sequence or simultaneously 
such that the combination of the several maneuvers 
adds up to the desired modification to the trajectory. 
See STAR TRACKER. 

Modern missions generally employ sequences of 
these basic navigation and guidance functions. For 
example, for a typical trajectory to Mars, the navi- 
gation plan will call for a maneuver (TCM-1) shortly 
after launch (usually about 10 days) to correct for 
most of the errors due to the launch vehicle. A 
month or two later, TCM-2 is performed to cor- 
rect for most of the errors from TCM-1. As the 
probe approaches Mars, one to three or even four 
more TCM’s will be performed depending on a va- 
riety of requirements related to the specific mis- 
sion. For each TCM, Earth-based tracking (Doppler, 
ranging, and usually A-DOR) will be used to esti- 
mate the trajectory prior to performing the maneu- 
ver. Missions to bodies requiring optical navigation 
will also include this data for the final approach 
maneuvers. 

An entirely different method of making very large 
changes to the orbit of a space probe utilizes the grav- 
ity of intermediate bodies rather than the engines on 
the probe. These multiplanet missions require not 
only complex trajectory design techniques but also 
very accurate navigation to the intermediate targets. 
Such missions may gather scientific information at 
both the intermediate bodies and the ultimate target 
or may simply use the intermediate bodies as an ex- 
tremely efficient means of getting to the final target. 
At its simplest, the method of “gravity-assist” uses a 
close flyby of a massive body to exchange energy and 
momentum between the space probe and the body. 
Although large changes to the orbit of the probe are 
made, no discernible effect is observed on the mas- 
sive body due to the enormous difference in mass 
between the probe and flyby body. Illustration a 
shows that in the frame of reference of the flyby body 
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Qupiter in this case) the speed of the probe both ap- 
proaching and departing Jupiter is the same although 
the direction changes. In illus. b it is seen that since 
Jupiter is moving in the Sun’s frame of reference both 
the speed and direction of the probe are changed. In 
the 1970s the first missions to use gravity-assist were 
Mariner Venus/Mercury and both the Pioneer and 
Voyager missions to the outer solar system. Since 
that time Galileo (aunched in 1989) and Cassini- 
Huygens (launched in 1997) have used gravity-assist 
both for the interplanetary transfer and in orbit 
at Jupiter and Saturn, respectively. Missions have 
been launched to the extremes of the Solar System 
(MESSENGER in 2004 to Mercury orbit using Earth, 
Venus, and Mercury flybys, and New Horizons in 
2006 to Pluto using a Jupiter flyby), and the Stardust 
mission utilized an Earth flyby on its way to comet 
Wild 2 and returned a sample of comet dust to the 
Earth. Other than further missions to Mars, almost 
all of the future exploration of the solar system may 
make use of gravity-assist. All of these orbit designs 
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include a series of trajectory correction maneuvers 
on approach to each flyby body as well as one 
or more “clean-up” trajectory correction maneuvers 
shortly after the flyby. 

The guidance maneuvers discussed so far are de- 
signed to make small corrections to the planned 
orbit. For missions designed to orbit another body, 
a large terminal guidance maneuver, also called an 
orbit insertion maneuver (for example, MOI for Mars 
orbit insertion, or JOI for Jupiter orbit insertion), 
is applied at the encounter with the target body. 
These maneuvers are quite large and may use the 
large majority of the propellant carried by the probe. 
For example, the Galileo JOI was about 0.65 km/s 
(0.4 mi/s), while MOIs for Mars Odyssey (2001), 
Mars Express (2003), and Mars Reconnaissance Or- 
biter (2006) were about 1.4 km/s (0.9 mi/s), 0.8 km/s 
(0.5 mi/s), and 1.0 km/s (0.6 mi/s), respectively. For 
orbiter missions, navigation and guidance takes place 
much as for interplanetary cruise with both orbit 
determination and correction maneuvers being per- 
formed on a regular basis. See GUIDANCE SYSTEMS; 
SPACE PROBE. 

Nomenclature. Often “navigation” is used to mean 
all of orbit design, orbit determination, and maneu- 
ver design and execution. The terminology “guid- 
ance and control (G&C)” or “guidance, navigation, 
and control (GN&C)” is then used to denote the op- 
eration of the space probe itself—knowledge and 
control of its pointing and orientation in space 
plus the timing and firing of the onboard en- 
gines. Dennis V. Byrnes; David W. Curkendall 
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On-board assemblages of equipment to generate, 
store, and distribute electrical energy on satellites 
and spacecraft. A reliable source of electrical power 
is required for supplying energy to the spacecraft 
and its payloads during launch and through several 
years of operational lifetime in a space environment. 
Present-generation spacecraft fly power systems 
from tens of watts to several kilowatts. Each of the 
three fuel cells on the space shuttle delivers 12 kW 
continuous power and 16 kW peak power. The Inter 
national Space Station’s solar arrays will generate 
110 kW total power, with approximately 46 kW 
available for research activities. See SATELLITE (SPACE- 
CRAFT); SPACE FLIGHT; SPACE PROBE; SPACE SHUTTLE; 
SPACE STATION. 

With few exceptions, the power systems for 
United States satellites have used photovoltaic gen- 
eration for power, batteries for energy storage, and 
a host of electrical equipment for appropriate regu- 


lation, conversion, and distribution. Fuel cells have 
been limited primarily to use in the crewed space 
program, supplying power for Gemini, Apollo, and 
the space shuttle. Radioisotope thermoelectric gen- 
erators (RTGs) have powered, or augmented solar 
power on, many planetary missions and probes, 
powered lunar instruments left on the Moon by the 
Apollo missions, and augmented the solar-array bat- 
tery power system on at least one Earth-orbiting 
spacecraft. 

Many factors influence the final configuration of a 
power system. Basic to the initial consideration are 
the nature of the mission (Earth-orbiting or plane- 
tary) and mission lifetime. Other relevant factors in- 
clude (1) spacecraft and payload requirements with 
consideration to average and peak power loads; 
(2) effect of environment such as orbit period, length 
of time the spacecraft is in sunlight and shadow, 
radiation particle damage, and space charging; and 
G) constraints imposed by the spacecraft such as 
weight, volume, spacecraft shape and appendages, 
satellite attitude control, electromagnetic radiation 
limitations, characteristics of payloads, and thermal 
dissipation. For spacecraft that are shuttle-launched, 
additional considerations include compatibility with 
shuttle payload safety requirements and, in some 
cases, the retrievability by the shuttle for return to 
Earth. The weight of the power system ranges from 
15 to 25% of the spacecraft weight. 

Solar cells. The photovoltaic effect was discovered 
in 1877, but was not exploited until the silicon cell 
was invented in 1954. The first cells had conver- 
sion efficiencies of approximately 6%, and were little 
more than laboratory curiosities until the space pro- 
gram began in 1958. 

The first solar cells flown, pn cells (a thin p layer 
diffused into an n-type substrate), were selected be- 
cause of the lower sensitivity of the material. How- 
ever, these cells degraded in a radiation environ- 
ment much more rapidly than the np cells (a thin 
n-layer diffused into a p-type substrate). Since the 
early 1960s, the np cell has been used almost exclu- 
sively for United States spacecraft. 

Development efforts for solar cells have empha- 
sized improved conversion (sunlight to electrical) 
efficiency, increased resistance to charged-particle 
radiation, reduced manufacturing costs, and reduced 
weight. Conversion efficiency obtained in the con- 
ventional solar cells was approximately 11% until 
1972, when the violet cell was perfected with an 
efficiency of over 13%. 

The following additions have pushed silicon solar- 
cell efficiency up to about 14%: a p* layer (called 
a black-surface field) that reduces hole-electron re- 
combinations at the cell’s back surface; a textured 
front surface and multilayer antireflection coatings 
that decrease reflected light; and a back-surface mir- 
ror that reduces operating temperature. For silicon 
cells, maximum achievable efficiency appears to be 
16-18%. Consequently most development efforts are 
focused on improving the design of the array or per- 
fecting the gallium aluminum arsenide/gallium ar- 
senide (GaAlAs/GaAs) cell technology. 


The potential of GaAlAs/GaAs solar cells to ob- 
tain efficiencies of 20-25% has prompted major 
developments in this technology. A liquid-phase epi- 
taxially grown GaAlAs/GaAs cell has achieved effi- 
ciencies as high as 18%. A metal/organic chemical 
vapor deposition process for cell fabrication and 
a multibandgap gallium aluminum arsenide/gallium 
indium arsenide (GaAlAs/GaInAs) cell that uses alu- 
minum and indium material to enhance efficiency 
are other GaAlAs/GaAs developments. Develop- 
ments anticipated for GaAlAs/GaAs cells indicate 
higher-performance cells with significant cost sav- 
ings for space application. See SOLAR CELL. 

Solar arrays. The process of assembling individual 
cells into series and parallel circuits to obtain the re- 
quired working voltage and power is intricate and 
complex. There are two basic cell-mounting config- 
urations. Body-mounted arrays, because of limited 
available space, occasionally use a flexible shingle ar- 
rangement. This arrangement usually provides more 
cells per unit area, thereby providing a higher output 
for a given area. However, flat mounting, with space 
provided between cells for electrical interconnects, 
is most frequently used (Fig. 1). 

Many solar arrays now in use are in the 1-10-kW 
range, such as the array for the Solar Maximum Mis- 
sion (Fig. 2). The four solar arrays for the Interna- 
tional Space Station produce 31 kW each. 
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Fig. 1. Cross section of a solar array, illustrating the 
materials and components used in assembly with the 
flat-mounting technique. 


The flexible roll-up solar-array (FRUSA) technol- 
ogy was first demonstrated in 1971 on an Air Force 
flight-test validation program. Solar arrays such as 
the FRUSA have a specific power of 20 W/lb 
(44 W/kg). Studies for a solar-array-powered propul- 
sion stage have demonstrated the feasibility of a 
12.5-kW array with a specific power of 30 W/lb 
(66 W/kg), and work has been undertaken to eventu- 
ally achieve a specific power of 90 W/lb (200 W/kg). 

The Hubble Space Telescope used two rolled-up 
arrays. They developed cooling-heating problems 
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Fig. 2. One of the two solar-array wings flown on the Solar Maximum Mission spacecraft. Overall dimensions are 7.2 by 


11.5 ft (2.2 by 3.5 m). 
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which caused vibration during the transitions of 
the telescope from shadow to sunlight. The ar- 
rays were redesigned and replaced during a tele- 
scope reservicing mission. See HUBBLE SPACE TELE- 
SCOPE. 

With the advent of GaAlAs/GaAs cells, solar arrays 
using solar concentrators have been studied. The 
useful range of conventional solar arrays for deep- 
space missions is limited to about 1.5 astronomical 
units (AU) [Mars orbit]. Studies have shown that, 
with solar concentration, the photovoltaic power 
system’s useful range can be extended to at least 
9 AU [Saturn orbit]. A concentrator system can 
be used with either silicon or GaAlAs/GaAs cells. 
However, the GaAlAs/GaAs solar arrays potentially 
have several advantages over silicon solar arrays, in- 
cluding higher beginning-of-life (BOL) efficiencies, 
less degradation in a radiation environment, higher 
operating temperatures, more rapidly increasing 
performance with concentrator ratios, and greater 
ability to recover from radiation damage at lower 
temperatures. With concentrator ratios of 8:25, an 
overall power density at beginning of life of 9 W/lb 
(20 W/kg) is projected. The potentially low cost of 
concentration may make this approach economically 
more attractive than RTGs. 

One concentrator system is aboard an HS 702 
geosynchronous spacecraft, designed by Hughes 
Space and Communications Company. The satellite, 
Galaxy XI, was launched on December 21, 1999. 
The solar arrays were deployed on January 21, 2000, 
and for the first time television cameras were used 
to watch the deployment. Most concentrator designs 
use a lens to focus more sunlight onto the cells. The 
Hughes design used reflector panels on each side of 
the solar panel. The cells were Ga-As and produced 
12.4 kW at approximately 25% efficiency. 

Another mission that used concentrators was Deep 
Space 1. It had 720 lenses to focus sunlight onto the 
solar arrays, producing 2500 W at Earth’s orbit. 

Other proposals have been made to improve the 
efficiencies and thus power outputs of solar arrays. 
One proposal is to stack cells on top of each other, 
each efficient at absorbing certain bandgap frequen- 
cies (colors) of light but allowing other frequencies 
to be transmitted to the next cell. A GaAs three- 
junction multibandgap cell has a calculated effi- 
ciency of over 34%. Another proposal would use 
the same principle, but instead of stacking, the cells 
would be placed side by side and a prism would 
spread the white light into the rainbow of colors 
that the cells would absorb. Five different cells 
would have approximately 32% efficiency in 
normal sunlight, and if concentrators were used be- 
fore the prisms to concentrate the light by eight 
times, the efficiency would be 43%. See SOLAR 
ENERGY. 

Batteries. Although both primary and secondary 
(rechargeable) batteries have been used in the space 
program, satellites now in use almost always employ 
rechargeable batteries. Five different rechargeable 
electrochemical systems are used in space power 
systems: nickel-cadmium (NiCd), silver-cadmium 


(AgCd),  silver-zinc (AgZn), and nickel-hydrogen 
(Ni), and lithium-ion (Li-ion). 

The predominant factor in the use of nickel- 
cadmium batteries is the exceptional long-cycle 
life capability (Fig. 3). An excess of 8 years (over 
40,000 cycles) has been obtained on nickel-cadmium 
batteries operating in a near-Earth environment. At 
synchronous altitudes, several satellites have oper- 
ated in excess of 10 years with nickel-cadmium 
batteries. 

Sealed silver-cadmium batteries have seen limited 
applications in space power systems, even though 
the energy density is higher than that of a nickel- 
cadmium battery. This limited use is due primarily 
to its very low cycle life (ess than one-tenth that of 
the nickel-cadmium battery). Missions that use the 
silver-cadmium battery usually require a spacecraft 
with ultraclean magnetic properties. With an appro- 
priately designed and wired silver-cadmium battery, 
the magnetic field is typically less than 1 gamma 
(1 nanotesla) at 1 ft GO cm). Lifetimes in excess of 
4 years have been obtained on missions with low 
battery duty cycles and a “cold storage” of the bat- 
tery during nonduty cycles. 

Because silver-zinc batteries offer excellent energy 
density but have a very limited life, particularly when 
used in a cycling mode, they have been used only in 
space power systems that require only a few days to 
a few months of operation. 

Nickel-hydrogen batteries have replaced nickel- 
cadmium batteries on some space missions. They 
have a higher energy density and energy per volume 
than nickel-cadmium. They have been used on the 
Satcom V communication satellite, the Earth Obser- 
vation Satellite, and the Hubble Space Telescope. In 
fact, the original batteries for the Hubble Space Tele- 
scope were nickel-cadmium, but they were changed 
to nickel-hydrogen because the telescope is colder 
than was expected, leading to more power being 
used by the heaters. 

Lithium-ion batteries were formerly considered 
too dangerous for space missions since lithium can 
catch fire or explode. However, this changed with 
the launch of Syracuse IIIA from French Guiana in 
October 2005, the first spacecraft to carry a lithium- 
ion battery. The advantage of lithium-ion batteries is 
that they can store the same amount of energy as 
a nickel-hydrogen battery in a smaller volume with 
50% less mass. Also, after a month of storage, they 
still have 98% of their charge. This means that no 
recharge is needed on the launch pad. 

There are several batteries under development to 
replace nickel-cadmium (since cadmium is toxic) 
and nickel-hydrogen (since such batteries are ex- 
pensive to produce). Nickel-metal hydride (NiMH), 
silver-metal hydride (AgMH), and sodium-sulfur 
(NaS) batteries are the prime contenders. Nickel- 
metal hydride batteries are similar to nickel-hydrogen 
except that the hydrogen is stored in the metal 
hydride solid rather than as a gas. The pressure of 
nickel-metal hydride is about 50 Ib/in.? (350 kilo- 
pascals), compared to 1000 Ib/in.? (7 megapascals) 
for nickel-hydrogen. The nickel-hydrogen batteries 
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Fig. 3. A 20-Ah battery designed for space flight. Hermetically sealed nickel-cadmium cells are series-connected to provide 


spacecraft power at approximately 27 V. 


are therefore in the form of a round pressure ves- 
sel, whereas the nickel-metal hydride batteries can 
be made rectangular. Nickel-metal hydride batteries 
have twice the energy density of nickel-cadmium and 
twice the energy per unit volume. They would also 
have twice as much energy per unit volume as nickel- 
hydrogen. Silver-metal hydride batteries might have 
three times the energy density of nickel-cadmium. 
See ENERGY STORAGE. 

Flywheels. Flywheels store mechanical energy 
using a spinning wheel. When connected to a motor/ 
generator, the solar arrays spin up the motor/fly- 
wheel. When the arrays are in shadow, the flywheel 
spins down and the generator produces electricity. 
Thus, the flywheel can be used to “store” electric- 
ity like a battery. Prior to 2001, NASA worked on 
development of a flywheel energy storage (FES) sys- 
tem for the International Space Station. The solar 
arrays provide power for the station, and part of that 
electricity is used to charge the batteries. When the 
station enters the shadow of the Earth, the batteries 
supply the power to the station through the Battery 
Charge/Discharge Unit (BCDU). The flywheel system 
was intended to replace the batteries and the BCDU. 
The flywheels were expected to store more energy 
than the batteries, allowing extra electrical capacity 
for the experiments. Batteries have a limited amount 
of charge/discharge cycles, whereas flywheels can 
be charged and discharged virtually forever. There- 
fore they last longer and were expected to save hun- 
dreds of millions of dollars over the life of the station. 
Testing at Johnson Space Center was scheduled to 


start in 2001 with a planned launch to the Interna- 
tional Space Station in 2004. However, this program 
was canceled by the Bush Administration in 2001. 
See SPACE STATION. 

Fuel cells. The technology resulting from the Gem- 
ini and Apollo fuel-cell programs served as the 
basis for the space shuttle fuel-cell development. The 
shuttle fuel cell has three sets of 32 series-connected 
cells that provide an operating voltage range of 27.5- 
32.5 V and power of 16 kW (with maximum 21 kW). 
Each cell is designed for up to 2400 h of operation 
and weighs 281 lb (127 kg). The shuttle power plant 
consists of three fuel cells that feature a startup time 
of 15 min (compared with 24 h for Apollo) and in- 
stantaneous shutdown (17 h for Apollo). 

Fuel cells have four distinct advantages for 
crewed missions: high power, high specific power 
(56.9 W/Ib or 125.8 W/kg), high efficiency (70%), 
and a benign by-product—water. In fact, the astro- 
nauts drink the water produced by the shuttle fuel 
cells. See FUEL CELL. 

Tethers. Tethers are a space power system cur- 
rently in development. A conducting tether many 
kilometers long is deployed from a spacecraft in 
orbit about a planet with a magnetic field, such as 
Earth. The solar wind issues electrons, which travel 
in spiral paths around the magnetic field lines. As the 
conducting tether crosses the field lines, it induces 
an electric field, the same principle that terrestrial 
generators use. Using an electron gun on the space- 
craft to shoot electrons back into space completes 
the circuit and current flows. The electromotive 
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force (emf) for low-Earth-orbit (LEO) satellites is a 
few tenths of a volt per meter. A 6-mi (10-km) tether 
would produce several kilovolts, leading to several 
kilowatts of power. The TSS-1 (Tethered Satellite 
System 1) shuttle payload tried to test this idea but 
had problems reeling out the tether. Tether voltages 
as high as 3500 V and current levels of 480 mA were 
obtained in a subsequent flight, generating a mod- 
est amount of power, before the tether severed at 
12.2 mi (19.6 km). This was better than was pre- 
dicted by the analytical models. See ELECTROMAG- 
NETIC INDUCTION. 

Solar dynamic systems. Solar dynamic systems 
have been tested to achieve higher efficiencies and 
power while using less mass and volume than a solar 
array would use. The principle is that a parabolic or 
nearly parabolic reflector would concentrate solar 
energy on a receiver. The receiver has a thermal stor- 
age area (for when the station enters Earth’s shadow) 
and a fluid. The solar energy heats the fluid, which 
turns a turbine-generator to supply electricity. Effi- 
ciencies of roughly 30-40% are possible. 

A joint NASA and Russia program to put a solar dy- 
namic system on the space station Mir was canceled 
in February 1996 after several successful tests. The 
flight hardware built for the Mir program is currently 
in storage at the NASA Glenn Research Center. Plans 
by NASA to use most of the existing hardware and 
redesign the system to be installed on the Interna- 
tional Space Station were canceled. 

Nuclear power systems. Nuclear power system 
concepts encompass radioisotopes and nuclear re- 
actor heat sources coupled with thermal-to-electric 
converters. Nuclear power is now limited to a few 
experimental applications. Significant advances in 
materials must be made before a prototype converter 
can be demonstrated. The relatively heavy weight 
(2.5-3 W/Ib or 5.5-6.6 W/kg), high cost, and lack 
of demonstrated reliability make nuclear power sys- 
tems unattractive for Earth-orbiting space power sys- 
tems below the kilowatt range. 

Estimates indicate that the nuclear system could 
become competitive with solar power at 25 kW 
or greater. However, with the reentry of a nuclear- 
reactor-powered Soviet satellite (Cosmos 954) in 
Canada during the summer of 1979, the use of 
nuclear reactors in space moved from the technical 
domain to the political arena. The need and desir- 
ability of nuclear power in space is a controversial 
issue with worldwide ramifications. 

RTGs have been used for space-borne power pri- 
marily for special or unique applications. Present 
RTGs are descended from power sources developed 
by the Space Nuclear Auxiliary Power (SNAP) pro- 
gram starting in the 1950s. Although RTGs have been 
used occasionally in Earth-orbiting satellites, their 
principal space use has been to power space probes 
and lunar equipment. RTGs remain on the lunar sur- 
face from Apollo 12, 14, 16, and 17 missions. These 
units used lead telluride thermoelectric material and 
produced approximately 75 W of power. The Galileo 
probe to Jupiter, the Ulysses probe of the Sun, and 
the Cassini probe to Saturn use the most advanced 


RTG so far, the general-purpose heat source (GPHS) 
RTG. These use silicon-germanium thermoelectric 
material and produce 300 W of power. See NUCLEAR 
BATTERY; THERMOELECTRICITY. 

For power levels above 1 kW, a dynamic isotope 
power system (DIPS) is desirable. The radioisotope 
is used to power a Rankine, Brayton, or Stirling 
device. This system has roughly three times the 
efficiency of a thermoelectric generator. NASA is 
currently (2006) testing Stirling generators. See 
BRAYTON CYCLE; RANKINE CYCLE; STIRLING ENGINE. 

Jeffrey C. Mitchell 
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Griffin and J. R. French, Space Vehicle Design, 2d ed., 
American Institute of Aeronautics and Astronautics, 
Washington, DC, 2004. 
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Space probe 


An automated crewless vehicle, the payload of a 
rocket-launching system, designed for flight missions 
to other planets, to the Moon, and into interplanetary 
space, as distinguished from Earth-orbiting satellites 
(see table). 

The space probe is used primarily for scientific 
purposes, which are stated as the mission objectives. 
Missions have been designed to explore many di- 
verse targets, such as the rocky bodies of the inner 
solar system, including Mercury, Venus, Mars, and 
the asteroids; the giant gaseous planets in the outer 
solar system, Jupiter, Saturn, Uranus, and Neptune; 
and icy bodies originating in the outer solar system, 
such as comets, Kuiper Belt objects, and the dwarf 
planet Pluto. Other missions are designed to study 
solar physics, the properties of interplanetary space, 
or targets far beyond the solar system. Most space- 
craft launched to a planet or other body also study 
the environment of charged particles and electro- 
magnetic fields in interplanetary space during their 
cruise phase en route to the destination. 

Missions may also be categorized by complexity. 
The simplest are flyby spacecraft, which study their 
target body during a relatively brief encounter period 
from a distance of hundreds to thousands of miles 
as they continue past. Next are orbiters, which cir- 
cle a planet or other body for extended study; some 
may carry atmospheric descent probes. Even more 
complex are lander missions, which touch down on 
a planet or other body for the collection of on-site 
data; some may bear exploration rovers designed to 
range beyond the immediate landing site. Finally, the 
most complex space probes envisaged are sample- 
return missions, which would collect specimen ma- 
terial from a target body and return it to Earth for 
detailed study. 


Spacecraft Subsystems 


In the broadest terms, a space probe may be con- 
sidered a vehicle that transports a payload of sensing 








Important space probes 








Name Launch date Comments 

Luna 1 Jan. 2, 1959 Lunar flyby; passed within 3278 mi (5257 km) of the Moon 

Pioneer 4 Mar. 3, 1959 Cosmic rays; passed 37,300 mi (60,000 km) from Moon 

Luna 2 Sept. 2, 1959 Impacted Moon 

Luna 3 Oct. 4, 1959 Photographed far side of Moon 

Pioneer 5 Mar. 11, 1960 Study of space between Earth and Venus; magnetic fields and cosmic rays 

Mariner 2 Aug. 26, 1962 First planetary flyby; Venus probe 

Ranger 7 July 28, 1964 Lunar impact and approach photography 

Mariner 4 Nov. 28, 1964 Mars flyby; photography, magnetic fields, cosmic rays 

Ranger 8 Feb. 17, 1965 Lunar impact and approach photographs 

Ranger 9 Mar. 21, 1965 Lunar impact in Alphonsus; approach photography 

Zond 3 July 18, 1965 Flyby of Moon; photographs, other data 

Pioneer 6 Dec.16, 1965 Solar orbiter 

Luna 9 Jan. 31, 1966 First soft landing on Moon 

Luna 10 Mar. 31, 1966 First probe to orbit Moon; collected data, but no camera instrument 

Surveyor 1 May 30, 1966 Soft landing on Moon; environmental data and photography 

Lunar Orbiter 1 Aug. 10, 1966 Lunar photographs 

Pioneer 7 Aug. 17, 1966 Solar orbiter 

Luna 11 Aug. 24, 1966 Lunar orbiter; TV camera failed 

Luna 12 Oct. 22, 1966 Lunar orbiter; successful return of photographs and other data 

Lunar Orbiter 2 Nov. 6, 1966 Lunar orbital photography 

Luna 13 Dec, 21, 1966 Lunar lander; surface photography and soil information 

Lunar Orbiter 3 Feb. 5, 1967 Lunar orbital photography 

Venera 4 June 12, 1967 Analysis of Venus atmosphere; lander failed during descent 

Mariner 5 June 14, 1967 Venus flyby; atmospheric and magnetospheric data 

Surveyor 3 Apr. 17, 1967 Lunar surface photography and surface properties 

Lunar Orbiter 4 May 4, 1967 Lunar orbital photography 

Lunar Orbiter 5 Aug. 1, 1967 Lunar orbital photography 

Surveyor 5 Sept. 8, 1967 Lunar surface photography and surface properties, including elemental analysis 
of surface 

Surveyor 6 Nov. 7, 1967 Same as Surveyor 5; landing in Sinus Medii 

Pioneer 8 Dec. 13, 1967 Solar orbiter 

Surveyor 7 Jan. 7, 1968 Same as Surveyor 5 

Luna 14 Apr. 7, 1968 Lunar orbiter 

Pioneer 9 Nov. 8, 1968 Solar orbiter 

Zond 6 Nov. 10, 1968 Lunar flyby; crashed on return to Earth 

Venera 5 Jan. 5, 1969 Same as Venera 4 

Venera 6 Jan. 10, 1969 Same as Venera 4 

Mariner 6 Feb. 25, 1969 Photography and analysis of surface and atmosphere of Mars 

Mariner 7 Mar. 27, 1969 Same as Mariner 6 

Luna 15 July 14, 1969 Lunar orbiter (crashed during attempted lunar landing) 

Zond 7 Aug. 8, 1969 Reentered Aug. 14, 1969; third crewless circumlunar flight; recovered in the 
Soviet Union 

Venera 7 Aug. 17, 1970 Lander capsule transmitted 23 min from surface of Venus, Dec. 15, 1970 

Luna 16 Sept. 12, 1970 Reentered Sept. 24, 1970; crewless Moon lander touched down on Sea of 
Fertility Sept. 20, 1970; returned lunar soil samples 

Zond 8 Oct. 20, 1970 Circled Moon; recovered Oct. 27, 1970 

Luna 17 Nov. 10, 1970 Landed on Moon Nov. 17, 1970; crewless Moon rover 

Mars 2 May 19, 1971 First Mars landing 

Mars 3 May 28, 1971 Mars probe 

Mariner 9 May 30, 1971 Mars probe 

Luna 18 Sept. 2, 1971 Lunar lander; failed during descent 

Luna 19 Sept. 28, 1971 Lunar photography mission 

Luna 20 Feb. 14, 1972 Lunar sample return 

Pioneer 10 Mar. 2, 1972 Jupiter encounter; transjovian interplanetary probe 

Venera 8 Mar. 27, 1972 Venus landing July 22, 1972 

Luna 21 Jan. 8, 1972 Lunar lander with rover 

Pioneer 11 Apr. 5, 1973 Jupiter encounter and trasjovian interplanetary probe; also Saturn encounter 

Mars 4 July 21, 1973 Mars orbiter 

Mars 5 July 25, 1973 Mars orbiter 

Mars 6 Aug. 5, 1973 Mars lander 

Mars 7 Aug. 9, 1973 Mars lander 

Mariner 10 Nov. 3, 1973 Venus and Mercury encounter 

Luna 22 May 29, 1974 Lunar orbiter 

Helios 1 Dec. 10, 1974 Inner solar system, solar wind exploration 

Venera 9 June 8, 1975 Venus probe 

Venera 10 June 14, 1975 Venus probe 

Viking 1 Aug. 20, 1975 Mars lander and orbiter 

Viking 2 Sept. 9, 1975 Mars lander and orbiter 

Helios 2 Jan. 15, 1976 Interplanetary; similar objectives to those of Helios 1 

Luna 24 Aug. 9, 1976 Lunar sample return 

Voyager 2 Aug. 20, 1977 Jupiter, Saturn, Uranus, and Neptune encounters; also satellites and ring 
systems 

Voyager 1 Sept. 5, 1977 Same objectives as Voyager 2 with some orbital differences giving differing 
encounter trajectories 

Pioneer Venus Orbiter May 20, 1978 Returning atmospheric, surface, and particle and field information 

Pioneer Venus Aug. 8, 1978 Penetration of Venus atmosphere by four probes; returned atmospheric data 

Multi-Probe Bus 

Venera 11 Sept. 8, 1978 Venus lander; returned information on surface properties; detection of lightning 
and thunderlike sounds 

Venera 12 Sept. 14, 1978 Similar mission to Venera 11 


(conti.) 
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Important space probes (cont.) 
Name Launch date Comments 
Venera 13 Oct. 30, 1981 Venus lander 
Venera 14 Nov. 4, 1981 Venus lander 
Venera 15 June 2, 1983 Venus lander; surface topography 
Venera 16 June 7, 1983 Similar mission to Venera 15 
International Cometary = Originally International Sun-Earth Explorer 3 (ISEE 3) Earth satellite, redirected 
Explorer (ICE) using a lunar swingby on Dec. 22, 1983, to encounter with Comet 
Giacobini-Zinner; plasma and magnetic field measurements 
Vega 1 Dec. 15, 1984 Venus probe-Halley intercept 
Vega 2 Dec. 21, 1984 Venus probe-Halley intercept 
Sakigake Jan. 8, 1985 Halley intercept; precursor to Suisei, upgraded to full mission 
Giotto July 2, 1985 Halley intercept 
Suisei Aug. 19, 1985 Halley intercept; plasma and magnetic field measurements 
Phobos 1 July 7, 1988 Mars/Phobos probe, lost by command error 
Phobos 2 July 12, 1988 Mars/Phobos probe; some data, but communications lost 
Magellan May 4, 1989 Venus radar mapper 
Galileo Oct. 18, 1989 Jupiter orbiter and atmospheric probe 
Hiten/Hagoromo Jan. 24, 1990 Moon orbiter and relay probe; orbiter transmitter malfunctioned 
Ulysses Oct. 6, 1990 Solar polar orbiter 
Mars Observer Sept. 25, 1992 Contact lost 3 days before Mars arrival 
Clementine Jan. 25, 1994 Orbited Moon; thruster malfunction prevented asteroid flyby 
Solar and Heliospheric Dec. 2, 1995 Orbits L1 libration point to study the Sun 
Observatory (SOHO) 
Near Earth Asteroid Feb. 17, 1996 Asteroid orbiter 
Rendezvous 
Mars Global Surveyor Nov. 7, 1996 Mars orbiter 
Mars 96 Nov. 16, 1996 Mars orbiter and landers; launch vehicle failed 
Mars Pathfinder Dec. 4, 1996 Mars lander and rover 
Advanced Composition Aug. 25, 1997 Orbits L1 libration point to study charged particles 
Explorer 
Cassini Oct. 15, 1997 Saturn orbiter/Titan descent probe 
Lunar Prospector Jan. 6, 1998 Lunar orbiter 
Nozomi July 4, 1998 Mars orbiter; failed to reach Mars orbit 
Deep Space 1 Oct. 24, 1998 Test of ion engine and 11 other advanced technologies; asteroid and comet 
flybys 
Mars Climate Orbiter Dec. 11, 1998 Lost during Mars arrival 
Mars Polar Lander Jan. 3, 1999 Lost during Mars arrival 
Stardust Feb. 7, 1999 Comet flyby, dust sample return 
Mars Odyssey April 7, 2001 Mars orbiter 
Wilkinson Micowave June 30, 2001 Solar orbiter studying cosmic background radiation 
Anisotropy Probe 
Genesis Aug. 8, 2001 Solar wind sample return; hard impact upon landing in Utah 
Comet Nucleus Tour July 3, 2002 Intended for two comet flybys; failed shortly after launch 
Hayabusa May 9, 2003 Comet sample return mission; sample collection uncertain 
Mars Express June 2, 2003 Mars orbiter 
Beagle 2 June 2, 2003 Lander carried on Mars Express; failed during descent 
Spirit (Mars Exploration June 10, 2003 Mars rover 
Rover A) 
Opportunity (Mars July 7, 2003 Mars rover 
Exploration Rover B) 
Spitzer Space Aug. 25, 2003 Infrared telescope in Earth-trailing heliocentric orbit 
Telescope 
Smart 1 Sept. 27, 2003 Lunar orbiter 
Rosetta March 2, 2004 Comet orbiter-lander 
Messenger Aug. 3, 2004 Mercury orbiter 
Deep Impact Jan. 12, 2005 Released impactor to excavate crater in comet nucleus 
Mars Reconnaissance Aug. 12, 2005 Mars orbiter 
Orbiter 
Venus Express Nov. 9, 2005 Venus orbiter 
New Horizons Jan. 19, 2006 Pluto flyby 
Solar Terrestrial Oct. 26, 2006 Twin spacecraft in solar orbit 
Relations 
Observatory 








instruments to the vicinity of a target body. Thus, the 
spacecraft must include a number of subsystems to 
provide power, to communicate with Earth, to main- 
tain and modify attitude and perform maneuvers, to 
maintain acceptable on-board temperature, and to 
manage the spacecraft overall. See SPACE TECHNOL- 
OGY. 

Scientific instruments. The scientific payload may 
be divided into remote-sensing instruments such as 
cameras, and direct-sensing instruments such as mag- 
netometers or dust detectors. They may be classified 


as passive instruments, which detect radiance given 
off by a target body, or active, which emit energy 
such as radar pulses to characterize a target body. 
Examples of passive remote-sensing instruments 
are cameras and other imagers, polarimeters, pho- 
tometers, spectrometers, and radiometers. Passive 
direct-sensing instruments include those that study 
magnetic fields and charged particles, such as high- 
and low-energy particle detectors, plasma instru- 
ments, plasma wave detectors, dust detectors, and 
magnetometers, as well as instruments on landers 


such as those of the Viking Mars mission. Ac- 
tive remote-sensing instruments include synthetic- 
aperture radar and altimeters. 

Early missions up to and including Voyager used 
vidicon tubes similar to those in terrestrial television 
cameras of that period as the detector of the on- 
board camera. Some, including Lunar Orbiters and 
Luna missions, actually used film which was devel- 
oped and scanned on board. The image information 
was then digitized before being transmitted to Earth. 
Subsequent missions have used solid-state charge- 
coupled devices (CCDs), similar to those in contem- 
porary television cameras. A spacecraft’s camera sys- 
tem frequently is outfitted with two sets of optics to 
provide wide- and narrow-field views; the narrow- 
field camera is the equivalent of a small on-board 
telescope. In addition, the camera system includes a 
filter wheel with various colored filters which may 
be placed in the optical path. Color images are made 
by rapidly taking a series of three exposures through 
different filters; these are transmitted to Earth and 
combined into a single frame during image pro- 
cessing. Most spacecraft cameras, such as those on 
Voyager, Galileo, or Cassini, capture light in dis- 
crete snapshot-type exposures. By contrast, some 
orbiters, such as Mars Global Surveyor and Mars Re- 
connaissance Orbiter, carry pushbroom-type cam- 
eras; these collect data continuously, building up a 
long ribbonlike image as the spacecraft passes over 
the planet’s surface. See ASTRONOMICAL IMAGING; 
CHARGE-COUPLED DEVICES; COLOR FILTER; REMOTE 
SENSING; TELEVISION CAMERA TUBE. 

Power subsystem. Electrical power is required for 
all spacecraft functions. The total required power 
ranges from about 300 to 2500 W for current mis- 
sions, depending on the complexity of the space- 
craft. The power subsystem must generate, store, and 
distribute electrical power. 

All space probes launched so far have generated 
power either via solar panels or via radioisotope 
power systems. Solar panels are generally em- 
ployed on missions to the solar system’s inner plan- 
ets, where sunlight remains relatively strong. The 
strength of sunlight diminishes as the square of the 
distance from the Sun. Missions beyond the orbit of 
Mars have therefore relied on radioisotope systems. 
In addition, radioisotope systems have been used on 
the Viking Mars landers because of the unreliability 
of sunlight during dust storms and other periods. 

Solar panels typically employ photovoltaic devices 
constructed of materials such as crystalline silicon 
or gallium arsenide. A silicon cell 2.4 in. ( cm) in 
diameter, for example, produces a current of about 
1 A at 0.25 V under direct sunlight at a distance of 
1 astronomical unit (1 AU, the average distance from 
Earth to the Sun, is about 9.3 x 107 mi or 1.5 x 
10% km). Solar panels are mounted on arrays which 
may be articulated to optimize pointing at the Sun; 
alternatively, they may be off-pointed during peri- 
ods to generate less electrical energy in the interest 
of on-board power management. The performance 
of solar panels degrades gradually at a rate of per- 
haps 1-2% per year. On the solar-powered Magellan 


mission, power loss was unusually great. After a 15- 
month interplanetary cruise and 4 years in orbit at 
Venus, careful power management was required in 
the mission’s final days before Magellan was inten- 
tionally directed into the planet’s atmosphere to burn 
up in October 1994. See ASTRONOMICAL UNIT; SOLAR 
CELL. 

The other principal power source, the radioiso- 
tope power system, produces electrical power 
through the radioactive decay of plutonium-238. The 
radioisotope device is not a nuclear reactor, how- 
ever; as the plutonium naturally decays, it gives off 
heat which is converted to electricity. In early de- 
vices this conversion was accomplished by ther- 
mocouples made of silicon-germanium junctions, 
whereas one newer design uses thermocouples com- 
posed primarily of lead telluride, and another envis- 
ages the use of a dynamic Stirling converter. Waste 
heat not converted to electricity is radiated into 
space via metal fins on the generator casing. The out- 
put of radioisotope power systems gradually decays 
at about 1-2% per year, slightly more rapidly than 
actively used solar panels. See NUCLEAR BATTERY. 

Because they typically go through alternating 
sunlight and shadow, spacecraft powered by solar 
panels must carry batteries to store electricity for 
periods when the panels are in shade. Nickel hydro- 
gen, nickel metal hydride, and lithium batteries are 
often used. Their performance degrades after many 
cycles of charging and partial discharging, but the 
batteries may be reconditioned by discharging them 
completely, much like rechargeable batteries in ter- 
restrial applications. Because the electrical power 
output of radioisotope power systems does not vary 
significantly from hour to hour, spacecraft equipped 
with them do not require batteries, but may require 
short-term energy-storage devices, such as capaci- 
tors, to provide for high-pulse current needs. 

The other function of the power subsystem is to 
distribute electrical power. A typical spacecraft sup- 
plies its various on-board subsystems with a specific 
constant bus voltage, although some instruments 
may require special voltages. During the course of 
the mission, various instruments are switched on 
and off as they are used to make observations. This 
switching is performed with solid-state or mechani- 
cal relays. A shunt regulator is generally used to main- 
tain a constant voltage on the main bus; this converts 
electrical energy into heat, which is radiated into 
space. In some cases it may be necessary to switch 
instruments on and off in a particular order to man- 
age the voltage on the main bus. See RELAY; SPACE 
POWER SYSTEMS. 

Telecommunications subsystem. In order to accom- 
plish its mission, the spacecraft must maintain com- 
munications with Earth. This includes receiving com- 
mands sent from ground controllers, as well as 
transmitting scientific data and routine engineer- 
ing “housekeeping” data. All of these transmissions 
are made in various segments of the microwave 
spectrum. Early missions typically used lower fre- 
quencies in the L or S band (1-4 GHz), while subse- 
quent missions have used the X band (7-9 GHz). The 
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design of the telecommunications subsystem takes 
into account the volume of data to be transmitted 
and the distance from Earth at which the spacecraft 
will operate, dictating such considerations as the size 
of antennas and the power of the on-board transmit- 
ter. See MICROWAVE; TELEMETERING. 

Advanced planetary probes have carried a dish- 
shaped high-gain antenna which is the chief antenna 
used to both transmit and receive. These antennas 
typically consist of a large parabolic reflector, with a 
subreflector mounted at the main reflector’s focus in 
a Cassegrain-type configuration. Most large high-gain 
antennas are fixed to the spacecraft’s body; the en- 
tire spacecraft must maintain Earth pointing to avoid 
disrupting communication. Smaller parabolic anten- 
nas used in relatively close missions such as those to 
Mars, however, may be on a steerable mount. Most 
large parabolic antennas are rigid; the high-gain an- 
tenna for Cassini, for example, is fabricated from a 
lightweight composite material. Galileo used a 16-ft 
(4.8-m) high-gain antenna designed to unfold like 
an umbrella after launch; although similar antennas 
functioned well on Earth-orbiting satellites, the an- 
tenna on Galileo did not open fully when ground 
controllers attempted to deploy it in April 1991, ren- 
dering it unusable. The beam width of parabolic 
high-gain antennas is typically very narrow, requir- 
ing an Earth pointing to within a fraction of a de- 
gree. The advantage of the high-gain antenna is 
that it allows a much greater transmission rate than 
do other antenna types. See ANTENNA (ELECTRO- 
MAGNETISM). 

In the interest of redundancy and in the event 
that Earth pointing is lost, spacecraft virtually always 
carry other on-board antennas. These may be low- 
gain antennas, which typically offer nearly omnidi- 
rectional coverage except for blind spots shadowed 
by the spacecraft body. Galileo was outfitted with 
two low-gain antennas which combined to afford 
completely omnidirectional capability. Other space- 
craft have used medium-gain antennas, which pro- 
vide a beam width of perhaps 20-30°, with a gain 
midway between that of high- and low-gain anten- 
nas. Magellan, for example, was equipped with a 
conical feed horn antenna which it used when its 
high-gain antenna was off Earth point. 

The spacecraft’s on-board transmitter typically 
provides a signal in the tens of watts. The final stage 
of the transmitter is a power amplifier which may be 
a traveling-wave tube or a solid-state amplifier. The 
carrier signal is modulated with a subcarrier which 
contains the measured scientific or engineering data. 
Ground controllers frequently make use of the tech- 
nique of Doppler tracking, wherein slight changes 
in the frequency of the probe’s signal are measured 
to reveal subtle acceleration changes by the space- 
craft. In some cases, spacecraft may be commanded 
to transmit a pure carrier signal for purposes such 
as Doppler tracking; this was used with great suc- 
cess in the Magellan mission to map Venus’s grav- 
ity field by measuring slight acceleration changes in 
the spacecraft as it passed over various portions of 
the planet. See DOPPLER EFFECT; MICROWAVE SOLID- 


STATE DEVICES; POWER AMPLIFIER; RADIO TRANSMIT- 
TER; TRAVELING-WAVE TUBE. 

The spacecraft’s receiver is designed to be sen- 
sitive to a narrow range of frequencies within a 
very small percentage of a specified frequency. Many 
spacecraft receivers incorporate phase-lock-loop cir- 
cuitry, which locks on to a signal detected within the 
specified bandwidth and follows any slight changes 
in frequency, particularly the Doppler shift produced 
by the motion of the probe relative to Earth. Al- 
though this design has worked well for most mis- 
sions, Voyager 2 presented special challenges be- 
cause one of its redundant receivers failed midway 
through the mission, and the second receiver’s fre- 
quency varied depending on the temperature of the 
on-board electronics. Ground controllers thus had to 
calculate the expected frequency to which the on- 
board receiver would be sensitive and had to tailor 
transmissions of spacecraft commands accordingly. 
See PHASE-LOCKED LOOPS; RADIO RECEIVER. 

In most modern cases, the transmitter and receiver 
may be combined into a single unit called a transpon- 
der. Some components of the telecommunications 
subsystem may be used for purposes other than com- 
munications; there may in fact be a radio-science 
team of investigators who use the on-board radio as a 
scientific instrument. As noted above, Magellan mea- 
sured Venus’s gravity by means of Doppler tracking 
of the radio carrier signal. On many missions such as 
Voyager, the radio signal is tracked as the spacecraft 
passes behind a planet, revealing information on the 
chemical makeup of the planet’s atmosphere. Mag- 
ellan’s main scientific instrument, its imaging radar, 
used the spacecraft’s high-gain antenna to send and 
receive radar pulses to and from the planet’s surface. 
See RADAR; RADAR ASTRONOMY; SPACE COMMUNICA- 
TIONS. 

Attitude-control subsystem. It would be impossible 
to navigate the spacecraft successfully or point its sci- 
entific instruments or antennas without closely con- 
trolling its orientation in space, or attitude. Many 
specialized techniques have been developed to ac- 
complish this. 

Some spacecraft, particularly earlier ones, have 
been spin-stabilized; during or shortly after launch, 
the spacecraft is set spinning at a rate on the order of 
a few revolutions per minute. Much like a rotating toy 
top, the spacecraft’s orientation is stabilized by the 
gyroscopic action of its spinning mass. Thruster jets 
may be fired to change the spacecraft’s attitude. Spin 
stabilization is preferred for scientific instruments 
that measure electromagnetic fields and charged par- 
ticles, because they are continuously swept in a cir- 
cle in such a way as to cancel out interference from 
the spacecraft itself. The stabilization method is not 
as good for scientific instruments that require point- 
ing, such as optical cameras. The Pioneer 10 and 11 
spacecraft were spin-stabilized, capturing photolike 
images with sweep polarimeters. Ulysses exclusively 
carries fields and particles instruments, and thus is 
also spin-stabilized. 

Most planetary spacecraft, however, are three-axis- 
stabilized, meaning that their attitude is fixed in 


relation to space. This method makes the operation 
of pointed instruments much simpler, although it is 
not as preferable for fields and particles instruments. 
The spacecraft’s attitude is maintained and changed 
via on-board thruster jets or reaction wheels, or a 
combination of both. Reaction wheels are electri- 
cally powered devices similar to gyroscopes which 
rotate the spacecraft about a given axis by spinning 
up or slowing down a massive wheel. Although most 
planetary spacecraft are three-axis-stabilized, Galileo 
was the only spacecraft to use a complex dual-spin 
design. Most of the Galileo spacecraft spins at three 
revolutions per minute, while a despun segment car- 
rying the camera and other pointed instruments is 
held in a three-axis-stabilized attitude. The two seg- 
ments of the spacecraft are joined by a complex spin 
bearing that must transfer hundreds of electrical sig- 
nals. See GYROSCOPE. 

Other parts of the attitude-control subsystem in- 
clude Sun sensors and star trackers which are used 
to establish points of reference in space. The subsys- 
tem’s functions are usually overseen by a dedicated 
computer system distinct from the spacecraft’s main 
command and data computer. Because spacecraft 
must operate at great distances from Earth where ra- 
dioed commands would take many minutes or even 
hours to arrive, the attitude-control subsystem is usu- 
ally one of the most autonomous parts of the space- 
craft, maintaining the spacecraft’s orientation with 
only infrequent intervention from the ground. See 
STAR TRACKER. 

Propulsion subsystem. Most spacecraft are outfit- 
ted with a series of thruster jets, each of which 
produces approximately 0.2-2 pounds-force (1- 
10 newtons) of thrust. The thrusters are placed 
so that at least two lie in opposite directions on 
each of the spacecraft’s three orthogonal axes. Ad- 
ditional thrusters are often added to lend redun- 
dancy. Thrusters are usually fueled with a monopro- 
pellant, hydrazine, which decomposes explosively 
when it contacts an electrically heated metallic cat- 
alyst within the thruster. The system also includes 
helium used to pressurize the propellant. 

In addition to maintaining the spacecraft’s atti- 
tude, on-board thrusters are used for trajectory- 
correction maneuvers. These maneuvers are usually 
carried out at several points during interplanetary 
cruise to progressively fine-tune the spacecraft’s aim 
point at its destination. Maneuvers are conducted by 
firing one or more thrusters for periods from several 
seconds to several minutes, accelerating or deceler- 
ating the spacecraft typically on the order of several 
meters per second. 

Spacecraft designed to orbit a planet or similar tar- 
get body must carry a larger propulsion element ca- 
pable of decelerating the spacecraft into orbit upon 
arrival. Magellan used a solid-fuel motor to insert 
it into orbit at Venus, whereas spacecraft such as 
Galileo and Cassini use bipropellant hypergolic sub- 
stances, monomethylhydrazine and nitrogen tetrox- 
ide, which expand rapidly in a flameless combus- 
tion when mixed. The main engine on Galileo and 
Cassini used to place the spacecraft in planetary 


orbit provided 90 and 110 Ibf (400 and 490 N) of 
thrust, respectively. Mars orbiters have taken ad- 
vantage of different propulsion approaches. Mars 
Global Surveyor and Mars Odyssey each used a large 
main engine [135 and 154 lbf (600 and 685 N), re- 
spectively] to enter orbit at Mars. Mars Reconnais- 
sance Orbiter, by contrast, braked into orbit by firing 
a group of six thrusters supplying 38 Ibf (170 N) of 
thurst each. See SPACECRAFT PROPULSION. 

Thermal control subsystem. The environment of 
space is very harsh, ranging from temperatures near 
absolute zero in the outer solar system to the ex- 
tremely hot locales of inner planets such as Venus and 
Mercury. In order to minimize the impact of temper- 
ature variations on the electronics onboard, space- 
craft nearly always incorporate some form of thermal 
control. Mechanical louvers, controlled by bimetal- 
lic strips similar to those in terrestrial thermostats, 
are often used to selectively radiate heat from the 
interior of the spacecraft into space. Other thermal 
strategies include painting exterior surfaces; Magel- 
Jan used white thermal blankets to reflect heat and 
keep interior units cooler. Magellan also carried op- 
tical solar reflectors composed of quartz mirror tiles 
on a number of surfaces, including the back sides of 
its solar arrays. Internal components are frequently 
painted black to encourage heat exchange between 
these components. 

In some cases, spacecraft may also carry one or 
more active forms of heating to maintain tempera- 
ture at required minimums. Often this is an electric 
unit that radiates heat from a resistive element similar 
to electric heaters on Earth; it may be controlled au- 
tonomously by on-board logic or commanded from 
the ground. Some spacecraft carry several radioiso- 
tope heating units (RHUs), each containing a small 
amount of plutonium which heats nearby spacecraft 
components. Because of their power needs, active 
cooling systems are not generally feasible on plane- 
tary spacecraft. 

Command and data subsystem. This designation is 
given to the main computer that oversees manage- 
ment of spacecraft functions and handling of col- 
lected data. Blocks of commands transmitted from 
Earth are stored in memory in the command and 
data subsystem and are executed at prescribed times. 
This subsystem also contains the spacecraft clock in 
order to accurately pace its activities, as well as all 
the activities of the spacecraft. 

In its data-handling function, the subsystem 
receives data destined for the ground on the space- 
craft’s data bus from other on-board subsystems, pro- 
cesses them, formats them for telemetry, and deliv- 
ers them to the telecommunications subsystem for 
transmission to Earth. Included are data from sci- 
entific instruments as well as housekeeping engi- 
neering data such as voltage and temperature read- 
ings. Data are usually stored on tape recorders on 
board while awaiting transmission to Earth. Since 
the 1990s, solid-state data recorders have taken 
the place of traditional tape-driven devices for this 
purpose. Most spacecraft encode data to be sent 
of Earth in order to correct transmission errors; 
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Viterbi, Golay, and Reed-Solomon encoding schemes 
have been used, and Turbo coding is now the most 
common. Some spacecraft also perform data com- 
pression to reduce the number of bits to be trans- 
mitted. This technique was used extensively on 
Galileo to compensate partially for the fact that 
the spacecraft’s high-gain antenna did not deploy 
completely. See DATA COMPRESSION; INFORMATION 
THEORY. 

Another critical function of the command and 
data subsystem is fault protection for the spacecraft 
in the event of a mishap. One of the chief strate- 
gies designed to respond to a contingency is safing, 
which shuts down or reconfigures subsystems to pre- 
vent damage either from within or from the external 
environment. After carrying out these activities, 
the fault-protection software usually commands the 
spacecraft to begin an automated search to re- 
gain Sun reference and reestablish communications. 
Spacecraft safing events that have taken place in 
actual flight missions have usually been caused by 
single-event upsets, probably resulting from a cos- 
mic ray interfering with on-board electronics or 
from transient electrical phenomena from on-board 
sources. Spacecraft are usually recovered after safing 
events, although one of the Phobos probes was be- 
lieved to be lost because of a faulty software com- 
mand. Another example of on-board fault protec- 
tion is the command-loss timer, which counts how 
much time has passed since a command has been re- 
ceived from Earth. If this count passes a designated 
value, it is assumed that a failure has occurred in 
the telecommunications or command subsystems. 
The spacecraft then switches to backup systems and 
takes other actions in an attempt to reestablish reli- 
able contact. 

Structure subsystem. The spacecraft’s physical 
structure is considered a subsystem itself for the 
purposes of planning and design. Usually the heart 
of this structure is a spacecraft bus, often consist- 
ing of a number of bays, which houses the space- 
craft’s main subsystems. Besides providing physi- 
cal support, the bus generally offers an electrical 
grounding reference, radio-frequency interference 
shielding, and protection from radiation and mete- 
oroids. The bus also provides attachment points for 
appendages such as booms and scanning platforms 
bearing scientific instruments. Magnetometers, for 
example, are usually mounted on a boom which may 
extend 30 ft (10 m) or more away from the spacecraft 
bus. For launch, this boom collapses and is stowed 
in a compact canister. The spacecraft bus is almost 
always covered with one or more layers of thermal 
blankets to help stabilize on-board temperature and 
to shield the spacecraft from micrometeoroid im- 
pacts. Special shielding may be added to protect the 
spacecraft from radiation in environments such as 
that at Jupiter, where ionized particles may degrade 
on-board subsystems, or from temperature extremes 
at planets such as Venus or Mercury. Spacecraft de- 
signed as landers usually also carry aeroshells and 
parachutes to aid during descent. See ATMOSPHERIC 
ENTRY; SPACECRAFT STRUCTURE. 


Miscellaneous subsystems. Among other miscella- 
neous spacecraft subsystems, the pyrotechnics sub- 
system includes electrically initiated devices, called 
squibs, that are used to deploy components such as 
booms, explode bolts that separate or jettison hard- 
ware such as descent probes or launch adapters, ig- 
nite solid rocket motors, and operate some types of 
valves. 

Redundancy. As noted above, redundancy tradi- 
tionally has been a prominent feature of planetary 
spacecraft, which typically carry two transmitters, 
two receivers, two command computers, and so on. 
In the event of a mishap, the spacecraft can switch 
to a backup system and carry on; this has happened 
many times. In an effort to reduce spacecraft mass 
and complexity, some designers have considered 
dropping redundancy and allowing spacecraft to op- 
erate on a single string of components. Although this 
may save mass, it also usually means a spacecraft can- 
not function if any of its principal subsystems fails. 


Mission Stages 


The process of conceiving, designing, building, 
launching, and operating a spacecraft mission takes 
many years. Voyager, for example, was conceived 
during the late 1960s and given formal project 
approval—called a “new start”—in 1972 by the Na- 
tional Aeronautics and Space Administration (NASA). 
Five years later, the twin Voyager 1 and 2 were 
launched, and executed their planetary flybys be- 
tween 1979 and 1989. In the 1990s and the first 
decade of the twenty-first century, they continued 
to return data as they left the solar system. Efforts 
have been made to simplify missions and to shorten 
the lead time required to design and launch space- 
craft, but prelaunch activities still involve a minimum 
of 2-3 years even for modest missions. 

In the United States, the programmatic steps lead- 
ing to preparation of a mission are commonly desig- 
nated by a series of terms. A conceptual study is con- 
ducted, which outlines the general scheme of a mis- 
sion concept. If approved, this proceeds to a phase 
A study, also called a preliminary analysis, which cre- 
ates a tentative design and project plan outlining 
the mission. This specifics what to build, when to 
launch, what course the spacecraft is to take, what 
is to be done during cruise, when the spacecraft will 
arrive at its destination, and what operations will be 
carried out. The project then moves into phase B, 
also called the definition phase, in which more for- 
mal specifications and requirements are determined. 
If the project wins formal approval, it progresses 
to phase C/D, or design and development, where 
the spacecraft is designed, assembled, tested, and 
launched. The process concludes with the mission’s 
operations phase, phase E, including its interplan- 
etary cruise and eventual flyby, orbit insertion, or 
landing. If the spacecraft is healthy at the conclusion 
of its specified mission period, it may be approved 
to continue into an extended mission. 

Throughout the design process, mission plan- 
ners must be sensitive to various constraints. These 
include the project’s overall acceptable budget, 


which will influence many decisions on spacecraft 
and mission complexity. Another constraint is the 
availability of launch vehicles, which in turn will 
dictate the maximum spacecraft mass and feasible 
trajectories. The launch period during which plan- 
etary geometry is appropriate to carry out the mis- 
sion must also be taken into account. The launch 
period for a transfer orbit to Mars, for example, oc- 
curs once every 26 months. Contention for available 
resources must also be considered, such as tracking 
time on the giant dish antennas of the U.S. Deep 
Space Network used to communicate with deep- 
space probes. See SPACECRAFT GROUND INSTRUMEN- 
TATION. 

Launch. Virtually all planetary launches in the 
United States take place at the Cape Canaveral Air 
Force Station in Florida, which is situated relatively 
close to the Equator with an eastward launch range 
extending over the Atlantic Ocean. Launches con- 
ducted by Russia take place at the Baikonur Cosmo- 
drome in Kazahkstan. In the 1980s, the United States 
and the Soviet Union were joined by Japan and the 
European Space Agency in launching solar system 
probes. Japan’s launch facility is at the Uchinoura 
Space Center, near the southwest tip of the island 
of Kyushu. European launches take place at the Cen- 
ter Spatial Guyanais at Kourou, French Guiana, on 
the northeast coast of South America. See LAUNCH 
COMPLEX. 

United States space probes traditionally have been 
launched on such vehicles as the Atlas, Delta, and 
Titan rockets. In 1989 and 1990, the Magellan, 
Galileo, and Ulysses spacecraft were launched from 
the payload bay of the United States’ space shut- 
tle, using specially adapted solid-fuel upper-stage 
engines to inject them on their interplanetary trajec- 
tories. The Russian launch fleet includes the liquid- 
fuel Proton. Japan has launched planetary missions 
on its solid-fueled Mu-5, while Europe has devel- 
oped the Ariane 5. See ROCKET PROPULSION; SPACE 
SHUTTLE. 

After a spacecraft arrives as the launch site, it is 
generally tested to ensure that all systems are work- 
ing normally after being transported from the facility 
where it was assembled. The probe is then mated to 
its upper-stage engine, and the combined stack is 
placed atop the launch vehicle. The probe’s propul- 
sion tanks are filled with fuel and pressurant. After 
launch, the main rocket usually places the probe and 
upper stage in a relatively low orbit around Earth. 
The upper stage then fires to send the probe on its 
interplanetary trajectory. At that point, pyrotechnic 
squibs may be fired to deploy on-board appendages, 
and the entire spacecraft goes through a checkout 
phase. 

Cruise. Many activities take place while the space- 
craft is en route to its destination planet or other 
target. Some scientific instruments, such as those 
studying fields and particles, may be fully powered to 
collect data during the interplanetary transit. Other 
instruments used only at the target body may be 
switched off. It is usually necessary to conduct one 
or more trajectory correction maneuvers in which 


on-board thrusters are fired to fine-tune the space- 
craft’s flight path. 

Although it is common to think of a spacecraft tra- 
jectory as a simple path from Earth to the target body, 
the fact that Earth and all other bodies are moving 
around the Sun makes trajectory design more com- 
plex. To travel from Earth outward to Mars, for exam- 
ple, using the least energy possible, mission planners 
usually take advantage of a Hohmann transfer orbit. 
This involves adjusting the existing orbit around the 
Sun so that it is more elliptical. Under the new orbit, 
the spacecraft’s perihelion, or closest approach to 
the Sun, will coincide with the orbit of Earth, while 
its aphelion, or farthest point from the Sun, will co- 
incide with the orbit of Mars. In order to arrive at 
a time when Mars is at the same point as the space- 
craft, it must be launched during an appropriate time 
window. 

Beginning in the 1970s, mission planners took ad- 
vantage of a technique called gravity assist trajec- 
tories to send spacecraft on to more than a single 
planetary target. In this technique, the spacecraft 
passes within a few hundred miles of a given planet; 
this accelerates (or decelerates) the spacecraft and 
changes its trajectory. (Because no energy is ever cre- 
ated or lost, the encounter also results in the planet 
being slowed, or speeded, by an immeasurably small 
amount in its orbit around the Sun.) Gravity assist 
was first used by Mariner 10 in 1973, when the 
spacecraft used the gravity of Venus to send it on to 
three flybys of Mercury in 1974 and 1975. The tech- 
nique was then used very extensively by Voyager 
to send spacecraft past Jupiter, Saturn, Uranus, and 
Neptune. In the 1990s, gravity assist became useful 
to send heavier spacecraft to destinations that would 
not be possible via direct trajectories by using avail- 
able launch vehicles. Galileo, for example, flew by 
Venus once and Earth twice on its way to Jupiter. See 
SPACE NAVIGATION AND GUIDANCE. 

Arrival. Activities at the target body vary, depend- 
ing on whether the spacecraft executes a flyby, goes 
into orbit, or lands on the body’s surface. Flyby activi- 
ties are generally divided into a far-encounter period, 
which may extend for several weeks, and a close en- 
counter, which includes the most hectic days directly 
surrounding the spacecraft’s closest approach to its 
target body. 

Spacecraft designed to go into orbit around the 
target body must fire a large engine to brake their 
velocity sufficiently so that the target body’s gravity 
can capture them. In a typical mission scenario, the 
spacecraft may enter an initial capture orbit which 
may be highly elliptical; over a period of several 
weeks, thruster firings will be used to circularize the 
orbit so that the spacecraft can carry out mapping 
operations. In addition, it has become routine for 
Mars orbiters to circularize their orbits after arriv- 
ing at the planet by using the drag of Mars’s atmo- 
sphere to reduce the orbit’s apoapsis, or high point, 
over time. This technique is called aerobraking. Most 
spacecraft are placed into a polar or fairly highly in- 
clined orbit at a planet to maximize the amount of 
surface that can be studied. Magellan was placed in 
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a near-polar orbit at Venus that caused the planet to 
rotate once beneath it every 243 days. In the Mars 
Reconnaissance Orbiter mission, the spacecraft was 
placed in a Sun-synchronous orbit that took it over 
regions uniformly at about 3 p.m. Mars local time. In 
this orbit type, the spacecraft’s orbit plane precesses 
with nearly the same period as the planet’s solar orbit 
period. This is useful for many scientific instruments 
because it makes the incidence angle of sunlight on 
surface features uniform from orbit to orbit. 

Landing a spacecraft on a planet requires even 
greater deceleration than in the case of orbit inser- 
tion. Spacecraft approaching planets with an atmo- 
sphere, such as Mars, usually take advantage of the 
technique of aerobraking, or using friction with the 
atmosphere, to slow the spacecraft. An aeroshell is 
usually required to protect the spacecraft from the 
effects of heating. After jettisoning the aeroshell, 
the spacecraft may use a propulsion system to 
achieve a soft landing, perhaps augmented by a 
parachute system. Although the Soviet Union’s Luna 
spacecraft returned soil samples from the Moon dur- 
ing the 1970s, sample return missions have not yet 
been mounted to other planets, although samples 
of cometary dust and solar and interstellar particles 
have been brought to Earth. Japan’s Hayabusa space- 
craft apparently failed to collect a sample when it 
landed on the asteroid Itokawa in 2005, but proba- 
bly obtained some dust from the asteroid that will be 
carried to Earth in 2010. 


Significant Missions 


Soon after the Soviet Union launched the Earth- 
orbiting Sputnik in 1957 and the United States re- 
sponded with Explorer 1 in early 1958, both coun- 
tries turned to developing probes that would escape 





Fig. 1. Model of Luna 1 (Mechta). Long metal strips are 
antennas, the tapered post is the magnetometer boom, and 
raised windowlike apertures are ion detectors. 
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Fig. 2. Full-scale model of the Pioneer 3 and 4 space 
probes, showing the antenna, the paint stripes for thermal 
control, and the despin device about the major diameter. 
(JPL/NASA) 


Earth orbit and travel to the Moon and beyond. 
Luna. The first successful mission to escape Earth 
orbit was the Soviet Union’s Luna 1 (Fig. 1), which 
executed a 3728-mi (5275-km) lunar flyby after 
launch on January 2, 1959. Later that year, the So- 
viet Union launched Luna 2, which made the first 
lunar impact, and Luna 3, which studied the far side 
of the Moon. Luna 4, launched in 1963, missed the 
Moon by 5282 mi (8500 km). The Soviets had a string 
of failures in 1965 in attempting a lunar soft land- 
ing; Luna 5 failed when the spacecraft made a hard 
impact on the Moon, Luna 6 missed the Moon by 
100,000 mi (160,000 km), and Luna 7 and 8 failed 
because retrorockets fired too early or late. Luna 9 
made the first lunar soft landing on February 2, 1966, 
and returned photographs from the Moon’s surface. 
Luna 10, 11, 12, 14, 15, and 19 were lunar or- 
biters, while Luna 13 was a soft lander. In 1970 the 
Soviet Union launched Luna 16, which carried out 
that country’s first successful lunar sample return. 
Later the same year, Luna 17 made a soft landing 
on the Moon with a robotic rover vehicle. Luna 18 
was intended as a sample-return mission but failed 
during lunar descent. Luna 20 in 1972 executed a 
successful sample return. The series continued with 
Luna 21, a \ander and rover, in 1973, and Luna 22, 
an orbiter, in 1974; and concluded with Luna 23 
and 24, sample return missions conducted in 1974 
and 1976 (Luna 24 was successful, while Luna 23 
was not). Much information about the missions was 


withheld by the Soviet government for many years, 
but some details have become available since the 
dissolution of the Soviet Union in the early 1990s. 
See MOON. 

Pioneer. This name was given to a family of largely 
dissimilar United States spacecraft used mainly for 
interplanetary physics, although some later missions 
in the series achieved success as planetary probes. 
Pioneer 1, 2, and 3 were failed lunar probes in 1958, 
although Pioneer J and 3 returned data on the en- 
vironment of space before reentering Earth’s atmo- 
sphere. Pioneer 4 (Fig. 2) passed within 37,300 mi 
(60,000 km) of the Moon in 1959, followed by Pio- 
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neer 5 in 1960, which relayed solar system data to a 
distance of 22.5 x 10° mi (36.2 x 10° km). Follow- 
ing a hiatus of 5 years, Pioneer 6 studied the space 
environment in solar orbit, as did Pioneer 7 in 1966, 
Pioneer 8 in 1967, and Pioneer 9 in 1968. 

The series took a very different direction with Pi- 
oneer 10 and 11, the first space probes sent into the 
outer solar system in 1972 and 1973, respectively. 
Pioneer 10 executed a flyby of Jupiter in December 
1973, while Pioneer 11 flew by Jupiter in Decem- 
ber 1974 and Saturn in 1979. In 1978 the series took 
yet another departure with Pioneer Venus (Fig. 3), 
which used two launches to send an orbiter and 
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Fig. 3. Expanded views of the structure of (a) the large probe and (b) one of the smaller probes launched into the Venus 
atmosphere from the Pioneer Venus bus. (After L. J. Nolte and S. C. Sommer, Probing a Planetary Atmosphere: 


Pioneer-Venus Spacecraft Description, 1975) 
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Fig. 4. Partial schematic of Surveyor to show part of the propulsion system associated 
with the vernier engines which provided midcourse trim and attitude control during firing 
of the large retrorocket, and final velocity removal near the lunar surface. (JPL/NASA) 


multiple atmospheric probes to Venus. See JUPITER; 
SATURN. 

Ranger. In the early 1960s, the United States car- 
ried out this series of missions to relay photographs 
of the Moon as the spacecraft descended to a crash 
landing. Ranger I and 2 ended in launch vehicle 
failures; Ranger 3 missed the Moon by 22,862 mi 
(36,793 km); Ranger 4 impacted the Moon but ex- 
periments proved inoperative when a timer failed; 
Ranger 5 missed the Moon by 450 mi (725 km); and 
Ranger 6 impacted the Moon but its television cam- 
era failed. Ranger 7 was the first success, returning 
4308 photos during its lunar descent in July 1964. 
In 1965 Ranger 8 and 9 sent back 7137 and 5814 
photos, respectively. 

Surveyor. These United States missions (Fig. 4) 
soft-landed robot spacecraft on the Moon as pre- 
cursors to the crewed Apollo missions. Surveyor 1 
landed on the Moon on June 2, 1966, and returned 
11,150 photos as it operated for 6 weeks. Surveyor 
2 suffered from a vernier engine failure and crashed 
into the lunar surface. Surveyor 3 carried a soil sam- 
pler for its soft landing in 1967. Surveyor 4 touched 
down on the Moon, but its signal was lost 2!/. min 
later. Surveyor 5 soft-landed in 1967, and relayed 
19,000 photos and conducted soil analyses. Surveyor 
6 executed the first takeoff from the Moon during its 


1967 mission. Surveyor 7, which soft-landed in 1968, 
was the last in the series. 

Lunar Orbiter. These spacecraft launched during 
the same era as the Surveyors were the first attempt 
to make detailed maps of the Moon. Lunar Orbiter 
1 was launched August 10, 1966, and relayed pho- 
tos for the next 20 days. Lunar Orbiter 2 sent 205 
photographic frames later that year; Lunar Orbiter 3 
sent 182 photos in 1967, followed by Lunar Orbiter 
4and 5. 

Mariner. This designation was given to the United 
States’ early planetary probes, which chiefly studied 
Venus and Mars, although the final in the series also 
visited Mercury. Mariner 1 was lost in 1962 after a 
typographical error in a software program caused 
the launch vehicle to veer off course. Mariner 2 
became the first spacecraft to encounter another 
planet when it flew by Venus at a distance of 21,594 
mi (34,752 km) later that year. It continued to relay 
data to a distance of 53.9 x 10° mi (86.7 x 10° km). 
Mariner 3 and 4 (Fig. 5a) targeted Mars; the first 
experienced a shroud failure, while the second 
became the first spacecraft to encounter Mars in 
1964. Mariner 5 (Fig. 5b) executed a 2480-mi 
(3991-km) Venus flyby in 1967. Mariner 6 and 7 
(Fig. 5c) again targeted Mars, sending back 75 and 
126 television pictures, respectively, during flybys 
at a distance of about 2150 mi (450 km) in 1969. 
Mariner 9 was a Mars orbiter, sending back detailed 
pictures after its launch in 1971. Mariner 10 was 
the first spacecraft to use a gravity-assist trajectory, 
encountering Venus before going on to three flybys 
of Mercury in 1974 and 1975. The United States then 
planned a mission with the working title Mariner 
JupiterSaturn, but the name was changed to 
Voyager before launch. See MARS; MERCURY 
(PLANET); VENUS. 

Venera, Mars, and Zond. These missions were the 
Soviet Union’s equivalent of the Mariners. The So- 
viets had the greatest success with Venera, which 
lofted a number of spacecraft to Venus over more 
than two decades. Although the first three Venera 
failed in 1961 and 1965, Venera 4 transmitted during 
its 94-min descent, though it failed at a considerable 
altitude above to Venus’s extremely hot surface. The 
series continued with Venera 5 and 6 in 1969. Ven- 
era 7 made the first soft-landing of Venus in 1970 
and transmitted for 23 min, which was remarkable 
considering the planet’s 900°F (500°C) surface tem- 
perature. Venera 8 made another successful landing 
in 1972. In 1975 the Soviet Union launched Venera 9 
and 10, which combined Venus orbiters and landers; 
Venera 9 returned the first pictures from the surface 
of another planet. In 1978 and 1981 the landers Ven- 
era 11, 12, 13, and 14 followed. Venera 15 and 16, 
launched in 1983, were orbiters that used synthetic- 
aperture radar to pierce Venus’s opaque atmosphere 
and map the planet’s surface. 

The Soviets’ Mars program achieved more mixed 
results. After the failure of Mars / in 1962, the So- 
viets attempted to make soft landings in 1971 with 
Mars 2 and 3. The first failed when it evidently im- 
pacted the planet’s surface with excessive force; the 


second touched down but failed 110 s later, having 
transmitted only a partial photograph that revealed 
no detail. In 1973 the Soviets made another attempt 
with a four-spacecraft armada, the orbiters Mars 4 
and 5 and the landers Mars 6 and 7. Mars 4 and 
7 missed the planet entirely, while Mars 6 stopped 
transmitting during its descent. Mars 5 entered orbit, 
but communications proved to be erratic. 

Zond was the designation given to several Soviet 
missions with various targets from 1964 through 
1970. Zond 1 and 2 were failed Venus and Mars 
probes, respectively, while Zond 3 returned photos 
from a lunar flyby in 1965. Zond 5 flew around the 
Moon and returned to Earth, splashing down in the 
Indian Ocean in 1968. In addition to other objec- 
tives, this and later Zond missions carried tortoises 
and other organisms for biological research. Zond 6 
flew around the Moon before returning to crash in 
the Soviet Union. Zond 7 also was quite successful. 
Zond 8 performed another lunar return in 1970. 

Viking. One of the most ambitious planetary explo- 
rations ever mounted was the United States’ Viking 
program. Two launches of combined orbiter-landers, 
Viking 1 and 2, took place in August and September 
1975, with arrival at Mars in June and August 1976. 
Viking’s chief purpose was the search for biological 
activity on the Martian surface. Both the orbiters and 
landers also carried out extensive imaging; the or- 
biters reconnoitered landing sites in preparation for 
descent of the landers and also carried out a global 
mapping program. The landers (Fig. 6) sent back the 
first extensive detailed pictures from the surface of 
another planet. 

The spacecraft failed to detect any sign of life on 
the planet. Many scientists, however, later concluded 
that Viking’s search methodology was very narrow, 
and began to devise other ways to search for past or 
present life on Mars. The Viking orbiters revealed a 
landscape that appeared to have been heavily refig- 
ured by liquid water, but it appeared for many years 
that none exists currently on the frigid planet. In 
2006, images taken by Mars Global Surveyor pro- 
vided evidence suggesting that on current-day Mars 
water has broken out of cliffs or crater walls and run 
downhill before evaporating. Mar’s polar caps have 
been found to be a mixture of carbon dioxide ice 
and water ice. 

Helios. This is a German-built and United States- 
launched solar system probe which because of its 
close approach to the Sun (0.29 AU) is called a solar 
probe. Helios 1 was launched in 1974, followed by 
Helios 2 in 1976. The experiments, directed toward 
phenomena taking place in the solar wind, were 
chiefly magnetometers and particle detectors. See 
SOLAR WIND. 

Voyager. In the late 1960s, mission designers 
began to contemplate a coming opportunity that 
would not be repeated for another 177 years. In the 
1970s it would be possible to launch a spacecraft that 
could sequentially visit all four giant gaseous outer 
planets—Jupiter, Saturn, Uranus, and Neptune—by 
using the technique of gravity assist to send the probe 
on to its next destination. 
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Fig. 7. Voyager spacecraft showing primary subsystems. 1 m = 3.3 ft. (NASA) 


Efforts culminated with the launch in 1977 of Voy- 
ager I and 2 (Fig. 7), which began a decade-long 
“grand tour” of the outer solar system. Although Voy- 
ager 2 was launched first, Voyager 1 took a shorter 
trajectory to Jupiter and arrived in 1979; it went on 
to encounter Saturn in 1980. Voyager 1’s trajectory 
at Saturn was optimized to study the planet’s major 
satellite, Titan, which was found to contain complex 
organic chemistry in its opaque atmosphere. This 
flight path precluded the spacecraft from later plan- 
etary encounters. Voyager 2 visited Jupiter in 1979 
and Saturn in 1981, then continued on to flybys of 
Uranus in 1986 and Neptune in 1989. During the 
1990s and the first decade of the twenty-first cen- 
tury, both spacecraft continued to relay back fields 
and particles data as they exited the solar system. 
See NEPTUNE; URANUS. 

Comet missions. The arrival of Comet Halley to the 
inner solar system in the mid-1980s prompted a great 
deal of international interest in sending spacecraft to 
meet it. The Soviet Union and Japan each dispatched 
two spacecraft, and the European Space Agency a 
single craft. The efforts were notable for a high de- 
gree of international cooperation, including the use 
of data obtained by the NASA Deep Space Network 
from the early-arriving Soviet craft to target more pre- 
cisely the flyby of the European craft. See COMET; 
HALLEY’S COMET. 

Plans originally called for a United States space- 
craft to join the Halley armada, but that mission was 
the victim of a budgetary cancellation. United States 
mission planners developed an ingenious scheme 
to redirect the International Sun-Earth Explorer 
3 USEE 3) satellite to conduct a flyby of the tail 
of Comet Giacobini-Zinner on September 11, 1985, 
about 6 months before the Comet Halley flybys. 

International Comet Explorer (ICE). This was the new name 
given the ISEE 3 (Fig. 8) when it was redirected 
to make its comet visit. The craft was originally 
placed in a halo orbit, a conditionally stable circu- 
lar orbit with its center at the Lagrange point up- 
stream (toward the Sun) from the Earth-Moon sys- 
tem. In 1982 and 1983 a series of maneuvers in the 
Earth-Moon system culminated in a final close flyby 
of the Moon that sent the craft on a new trajectory to 
fly through the tail of Comet Giacobini-Zinner, some 
6000 mi (10,000 km) from the comet’s nucleus. The 
craft’s payload was a half-dozen instruments prob- 
ing plasma electrons, plasma ions, energetic protons, 
magnetic fields, plasma waves, and radio waves. See 
CELESTIAL MECHANICS. 

Suisei and Sakigake. These two spacecraft repre- 
sented the first solar system exploration mission 
launched by the Japanese Institute of Space and As- 
tronautical Science (ISAS). Sakigake was launched in 
January 1985 and flew within a few million miles of 
Comet Halley on March 11, 1986. The spin-stabilized 
craft carried a solar-wind ion detector, a magnetome- 
ter, and a plasma-wave probe. 

The Suisei craft (Fig. 9), also called Planet-A, was 
launched in August 1985 and approached within 
about 90,000 mi (150,000 km) of Comet Halley on 
March 9, 1986. Like its sibling craft, Suisei was spin- 
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Fig. 8. International Sun-Earth Explorer 3 (ISEE 3), the last satellite of a coordinated 
series of three designed to study the Earth’s magnetosphere. This satellite was redirected 
toward Comet Giacobini-Zinner by a lunar encounter, and was renamed International 


Cometary Explorer (ICE). (ISEE Science Working Team) 


stabilized, but it carried a different science payload: 
an ultraviolet camera and a solar-wind experiment. 

VEGA. This enterprising mission by the Soviet 
Union sent a pair of spacecraft on flybys of Venus, 
where each released a lander and an atmospheric 
balloon, before continuing on to encounter Comet 
Halley (Fig. 10). 

VEGA 1 and 2 were launched in December 1984 
and flew by Venus in June 1985. VEGA 1’s Halley en- 
counter took place March 6, 1986, while VEGA 2’s 
followed 3 days later. Each craft carried optical cam- 
eras, spectrometers, dust detectors, and fields and 
particles instruments. Unlike the Japanese and Euro- 
pean craft, the VEGAs were three-axis-stabilized. 

The VEGAs achieved a closer flyby distance from 
Halley than did the Japanese probes, encountering 
the comet at distances of 5523 and 4990 mi (8889 
and 8030 km). They were able to provide highly use- 
ful targeting data for the European probe, which was 
to make by far the closest pass by the comet. 

Giotto. This spacecraft (Fig. 11) was the European 
Space Agency’s representative in the Comet Halley 
fleet. At its closest approach, Giotto came within 
about 360 mi (600 km) of the comet. 

The spin-stabilized craft’s payload included a cam- 
era, spectrometers, a photopolarimeter, dust exper- 
iments, and fields and particles instruments. Launch 
took place in July 1985, with the Halley flyby on 
March 14, 1986. 

Because the environment near the comet’s nu- 
cleus was known to be very dusty, Giotto was de- 
signed to endure a high number of particle impacts. 
Although only 2-3% of the spacecraft’s solar-panel 
capacity was lost during the encounter, the mirror 
on the camera began to suffer degradation when 
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Fig. 9. Model of the Japanese comet probe Suisei 
(Planet-A). (a) Assembled spacecraft with the solar cells 
assembly, high-gain antenna, and ports for instruments. 

(b) Inside view of the spacecraft with instruments arranged 
cylindrically about the spin axis. The extensive foil is for 
thermal protection and heat control. (Institute of Space and 
Astronautical Science) 


Giotto was inbound at a distance of about 3000 mi 
(5000 km), and the last usable image was taken at 
a distance of about 1220 mi (1963 km). In addition, 
particle impacts caused the craft to wobble with a 
nutation of 0.8°, resulting in a loss of radio signal 14 s 
before closest approach. The wobble was overcome, 
however, about 32 min later, and Giotto remained a 
highly functional probe years after the encounter. 

Phobos. An ambitious project for the exploration 
of Mars and its natural satellite, Phobos was under- 
taken by the Soviet Union in the late 1980s. Unfor- 
tunately, the two spacecraft were not to fulfill com- 
pletely their planned activities. 

Phobos 1 and 2 were launched 5 days apart in 
July 1988. The two nearly identical craft were to ar- 
rive at Mars in January 1989 and begin orbiting the 
planet. A series of maneuvers would have taken the 
craft near Phobos, where they would approach for a 
surface-contour-following flyby about 150 ft (50 m) 
from the rocky object’s surface. The craft would 
also have deployed a 110-Ib (50-kg) long-duration 
lander, expected to operate for about a year, and 
a 112-Ib (51-kg) hopper, which would have moved 
about the surface of Phobos in 60-ft (20-m) leaps. 
Altogether, the craft included a total of 37 scientific 
experiments. 

A series of commands sent to Phobos 1 in August 
1988 apparently contained a coding error that in- 
structed the spacecraft to turn off its attitude-control 
system. The craft then began to tumble slowly; as its 
solar panels were no longer pointed toward the Sun, 
it lost power and all contact was permanently lost. 

Problems arose with Phobos 2 by the end of 1988 
when its television system malfunctioned en route 
to Mars. Those difficulties were resolved by ground 
command. More serious was the loss of the on-board 
high-power radio transmitter, which would have per- 
mitted a high rate of data transmission. It was still pos- 
sible, however, to use the slower low-power trans- 
mitter to send data to Earth. 

Phobos 2 survived the transit of Mars, entered 
orbit in late January 1989, and began its scientific 
surveys of the red planet. By late March, the craft 
was positioned near the satellite Phobos, capturing 
high-detail images from a distance of several hun- 
dred miles. On March 27, 1989, however, a failure in 
the attitude control system caused the craft to begin 
tumbling erratically in an orbit just outside that of 
Phobos. Ground controllers spent a week trying to 
resume communications before giving up the craft 
as lost. 

Magellan. In the late 1970s and the 1980s, im- 
provements were made in synthetic-aperture radar, a 
technology that uses radar pulses to capture images 
where optical systems are not practical. This technol- 
ogy is ideal for the study of the planet Venus, which 
is perpetually shrouded by a thick layer of opaque 
clouds. See SYNTHETIC APERTURE RADAR (SAR). 

The first craft to perform radar imaging at Venus, 
Pioneer Venus of the United States was followed by 
the Soviet Venera 15 and 16 probes, which were 
able to improve resolution by about a factor of 10. 
They were followed by the United States’ Magellan 
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Fig. 10. VEGA spacecraft in fully deployed view. 


mission, which in turn achieved a factor-of-10 im- 
provement over the resolution delivered by the So- 
viet missions. 

Launched in May 1989, Magellan (Fig. 12) was 
also the first of several solar system probes to 
be lofted by the United States space shuttle. The 
probe was carried into Earth orbit in the cargo bay 
of the shuttle and released. A solid-fuel upper-stage 
engine attached to the probe then injected it into its 
interplanetary trajectory. 

Magellan entered Venus orbit in August 1990, ini- 
tiating a highly successful 4-year study of the planet. 
Each 8 months, Venus rotated once underneath the 
spacecraft as it passed overhead in a polar orbit. 
Magellan spent several of these 8-month orbital cy- i 
cles producing radar maps of 98% of the Venusian solar ce m0 331333 
surface with a resolution of 1000 ft (300 m) before array ~~ mum 397933333333 I ~ upper platforrn 
compiling a high-resolution, comprehensive gravity 
field map for 95% of the planet. In May 1994, ground 
controllers experimented with aerobraking, making 
use of the planet’s atmosphere to lower the space- 
crafts orbit. With solar panels and other compo- 
nents gradually failing, mission controllers decided 
to plunge the spacecraft into Venus’s atmosphere to bumper shield 
study the craft’s behavior and atmospheric charac- \ 
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Fig. 12. Magellan spacecraft. (a) Undergoing testing. (b) Deployed, showing primary subsystems. (JPL/NASA) 


began to consider the next step in planetary 
exploration. Their plans called for a series of 
orbiters that would return to each of the planets vis- 
ited by flyby spacecraft to conduct detailed studies. 
The first such approved mission was Galileo, which 
would orbit the solar system’s largest planet, Jupiter. 

Galileo (Fig. 13) had a complex dual-spin design 
in which most of the spacecraft was spin-stabilized 
to optimize fields and particles experiments, while 
the platform that carried pointed instruments such 
as cameras was capable of being three-axis-stabilized. 
The orbiter also carried a descent probe designed to 
relay data as it dived into the giant planet’s atmo- 
sphere. 
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En route to Jupiter, Galileo performed a flyby of 
Venus and two flybys of Earth. It also performed the 
first-ever flyby of an asteroid when it encountered 
the rocky body Gaspra in October 1991. In August 
1993, Galileo flew by a second asteroid, Ida. When 
images were sent to Earth from the craft’s on-board 
tape recorder 6 months later, the first-ever natural 
satellite of an asteroid was discovered when the tiny 
moon Dactyl was found to be orbiting Ida. See ASTER- 
OID. 

The failure of Galileo’s umbrella-like high-gain an- 
tenna to open fully prompted an extensive redesign 
of planned observations at Jupiter. After its arrival in 
December 1995, the mission’s highest priority was 




















radio science) 
sensors 


energetic particles detector 
plasma science instruments 
dust detector 

retropropulsion module 

Above: Spun section 


Below: Despun section 


scan platform 


Jupiter a 

atmosphere radioisotope 

probe thermoelectric 
generator 


Fig. 13. Galileo spacecraft. (a) Being tested. (b) Deployed, showing primary subsystems. The scan platform contains the 
photopolarimeter radiometer, near-infrared mapping spectrometer, solid-state imaging camera, and ultraviolet spectrometer. 
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to relay data from the atmospheric probe during its 
descent. Galileo then began sending selected images 
and other scientific data that had been compressed 
in on-board computers to make the best use of the 
craft’s available low-gain antennas. 

Galileo made about 30 orbits of Jupiter in the first 
5 years after its arrival. After completing an initial 
2'/2-year prime mission, Galileo was approved for 
an extended mission focusing on the Jovian satellites 
Europa and Io. The mission was extended a second 
time to allow the spacecraft to focus on observa- 
tions of Jupiter’s largest satellite, Ganymede, and on 
collaborative studies with Cassini as that spacecraft 
passed Jupiter in late 2000 en route to Saturn. Galileo 
discovered strong evidence that Europa has a melted 
saltwater ocean under an ice layer on its surface. The 
spacecraft also found indications that Ganymede and 
another Jovian satellite, Callisto, have layers of liquid 
saltwater as well. Other major science results from 
the mission include details of varied and extensive 
volcanic processes on the moon Io, measurements of 
conditions within Jupiter’s atmosphere, and the dis- 
covery of a magnetic field generated by Ganymede. 

On September 21, 2003, Galileo was intentionally 
plunged into Jupiter’s atmosphere to avoid the risk 
of impacting and possibly contaminating Europa. 

Hiten and Hagoromo (Muses-A). After their success 
with its two Comet Halley craft, mission planners in 
Japan turned to other projects they could execute 
as the nation’s space program developed. Their next 


effort was Muses-A, a mission to study the Moon. 

An expendable rocket launch in January 1990 car- 
ried aloft two spacecraft, Hiten and Hagoromo. The 
latter, a basketball-sized package carrying an instru- 
ment payload, was to go into orbit around the Moon. 
The larger Hiten was to occupy a highly ellipti- 
cal orbit around Earth, ranging in distance between 
19,000 and 470,000 mi (30,000 and 760,000 km). 
This orbit would take Hiten out toward the Moon, 
where it would receive and relay to Earth signals from 
Hagoromo. As Hagoromo approached the Moon, 
however, its radio transmitter failed, ending all con- 
tact with the craft. 

Ulysses. By the 1980s, the Sun had been studied 
for many years by ground-based equipment. Missions 
such as the Orbiting Solar Observatory satellites and 
the crewed Skylab in the 1970s extended these ob- 
servations to Earth orbit. All observing equipment, 
however, was limited to the two-dimensional surface 
of the ecliptic, the plane in which most planets orbit 
the Sun. 

Ulysses was designed to extend that point of view 
by sending a spacecraft out of the ecliptic and into 
the unexplored regions above and below the Sun’s 
poles. After collaborative work between NASA and 
the European Space Agency and the budgetary can- 
cellation of a United States craft that was to have 
taken part, a project emerged wherein the European 
agency built and provided the single Ulysses space- 
craft (Fig. 14) and contributed a computer system 
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Fig. 14. Ulysses spacecraft, showing primary subsystems. The propellant storage tank is part of the reaction control 
equipment (for attitude and so forth), and the thruster cluster is essentially a set of small engine nozzles for this equipment. 
The preamplifier is part of the unified radio and plasma wave experiment. (European Space Agency) 
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to control the craft from the ground. NASA provided 
some instruments, the radioisotope power source, 
launch by a United States space shuttle, tracking via 
its global Deep Space Network, and ground facil- 
ities at the Jet Propulsion Laboratory in Pasadena, 
California. 

Launched in October 1990, Ulysses flew first to 
Jupiter, where the giant planet’s gravity sent the craft 
out of the ecliptic in February 1992. The encounter 
of the Sun’s south pole took place in 1994, with an en- 
counter of the north pole in 1995. As the mission pro- 
gressed, one of the key scientific findings revealed 
a Sun with a homogeneous magnetic field lacking 
latitudinal variations, very different from the mag- 
netic climate observed by ground-based instruments. 
Ulysses’s observations continued after its flight over 
the Sun’s poles, and it made another passage over the 
Sun’s polar regions under conditions of high solar 
activity in 2000 and 2001. In 2006 it began its third 
passage over the Sun’s poles. 

Mars Observer. This mission was to be the United 
States’ first return to the red planet since the Viking 
effort of the 1970s. A polar orbiter, Mars Observer 
carried a high-resolution camera and other instru- 
ments to study Mars’s geology and atmosphere. The 
spacecraft was adapted from an assembly-line com- 
mercial Earth-orbiting satellite in an effort to stream- 
line design and save costs. 

Launched on a Titan 3 rocket in September 1992, 
the mission proceeded according to plan until 3 days 
before Mars arrival in August 1993, when all contact 
was lost with the craft. The loss took place during a 
period in which Mars Observer’s propulsion system 
was being pressured in preparation for the firing of 
its main engine to place it on orbit; the craft’s radio 
transmitter was intentionally turned off at the time to 
protect it from any forces during the pressurization. 
A review panel concluded that a failure probably oc- 
curred in the propulsion system that either ruptured 
a fuel line or caused an explosion. 

Clementine. Unlike most United States missions, 
carried out by NASA, this novel effort was the work of 
the Department of Defense. Clementine’s flight plan 
called for it to orbit Earth for several days before de- 
parting for the Moon, where it would spend more 
than 2 months in orbit capturing high-resolution im- 
ages of the lunar surface. The spacecraft was then 
to leave once more on a trajectory that would take 
it within 60 mi (100 km) of the asteroid 1620 Ge- 
ographos. 

After launch in January 1994, Clementine (Fig. 15) 
arrived in lunar orbit in February and carried out 
systematic mapping; on May 5, it left lunar orbit. Two 
days later, a malfunction of the on-board computer 
caused several of the craft’s thrusters to be activated 
during a 20-min period when Clementine was out of 
touch with ground controllers. This depleted all fuel 
from the craft’s tanks, preventing the asteroid flyby. 

Although it returned a wealth of lunar imagery, 
the chief objective of Clementine was not scientific, 
but testing 23 advanced technologies of interest to 
the military. Moreover, an aim was to demonstrate 
a streamlined project management approach that 


would make it possible to build and launch a small 
yet highly capable spacecraft cheaply in less than 
2 years. 

Cassini. Considered the last of the its era of 
large United States planetary exploration missions, 
Cassini (Fig. 16) was launched in 1997 to study the 
ringed planet Saturn and its major natural satellite, 
Titan. The Cassini orbiter carried an instrumented 
probe provided by the European Space Agency, Huy- 
gens, designed to descend to the surface of Titan. 
Shrouded by an opaque atmosphere containing a 
rich diversity of organic compounds, Titan possibly 
contains liquid oceans of methane. Like Galileo, the 
craft has made use of multiplanet gravity assists, fly- 
ing by Venus, Earth, and Jupiter on the way to its 
destination. 

The Cassini and Galileo spacecraft made coordi- 
nated observations of Jupiter on Cassini’s flyby of 
the giant planet on December 30, 2000. Cassini data 
were used to track daily changes of some of the 
planet’s most visible storms and to study patterns 
in natural radio emissions near the edge of Jupiter’s 
magnetic environment. 

After 7 years in transit, Cassini entered orbit at 
Saturn on July 1, 2004. Six months later, on January 
14, 2005, the Huygens probe took photos as it de- 
scended to the surface of Titan, where it touched 
down and continued to send data for about 90 min. 
Preliminary findings confirmed that the landing site 
is near the shoreline of a liquid ocean. 

The Cassini orbiter, meanwhile, continued on nu- 
merous close approaches to Saturn’s moons and 
studies of its atmosphere, rings, and magnetic field. 
One notable early science finding was the discovery 
that Saturn’s moon Enceladus may have liquid water 
reservoirs that erupt in geysers. 





Fig. 15. Clementine spacecraft being prepared for testing 
at Naval Research Laboratory, Washington, DC. 
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Fig. 16. Cassini spacecraft, showing primary subsystems. (JPL/NASA) 


Discovery Programs. Faced with constrained bud- 
gets but increasingly complex and costly planetary 
spacecraft, NASA in the mid-1990s undertook a con- 
scious program of downsizing missions. At the cen- 
ter of this effort was the Discovery Program, de- 
signed to sponsor low-cost solar system missions 
with highly focused science objectives. The program 
periodically makes competitive selections from pro- 
posals submitted by teams led by scientific princi- 
pal investigators. Early selected missions included 
Near Earth Asteroid Rendezvous, Mars Pathfinder, 
Lunar Prospector, Stardust, Comet Nucleus Tour, 
Messenger, and Deep Impact. 

Near Earth Asteroid Rendezvous (NEAR). The NEAR probe 
was launched in 1996 to rendezvous with and orbit 
the asteroid Eros, one of the small rocky bodies 
that pass relatively near Earth. After passing asteroid 
253 Mathilde in 1997, the probe was scheduled to en- 
ter orbit around Eros in late 1998. Due to a problem 
with flight software, however, the orbital insertion 
engine firing did not take place correctly, and NEAR 
sailed past Eros. However, the craft was able to try 
again about a year later after circling once around 
the Sun. NEAR correctly entered orbit around Eros 
on February 14, 2000, and began a series of maneu- 
vers to take it close to the surface of the small rocky 


body. The spacecraft, renamed NEAR-Shoemaker in 
honor of the late astronomer Eugene Shoemaker, 
gently landed on asteroid Eros on February 12, 2001, 
after orbiting the rocky body for a year. It captured 
69 images during the final 3 mi (5 km) of its descent, 
revealing features as small as 0.4 in. (1 cm) across. 
The final signals from the spacecraft before it was 
shut down were received February 28, 2001. 

Lunar Prospector. One of the smallest of the early Dis- 
covery missions, Lunar Prospector was launched in 
1998 and reached the Moon after 5 days in flight. Or- 
biting 60 mi (100 km) above the lunar surface, the 
probe carried five instruments designed to help un- 
derstand the origin and evolution of the Moon and 
to determine whether water ice may be present in 
shaded areas near the Moon’s poles. After a year and 
a half in orbit, Lunar Prospector was intentionally 
crash-landed in a perpetually shadowed crater near 
the Moon’s south pole; scientists hoped that the im- 
pact would throw off water vapor detectable from 
ground- and space-based observatories. No visible 
debris plume was detected, however. 

Stardust. Launched in 1999, this probe (Fig. 17) 
was designed to fly through the cloud of dust that 
surrounds the nucleus of a comet and, for the first 
time, collect cometary material for return to Earth. 
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Fig. 17. Stardust spacecraft, showing primary subsystems. (JPL/NASA) 


The spacecraft had a pair of tennis-racket-shaped 
collectors using a substance called aerogel to snag 
minuscule dust particles. During the course of the 
mission, the spacecraft collected dust streaming 
through the solar system from interstellar sources 
as well as dust thrown off by Comet Wild-2 during 
Stardust’s flyby on January 2, 2004. On January 15, 
2005, the spacecraft returned to Earth, where a sam- 
ple capsule descended by parachute to the salt flats 
of the Utah desert. 

Genesis. Somewhat similar in concept to Stardust, 
Genesis was a mission designed to collect particles 
and return them to Earth. Launched August 8, 2001, 
Genesis traveled sunward to the Lagrange 1 point, 
where it spent more than 2 years collecting sam- 
ples of solar wind particles. On September 8, 2004, 
the spacecraft brought its sample return capsule to 
Earth and put it on a trajectory to descend over the 
salt flats of Utah. The capsule’s drogue parachute and 
parafoil did not deploy, however, and the capsule im- 
pacted the ground at a speed of 193 mi/h G11 km/h). 
Despite this setback, technicians found that much of 
the payload was intact, allowing the science team to 
fulfill its highest objectives, including the measure- 
ment of the isotopic ratios of oxygen and nitrogen 
in the solar wind. 

Comet Nucleus Tour. The Comet Nucleus Tour (Con- 
tour) mission was designed to visit two diverse 
comets. The spacecraft was launched July 3, 2002, 
but 6 weeks later contact was lost after a maneuver 
intended to send the probe out of Earth orbit. 

Messenger. Messenger—short for Mercury Surface, 
Space Environment, Geochemistry, and Ranging— 
was conceived as the first probe to orbit Mercury, 
the closest planet to the Sun. Mercury was previously 
studied at close range in the 1970s during three fly- 
bys by the Mariner 10 spacecraft. Messenger was 
launched August 3, 2004, and was to fly by Earth 
once, Venus twice, and Mercury three times before 
entering orbit at Mercury in 2011. 

Deep Impact. This novel mission concept called for 
a spacecraft to fly to a comet and dispatch a copper 


projectile that would excavate a substantial crater in 
the comet’s nucleus, giving scientists a unique view 
of the nucleus’ makeup. Launched January 12, 2005, 
Deep Impact encountered Comet Tempel 1 on July 
4, 2005, delivering the impactor was planned. The 
collision—documented both by the penetrator be- 
fore its demise and by the flyby spacecraft—threw 
off a large bright cloud of debris. Investigators were 
surprised to find that the comet nucleus was buried 
in a deep layer of powdery dust. 

New Millennium Program. At about the same time 
that NASA established the Discovery Program to 
sponsor smaller planetary missions, it also created a 
program called New Millennium designed to launch 
missions flight-testing advanced technologies. The 
first New Millennium mission was Deep Space 1, 
a probe launched in 1998 to test an ion engine 
and 11 other technologies. After an initial problem 
in starting the propulsion system, the ion engine 
worked well, and the technology validation effort 
was viewed as highly successful, although a pointing 
problem prevented the probe’s camera from obtain- 
ing useful photos during an asteroid flyby in 1999. 
See ION PROPULSION. 

After the failure of the star tracker on Deep 
Space 1, the spacecraft’s science camera was repro- 
grammed to maintain the craft’s three-dimensional 
orientation in flight. After completing its primary 
mission of flight-testing the new space technologies, 
Deep Space 1 executed a flyby of Comet Borrelly in 
September 2001. 

Mars initiatives. Following the failure of Mars Ob- 
server, NASA quickly approved an orbiter mission, 
Mars Global Surveyor, to duplicate most of the sci- 
entific objectives of the lost spacecraft. In addition, 
the space agency established a program to send pairs 
of orbiters and landers to Mars during each bian- 
nual launch period, culminating in sample return 
missions in the early twenty-first century. 

Mars Global Surveyor. Launched in 1996, Mars Global 
Surveyor spent 2 years circularizing its orbit using 
the technique of aerobraking, wherein the spacecraft 


is slowed by frictional drag as it skims through the 
planet’s thin upper atmosphere at each orbital peri- 
apsis. First tested at the end of Magellan’s mission at 
Venus in 1994, aerobraking allows planetary orbiters 
to use less propellant to carry out orbital insertion at 
the time of arrival. Mars Global Surveyor embarked 
on its main mapping mission in 1999, collecting high- 
resolution images and elevation data. Among early 
findings, scientists determined that the Martian mag- 
netic field is not globally generated in the planet’s 
core but is localized in particular areas of the crust. 
But the most startling science results came shortly 
before contact was lost with Mars Global Surveyor 
in 2006 following nearly a decade in orbit around the 
Red Planet. By comparing photos taken over a series 
of several years, scientists found evidence of bright 
new deposits in two gullies on Mars suggesting that 
liquid water flowed there briefly sometime during 
those years. 

Mars Pathfinder. This mission was initially conceived 
as an engineering demonstration of a way to deliver 
a payload to Mars using the innovative technique of 
inflatable airbags for landing, and was subsequently 
equipped with an instrumented robotic rover to 
carry out a science mission. Launched in 1996, Mars 
Pathfinder (Fig. 18) landed on July 4, 1997, in Ares 
Vallis, an ancient floodplain that scientists selected 
because it was believed to contain a wide variety 


wind sensor 


of rocks. The last successful data transmission from 
Pathfinder was completed on September 27, 1997. 
Both the lander and rover outlived their design life- 
times, the lander by nearly three times and the rover 
by twelve times. Pathfinder returned 2.3 x 10° bits 
of information, including more than 16,500 images 
from the lander and 550 images from the rover, as 
well as more than 15 chemical analyses of rocks and 
soil and extensive data on winds and other weather 
factors. Findings from the investigations carried out 
by scientific instruments on both the lander and the 
rover suggest that Mars was at one time in its past 
warm and wet, with water existing in its liquid state 
and with a thicker atmosphere. 

Mars 96. During the same biannual launch pe- 
riod when Mars Pathfinder and Mars Global Sur- 
veyor were launched, Russia launched a complex 
spacecraft called Mars 96, which combined an or- 
biter and several landers. Originally named Mars 
94, the mission was renamed due to a launch de- 
lay. The Mars 96 probe failed to leave Earth orbit 
when booster rockets misfired, and reentered Earth’s 
atmosphere. 

Mars 98. Mars 98 consisted of the Mars Climate 
Orbiter and the Mars Polar Lander. The orbiter car- 
ried the balance of science instruments from the pay- 
load of the lost Mars Observer that were not reflown 
on Mars Global Surveyor. The lander, meanwhile, 
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Fig. 18. Mars Pathfinder, as deployed on Mars, showing primary subsystems. (JPL/NASA) 
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was designed to be the first spacecraft to settle 
down near the planet’s south pole to study the 
history of water on Mars. In addition, the lander 
carried two Deep Space 2 microprobes, developed 
under NASA’s New Millennium Program, designed 
to crash into the planet’s surface independently and 
test for the presence of water ice as a technology 
demonstration. 

Launched in December 1998, Mars Climate Or- 
biter was lost when it entered the Martian atmo- 
sphere too low and presumably burned up; a subse- 
quent investigation established that the failure was 
caused because navigation-related data were not 
properly converted from U.S. Customary to metric 
units. Mars Polar Lander was launched in January 
1999 and was also lost upon arrival. Although the 
cause of this failure was less certain, investigators 
concluded that flight software did not properly take 
into account the behavior of sensors on the space- 
crafts landing legs, and as a result thrusters were 
probably cut off too early during the craft’s final de- 
scent. No signal was received from the Deep Space 2 
microprobes, which were to be jettisoned just before 
atmospheric entry. 

Mars Odyssey. A follow-up to Mars Global Surveyor, 
the Mars Odyssey spacecraft was an orbiter launched 
April 7, 2001, that reached the planet 6 months 
later. Aerobraking ended in January 2002, and the 
science mission commenced a month later. Mars 
Odyssey carries three science instruments, includ- 
ing one package designed to study the radiation en- 
vironment in space and at Mars for the purpose of 
protecting future astronauts; it also serves as a com- 
munication relay for the Mars Exploration Rovers. 

Mars Express/Beagle 2. For the 2003 Mars launch op- 
portunity, the European Space Agency conceived an 
orbiter, Mars Express, equipped with a science pay- 
load including an imaging radar that could penetrate 
the planet’s surface. In addition, Mars Express car- 
ried Beagle 2, a United Kingdom-built lander de- 
signed to search for signs of Martian life, past or pres- 
ent. Launched June 2, 2003, on a Soyuz-Fregat rocket 
from Kazakhstan, the orbiter reached the Red Planet 
on December 25, 2003. Beagle 2 entered Mars’s at- 
mosphere the same day, but contact was never estab- 
lished with it following its descent. Mars Express, 
meanwhile, embarked on its orbital study, produc- 
ing early results such as evidence of buried impact 
craters from the imaging radar. 

Mars Exploration Rovers. During the same period in 
which Europe launched Mars Express and Beagle 
2, the United States lofted two identical robotic 
rovers designed to explore Mars, Spirit and Opportu- 
nity. The travel capabilities of the rovers were much 
greater than those of Sojourner, the United States’ 
previous Mars rover that landed in 1997. Spirit was 
launched June 10, 2003, and landed January 4, 2004, 
while Opportunity was launched July 7, 2003, and 
landed January 25, 2004. They touched down on op- 
posite sides of the planet— Spirit in Gusev Crater and 
Opportunity in Merdiani Planum, both sites being 
viewed as promising for the study of rocks that could 


reveal past water activity on Mars. Despite a software 
problem that prevented Spirit from performing sci- 
ence for 10 days shortly after its landing, both rovers 
went on to travel around studying rocks for several 
years, far beyond their originally planned 90-day mis- 
sions. 

Mars Reconnaissance Orbiter. On August 12, 2005, the 
United States launched Mars Reconnaissance Or- 
biter, which reached the planet on March 10, 2006, 
and, following aerobraking, began its science mis- 
sion in November. One of its instruments is a high- 
resolution camera that is the most powerful ever 
carried on a planetary mission. In addition, the or- 
biter is equipped with a subsurface radar designed 
to operate in conjunction with the radar on Mars Ex- 
press. Soon after arrival, the science team released 
photos taken by the orbiter of several landers on 
the planet’s surface, demonstrating the power of the 
high-resolution camera. 

Other missions. Although the United States’ Dis- 
covery and Mars programs account for many of 
the missions sent beyond Earth orbit in the early 
twenty-first century, several other projects have been 
launched under other sponsorships. 

Wilkinson Microwave Anisotropy Probe. Intended to survey 
the sky at infrared wavelengths to measure heat left 
over from the big bang, the Wilkinson Microwave 
Anisotropy Probe was launched June 30, 2001. It 
orbits the Lagrange 2 point some 9 x 10° mi (1.5 x 
10° km) from Earth. The mission was able to measure 
many cosmological parameters with higher accuracy 
than was previously possible; it puts the age of the 
universe, for example, at 13.7 x 10° years, plus or 
minus 200 x 10° years. See WILKINSON MICROWAVE 
ANISOTROPY PROBE. 

Hayabusa. Known before launch as Muses-C, 
Hayabusa is a Japanese mission to collect samples 
from an asteroid and return them to Earth. The space- 
craft was launched May 9, 2003, and approached the 
small near-Earth asteroid 25143 Itokawa in Septem- 
ber 2005. It did not enter orbit around the asteroid, 
but took up a heliocentric orbit nearby. After making 
observations of the object, the spacecraft prepared 
for a series of touchdowns to collect samples. It was 
uncertain whether the sample collection was car- 
ried out as planned, but the spacecraft was believed 
to have probably obtained at least some dust in its 
sample collector. The spacecraft is scheduled to de- 
liver a reentry capsule in June 2010 to Earth, where 
it will land via parachute in Australia. 

Spitzer Space Telescope. Similar in concept to a previ- 
ous Earth-orbiting infrared observatory, the Spitzer 
Space Telescope was launched August 25, 2003, and 
occupies an Earth-trailing heliocentric orbit, drifting 
away from Earth at about 0.1 AU per year. The tele- 
scope features a 33.5-in. (85-cm) mirror and three 
instruments that allow it to conduct a variety of 
photometric, spectroscopic, and spectrophotomet- 
ric studies. See SPITZER SPACE TELESCOPE. 

Smart 1. Smart 1 (the name derives from Small 
Missions for Advanced Research in Technology) 
was a European spacecraft that orbited the Moon. 


The mission’s primary objective was testing of a 
solar-powered ion thruster. Launched September 27, 
2003, from French Guiana, Smart 1 spent 13 months 
in an orbit gradually spiraling outward from Earth, 
shaped by the ion propulsion. In November 2004 the 
ion thruster was activated to decelerate it into orbit 
around the Moon—a maneuver that was expected 
to take 4'/, days but which required approximately 
3 months. It was intentionally crash-landed onto the 
Moon on September 3, 2006. 

Rosetta. The European Rosetta probe was launched 
March 2, 2004, on a mission to rendezvous with 
Comet 67P/Churyumov-Gerasimenko more than 
10 years later, in May 2014. En route, the spacecraft is 
scheduled to fly by Earth three times and Mars once, 
in addition to a flyby of the asteroid 2867 Steins in 
2008. Rosetta carries a lander named Philae, which 
will use harpoons and drills to secure itself to the 
surface of the comet nucleus. 

Venus Express. Another European mission, Venus Ex- 
press was launched November 9, 2005, and entered 
orbit around Venus on April 11, 2006. The purpose of 
the mission is to make detailed studies of the planet’s 
atmosphere, clouds, plasma environment, and sur- 
face characteristics. 

New Horizons. During the 1990s, NASA began con- 
ducting studies of how to carry out a mission to 
Pluto, which was then regarded as the solar system’s 
only planet that had not been visited by a robotic 
spacecraft. The mission concept that was eventually 
approved and moved forward was New Horizons. 
Launched January 19, 2006, New Horizons was to fly 
by Jupiter in 2007 and fly by Pluto in July 2015, con- 
tinuing on and out of the solar system. One or two en- 
counters with Kuiper Belt objects may be possible in 
the event of an extended mission. A few months after 
the launch, the International Astronomical Union re- 
classified Pluto as a dwarf planet. See KUIPER BELT; 
PLUTO. 

Solar Terrestrial Observatory. The Solar Terrestrial Ob- 
servatory (Stereo), is a mission designed to study 
the Sun. Two identical spacecraft were launched 
October 26, 2006, into highly elliptical geocentric 
orbits with an apogee at about the distance of the 
Moon. After a few orbits, they were to pass close to 
the Moon, where a gravity-assist effect would send 
them into heliocentric orbits inside and outside the 
Earth’s orbit. The dual spacecraft allow stereoscopic 
imaging of the Sun and solar phenomena such as 
coronal mass ejections. 

Trends. Missions of the early twenty-first century 
demonstrate ongoing refinement in a number of 
areas. One area that has seen considerable develop- 
ment is that of increasing on-board autonomy. For 
example, the Mars Exploration Rovers are able to ex- 
ecute simple decision-making about their environ- 
ments, and how to carry out science observations 
that makes operations proceed more quickly than 
if every move was commanded from Earth. In addi- 
tion, newer spacecraft have great capabilities in their 
anomaly response. The fault protection system of an 
older spacecraft might direct it simply to enter a pro- 
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tective or “safing” mode in response to unexpected 
conditions. Newer spacecraft are programmed to 
carry out additional diagnostics and take corrective 
actions. 

Another new initiative is in the area of in situ sci- 
ence instruments. The development of greatly minia- 
turized microprocessors, memory, and sensors has 
enabled the creation of more sophisticated science 
instruments. Many of these are designed to search 
for evidence of exobiological processes on Mars and 
other bodies. Franklin O’Donnell 
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Experiments conducted in space in order to take 
advantage of the reduced gravity in studies of the 
growth, behavior, and properties of materials. Space- 
lab, developed by the European Space Agency (ESA), 
was a laboratory module that flew in the space 
shuttle payload bay. First launched in 1983 on the 
Columbia, the module became the workhorse for 
United States and international science missions em- 
phasizing low gravity. Twelve such missions were 
flown, and an additional seven missions utilized 
Spacelab components (such as a pallet to support 
experiments not requiring direct crew interaction). 
The experiments in fluids, combustion, materials sci- 
ence, and biotechnology conducted on these mis- 
sions, together with their related ground-based re- 
search, resulted in more than 2200 scientific pub- 
lications, about 1200 of which appeared in peer- 
reviewed journals. 

The Spacehab laboratory module, which also flies 
in the shuttle payload bay, was privately developed 
by Spacehab, Inc., beginning in 1993. A number 
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of commercial experiments have been conducted 
on the 21 Spacehab missions (as of January 2007), 
several of which carried experiments to the Russian 
Mir space station. 

Biotechnology. Since biotechnology experiments 
generally have modest space and power require- 
ments, they were able to be accommodated by mid- 
deck lockers in other shuttle missions as well as on 
the Spacelab microgravity emphasis missions, which 
provided them with many more flight opportunities. 
Even though the microgravity environment on some 
of these missions was not ideal, biomolecular crystal 
growth experiments for structural analysis as well 
as cell and tissue culturing experiments produced 
many interesting results, some of which could have 
commercial applications. 

Protein crystal growth. It has now been established be- 
yond question that at least some proteins and other 
macromolecular systems form higher-quality crys- 
tals in a microgravity environment. A survey of 221 
macromolecular growth systems flown on 19 shuttle 
missions found that 57.1% produced crystals large 
enough for diffraction analysis, and 27.2% (or al- 
most half of those that grew large enough for diffrac- 
tion analysis) showed improvement in the diffrac- 
tion data compared to the best crystals ever grown 
on the ground by any means. One criterion used 
for selecting growth systems for flight was the abil- 
ity to grow diffraction-sized crystals in the flight 
apparatus in a time commensurate to the time avail- 
able on orbit. Since crystals growing under diffusion- 
limited transport in space would be expected to 
grow more slowly than those under convective en- 
hanced transport on the ground, there is little ques- 
tion that many more of the space-grown crystals 
would have grown to a size large enough for x-ray 
analysis if more time had been available. This sup- 
position has been confirmed by crystal growth ex- 
periments that were carried to the Mir station. Even 
though conditions on Mir were not ideal for crystal 
growth, several of these growth systems have pro- 
duced centimeter-sized crystals that are large enough 
for neutron diffraction analysis. (Neutron diffraction 
can directly reveal the positions of hydrogen atoms, 
whereas the positions of these atoms must be in- 
ferred from the positions of heavier atoms seen by 
x-ray diffraction.) Crystal size limitations have re- 
stricted the number of protein structures that have 
been determined by neutron diffraction analysis to 
fewer than a dozen. See CRYSTAL GROWTH; CRYSTAL 
STRUCTURE; NEUTRON DIFFRACTION; X-RAY CRYSTAL- 
LOGRAPHY; X-RAY DIFFRACTION. 

In some cases the improvement in diffraction reso- 
lution seen in the space-grown crystals has been key 
to solving the structure of a particular protein for the 
first time, and in other cases the space-grown crys- 
tals contributed significantly to the refinement of the 
structure of certain proteins. Several drugs based on 
structures in which space-grown crystals played a 
part are in clinical trials. 

Cell culturing experiments. A number of experiments 
have demonstrated that cell metabolism and biosyn- 
thesis of cell products are altered in microgravity. 


For example, the analysis of the essential oils pro- 
duced by a rose flower on orbit found products that 
are not normally found in normal gravity. Several 
experiments have also shown that gene regulation 
is significantly altered on exposure to microgravity. 
Included are genes that regulate cell-cycle, apop- 
tosis, heat shock response, metabolism, structural 
proteins, cancer cell growth, intracellular signaling, 
mitochondrial proteins, and a number of transcrip- 
tion factors and DNA binding proteins that are either 
up-regulated or down-regulated. In particular, it was 
found that a specific cell-cell signaling matrix mate- 
rial, important to the layering of cells during heart 
development, was up-regulated in microgravity. This 
information could be important to the development 
of tissue-engineered heart patches to replace dam- 
aged cardiac tissue. See BIOMEDICAL ENGINEERING; 
CELL (BIOLOGY); TISSUE CULTURE. 

Normal cells do not tend to form three- 
dimensional structures in cultures in vitro. However, 
a new rotating bioreactor developed by NASA mim- 
ics one aspect of microgravity by keeping cells in 
suspension as they continuously fall through their 
culture medium. This device has been used on the 
ground by a variety of investigators at the NASA/NIH 
Center for Three-Dimensional Tissue Culture and at 
several universities to study a broad spectrum of 
biomedical research issues. However, it has also been 
found that the shear forces on the cells in the rotating 
reactor often inhibit their attachment to the three- 
dimensional assembly. This was justification for fly- 
ing the rotating bioreactor on several shuttle flights 
to the Mir space station. In microgravity, the rota- 
tion rates required to maintain the cells in suspen- 
sion are orders of magnitude smaller than on Earth. 
Techniques developed to provide nutrients to the 
culture and remove waste products were success- 
fully demonstrated, and several cultures were carried 
out which had proven difficult in normal gravity. 

Combustion experiments. There are two com- 
pelling reasons for the study of combustion in 
microgravity. One is the issue of fire safety in the 
design and operation procedures of orbiting labora- 
tories; the other is to take advantage of the weight- 
less state to study certain combustion phenomena in 
more detail and to test various models in which con- 
vection has been ignored in order to be mathemati- 
cally tractable. Examples of the latter are a number of 
droplet combustion experiments in which the burn- 
ing of free-floating or tethered droplets was studied. 
The absence of gravity allowed the droplet size to be 
increased to as much as 5 mm (0.2 in.) so that more 
detailed observation could be made. The objective 
is to test theories of droplet combustion and soot 
formation that are of importance to improving the 
efficiency of internal combustion engines, gas tur- 
bine engines, and home and industrial oil-burning 
heating systems. See COMBUSTION; GAS TURBINE; IN- 
TERNAL COMBUSTION ENGINE. 

In an interesting variation of this experiment, a 
container was filled with various combustible mix- 
tures near their lean limit of combustion (the lowest 
concentration of fuel-to-air mixture that will ignite 


and/or support combustion). A flame ball was cre- 
ated by an electrical spark. A stationary spherical 
flame front develops as fuel and oxygen diffuse into, 
and heat and combustion products diffuse out of, the 
flame ball. This is the simplest possible geometry in 
which to study the chemical reactions and the heat 
and mass transport of lean combustion processes. 
It is expected that these experiments will provide 
new insight into combustion processes in the lean 
burning limit, which are important in improving the 
efficiency of engines and heating systems. 

Materials science. Processing of metallic alloys and 
composites has been carried out in space in order to 
study their microstructure, thermal properties, and 
crystal growth. 

Evolution of microstructure. The strength and other 
properties of an alloy depend on its microstructure, 
which is characterized by the size, orientation, and 
composition of the grains that make up the solid. One 
of the main tasks of a materials scientist is to design 
solidification processes to produce the microstruc- 
ture that will give the material the desired properties. 
With current computational capabilities, it is possi- 
ble to design a complex mold so that the heat and 
mass flow will produce the desired microstructure 
throughout the final casting. However, in order to 
do this the basic laws governing the development of 
the microstructure must be known, along with the 
thermophysical properties of the components. Es- 
tablishing the physical basis for the various laws that 
describe the solidification process has, over the last 
half-century, transformed metallurgy from an indus- 
trial art based on empiricism to a more exact science. 
Because of the complicating effects of convection, 
many of the laws in use today are based on theo- 
ries that (for simplicity’s sake) assume no convective 
flows. The effects of convection are then added to 
the calculation to make the result somewhat more re- 
alistic. However, most of these theories have never 
been tested in the absence of convection, so vari- 
ous subtleties may have been overlooked. The ability 
to experiment in microgravity provides an opportu- 
nity to test some of these theories to make sure they 
are basically correct. Later Spacelab experiments in- 
vestigated theories that predict the lamella spacing 
in eutectic structures, the growth rate of dendrites 
in castings, the coarsening and particle agglomera- 
tion in composites formed by liquid-phase sintering, 
the particle-solidification interactions and ripening 
in alloys with a dispersed second phase, and the 
phase-separation mechanisms that act in monotec- 
tic alloys (systems which exhibit liquid phase immis- 
cibility). See ALLOY; COMPOSITE MATERIAL; CONVEC- 
TION CHEAT); MATERIALS SCIENCE AND ENGINEERING; 
METAL CASTING. 

Measurement of thermal properties. A number of self- 
diffusion and binary diffusion experiments in liquid 
metal systems have been carried out on the Space- 
lab missions. It was found that diffusion coefficients 
measured in space were generally 30-50% lower 
than the accepted values based on ground-based ex- 
periments carried out in capillaries to minimize con- 
vection. This would suggest that most, if not all, dif- 
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fusion measurements in normal gravity have been 
contaminated to some degree by convection. Fur- 
thermore, the increased precision of the space data 
allowed an accurate determination of the tempera- 
ture dependence of diffusion in liquid metal systems. 
Surprisingly, instead of following the Arrhenius be- 
havior typical of diffusion in solids, it was found that 
the diffusion data appear to fit a power law in which 
temperature is raised to exponents ranging from 1 to 
2, depending on the system. This unexpected power 
law behavior of diffusion may shed some light on the 
structure of the liquid state of metals, which is still 
poorly understood. See DIFFUSION. 

Conducting materials can be levitated by electro- 
magnetic induction in normal gravity, but the cur- 
rent required for levitation also heats the sample. In 
microgravity, the levitation and heating may be de- 
coupled, which allows samples to be deeply under- 
cooled in the liquid state with no physical contact 
that would induce the nucleation of the solid phase. 
Thus it becomes possible to measure thermophysical 
properties of liquids well below their normal freez- 
ing temperatures, which are important in developing 
new metallic glasses and other novel nonequilibrium 
solids. Several very clever techniques have been de- 
veloped for measuring the temperature, surface ten- 
sion, viscosity, and electrical resistivity of levitated 
samples with no physical contact using the TEMPUS 
facility developed by the Institute for Space Simula- 
tion in Cologne, Germany. These properties of sev- 
eral metallic glass-forming systems were measured 
in the undercooled state with this apparatus. One 
particularly interesting result was evidence for mag- 
netic ordering of ferromagnetic CogpPd29 in the un- 
dercooled liquid state near the Curie temperature of 
the solid, which demonstrates that ferromagnetism 
can indeed exist in the liquid state. See ELECTROMAG- 
NETIC INDUCTION; FERROMAGNETISM. 

Crystal growth experiments. On four Spacelab missions 
was flown a sophisticated solidification experiment 
apparatus developed by the French Space Agency 
(CNES) and the French Atomic Energy Commission 
(CEA) called Materials for the Study of Interesting 
Phenomena of Solidification on Earth and in Orbit 
(MEPHISTO). It was used by French and American 
investigators to study the stability and breakdown 
of the solidification interface of metal alloy systems. 
The Seebeck voltage generated by the melt-solid in- 
terfacial was measured, which served as an indicator 
of incipient interfacial breakdown. The results were 
used to test and refine theories of interfacial stabil- 
ity of both faceting and nonfaceting alloy systems. 
Such theories are important in developing crystal 
growth processes for alloy-type semiconductor and 
photonic systems in which interfacial breakdown 
must be avoided. See ALLOY STRUCTURES; SEEBECK 
EFFECT; SEMICONDUCTOR; THERMOELECTRICITY. 

A number of attempts were made to grow a va- 
riety of compound and alloy-type semiconducting 
systems from the melt with the aim of improv- 
ing the compositional uniformity through diffusion- 
controlled transport. This goal has proven to be elu- 
sive, even in microgravity, which underscores the 
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extreme sensitivity of such systems to buoyancy- 
driven flows even at acceleration levels on the order 
ofa micro-g. It was found, however, that dislocations 
and twinning were significantly reduced in many of 
the space experiments, especially in regions where 
the forming solid was not in contact with the wall 
of the growth ampoule. This observation would im- 
plicate wall effects as a major source of imperfec- 
tions in terrestrially grown crystals, and has encour- 
aged some crystal growers to try to develop growth 
methods that avoid or least minimize wall contact. 
See CRYSTAL DEFECTS. 

Several semiconductor systems were grown in 
space by the floating zone process to eliminate 
wall contact. The absence of hydrostatic pressure 
in space allows much larger floating zones to be 
deployed than are possible under normal gravity. 
Well-ordered crystals of gallium arsenide (GaAs) with 
diameters up to 20 mm were grown using a mir- 
ror furnance to focus light from an arc lamp onto 
the molten zone. The dislocation densities were 
very low; however, dopant striations attributed to 
nonsteady Marangoni convection (convective flows 
driven by surface tension gradients) were observed. 
Such striations were avoided in other growth sys- 
tems that had a native oxide skin which served to 
suppress the Marangoni flows. See ZONE REFINING. 

Robert J. Naumann 
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A reusable crewed orbital transportation system. The 
space shuttle, along with crewless (robotic) expend- 
able launch vehicles such as Atlas and Delta, make 


up the United States Space Transportation System 
(STS). The shuttle provides the unique capability 
for in-flight rendezvous and retrieval of faulty or ob- 
solescent satellites, followed by satellite repair, up- 
date, and return to orbit or return to Earth for repair 
and relaunch. The space shuttle also plays an essen- 
tial continuing role in the construction and provi- 
sioning of the International Space Station USS) by 
transporting major components, such as the giant 
solar-cell arrays and the Canadian computer-driven 
double-ended robotic arm, from Earth to the JSS 
and installing them using extended extravehicular 
activity (EVA, or space walks) by trained JSS resident 
and shuttle-visiting astronauts. See SATELLITE (SPACE- 
CRAFT); SPACE STATION. 

Early in its history the space shuttle was ex- 
pected to be a low-cost replacement for expend- 
able launch vehicles. Following the Challenger shut- 
tle accident in 1986, it became clear that crewless 
vehicles would have a continuing place in the 
United States launch vehicle fleet, that they have 
advantages over the shuttle such as lower cost 
and shorter lead time for tasks within their ca- 
pability, and that the shuttle fleet would be 
fully occupied for much of the first decade of 
the twenty-first century doing complex jobs re- 
quiring human presence, in particular associated 
with the ZSS, which no other launch system can do. 
For such reasons the STS was expanded to include 
the expendable launch vehicle families in use by 
the Department of Defense (DOD) and the National 
Aeronautics and Space Administration (NASA). 

Prior to permanent occupation of the ISS, the shut- 
tle carried attached payloads, such as the European 
Space Agency (ESA) Spacelab module, which made 
use of the low-gravity environment for mission dura- 
tions of up to 10 days or so. Such experiments are 
now almost always flown on the JSS to take advan- 
tage of the longer microgravity duration implicit in 
the JSS “continuing human presence in space” phi- 
losophy. 

The space shuttle orbiter accommodates a crew 
of four to seven for orbital mission durations up 
to about 10 days. The space shuttle flight system 
(Fig. 1) consists of the orbiter, which includes the 
three liquid-fueled shuttle main engines; an external 
fuel tank; and two solid-fuel strap-on booster rockets. 
The external tank is discarded during each launch. 
The solid-fuel rocket casings are recovered and 
reused. The orbiter lands horizontally as an unpow- 
ered (“dead stick”) aircraft on a long runway at the 
NASA Kennedy Space Center in Florida or, when con- 
ditions for landing these are unacceptable, at the 
Edwards Air Force Base in California. See ROCKET 
PROPULSION. 

Flight sequence. A nominal flight sequence is 
shown in Fig. 2. The shuttle is launched with 
all three main engines and both strap-on solid-fuel 
booster rockets burning. The booster rockets are 
separated 2 min after liftoff at an altitude of 30 mi 
(48 km), 28 mi (44 km) downrange. Main engine 
cutoff occurs about 8 min after liftoff, nearly 70 mi 
(110 km) above the Atlantic Ocean, 890 mi 


(1430 km) downrange from Kennedy. The external 
tank (ET) is separated from the orbiter shortly 
after the main engine cutoff, before orbital velocity 
(speed) is reached, so that the relatively light empty 
tank can burn up harmlessly during its reentry into 
the atmosphere above the ocean. Main propulsion 
for the rest of the mission is provided by the orbital 
maneuvering system (OMS) engines, whose first task 
is to complete insertion of the shuttle into its final, 
nearly circular path during its first orbit (flight once 
around Earth) after liftoff. 

After the shuttle orbiter completes the orbital 
phase ofits mission and is ready to return to Kennedy, 
its pilot rotates the spacecraft 180 (tailfirst relative 
to the orbiter’s direction of motion) and fires the 
orbital maneuvering system engines to decrease 
the space vehicle’s speed. This maneuver reduces 
the orbiter’s speed just enough to allow the orbiter 
to fall into the tenuous outer atmosphere, where 
atmospheric drag causes the orbit to decay (altitude 
and velocity decrease) along a spiral path. This 
reentry slowdown occurs in a precise, computer- 
controlled manner so that the landing may be made 
manually, halfway around the Earth, on the desired 
runway at Kennedy or Edwards. At orbital altitudes, 
small reaction-control engines (jets) maintain the 
desired orientation of the orbiter. As the spacecraft- 
becoming-airplane descends into the denser lower 
air, its aerodynamic control surfaces gradually take 
over their normal maintenance of heading, pitch, 
and roll, and the small jets are turned off. Landing 
is made at a speed of 220 mi/h (100 m/s). The 
brakes are helped to bring the vehicle to a stop by 
a parachute (drag chute) which is deployed from 
the tail after touchdown of the nose wheel and 
released well before the vehicle comes to a complete 
stop. 

Vehicle data. The orbiter carries into space the 
crew and the payload for the space shuttle vehicle 
system. It is comparable in size to a medium com- 
mercial jet transport, with a length of 121 ft G7 m) 
and a wing span of 78 ft (24 m). Its dry (unfu- 
eled) mass is 161,400 Ib (73,200 kg). The orbiter can 
deliver cylindrical payloads of up to 65,000 Ib 
(29,500 kg) of mass, with lengths as great as 60 ft 
(18 m) and diameters of up to 15 ft (4.6 m). 

The external tank is 157 ft (48 m) long by 28 ft 
(7.6 m) in diameter and contains a combined mass of 
1,550,000 Ib (703,000 kg) of available liquid oxygen 
(LOX) and liquid hydrogen propellants. The solid- 
fuel booster rockets are each 149 ft (45.4 m) long by 
12 ft G.7 m) in diameter and have a loaded mass 
of 1,292,600 Ib (586,200 kg). Together, the three 
space shuttle main engines and the two solid-fuel 
rocket boosters generate a thrust of 6,400,000 Ibf 
(28,500,000 newtons) at liftoff. The total mass 
of the shuttle system at liftoff is 4,491,000 Ib 
(2,037,000 kg). 

Orbiter vehicle systems. Major orbiter systems are 
the environmental control and life support, electric 
power, hydraulic, avionics and flight control, and the 
space shuttle main engines. Each system is designed 
with enough redundancy to permit either continua- 
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Fig. 1. Space shuttle lifting off from the Kennedy Space Center. 


tion of mission operations or a safe return to Earth 
after any single-element failure. For example, three 
fuel-cell power plants generate the in-flight electric 
power. Fach fuel cell feeds one of three indepen- 
dent electrical distribution buses. Similarly, three in- 
dependent hydraulic systems, each powered by an 
independent auxiliary power unit, operate the aero- 
surfaces. See FUEL CELL; SPACE POWER SYSTEMS. 

The avionics system uses five general-purpose 
computers, four of which operate in a redundant set 
while the fifth operates independently. These com- 
puters perform guidance, navigation, and control 
calculations, and operate the attitude (orientation) 
controls and other vehicle systems. They also mon- 
itor system performance and can automatically re- 
configure redundant systems in the event of a fail- 
ure during flight-critical mission phases. Each of the 
three space shuttle main engines has an independent 
computer for engine control during ascent. See DIG- 
ITAL COMPUTER; MULTIPROCESSING. 

The thermal protection system, which is not 
redundant, presented a major technical challenge 
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Fig. 2. Typical space shuttle mission profile. 


to the orbiter development schedule. This system, 
unlike those of previous single-use spacecraft, has a 
design requirement of reuse for 100 missions. Per- 
formance requirements also dictate that the sys- 
tem withstand temperatures as high as 3000°F 
(1650°C) while maintaining the vehicle’s structure 
at no more than 350°F (177°C). See ATMOSPHERIC 
ENTRY. 

The key to meeting this challenge was to develop 
a material possessing an extremely low specific heat 
capacity and thermal conductivity, together with ad- 
equate mechanical strength to withstand the launch 
and reentry vibration and acceleration. These ther- 
mal characteristics must remain stable up to temper- 
atures of at least 3000°F (1650°C). The solution was 
found in silica ceramic tiles, some 24,000 of which 
cover most of the orbiter’s surface. During the high- 
est thermal load portion of the reentry trajectory 
(path), the orbiter’s wings are level and the nose is 
elevated well above the flight path. This causes the 
temperature of the undersurface to be substantially 
higher than that of the upper surface. For this rea- 
son, the undersurface is covered with black borosili- 
cate glass-coated high-temperature tiles. Most of the 
upper surface of the shuttle is covered with a lower- 
temperature silica blanket made of two outer layers 
of woven fabric and an insulating buntinglike cen- 


ter layer stitched together in a quiltlike pattern. Fi- 
nally, the nose cap and wing leading edges are subject 
to the highest (stagnation) temperatures, which re- 
quires the use of molded reinforced carbon-carbon 
material. 

Flight testing. Like any commercial or military 
transport aircraft, the space shuttle system had to 
pass a series of flight tests before it could be com- 
mitted to operational service. These tests were con- 
ducted in two phases: approach and landing, and 
orbital flight. 

For the approach and landing phase, which tested 
the capabilities of the shuttle as an unpowered 
aircraft, the first test vehicle, Enterprise, was air- 
launched from a modified Boeing 747 airplane which 
was used later to airlift (“piggyback”) operational 
shuttle orbiters between California and Florida. Five 
successful flights in 1977 verified the subsonic dow- 
speed) aerodynamics and handling qualities of the 
orbiter. 

In 1981 the first protoflight powered space shuttle 
Columbia was launched from Kennedy on the first 
of four orbital test missions, STS 1 through STS 4. 
All major test objectives were successfully accom- 
plished. The most significant problem that surfaced 
involved damage to Columbia’s thermal tiles, but 
the orbiter’s underlying structure suffered no 


temperature problems and the repaired areas sur- 
vived subsequent flights without further thermal dif- 
ficulties. 

Flight operations. STS 5 initiated the operational 
shuttle flight phase in 1982. Many tests of the orbiter 
remained to be accomplished and were scheduled in 
conjunction with operational flights. The ability to 
do deployment, rendezvous, proximity operation, 
and retrieval using the remote manipulator system 
(robotic arm), was demonstrated in 1983 using 
the West German Shuttle Pallet Satellite Spacecraft 
(SPAS). One year later the first shuttle landing took 
place on Kennedy’s 3-mi (5-km) runway. Longer-term 
problems were found and fixed. For instance, an 
unexpected chemical reaction with the tile water- 
proofing agent caused degradation of the tile bond- 
ing adhesive, requiring replacement of the degraded 
material and development of a new waterproof- 
ing agent. In 1985 the planned four-shuttle fleet 
was completed with the delivery of the orbiter 
Atlantis. 

Payload accommodations. The orbiter payload bay 
is divided into four cylindrical sections, each of 
which can support one major payload. Standard ser- 
vices to each section include electric power, data 
recording, and two-way telemetry (data exchange). 
Optional services, such as active payload cooling, 
may also be purchased by the customer. 

Small experiments for researchers are accommo- 
dated by a variety of carriers. These are normally 
manifested on a space-available basis and flown at 
a relatively modest cost. Self-contained experiments 
requiring only turn-on and turn-off by the crew are 
carried in canisters mounted along the forward por- 
tion of the payload bay sidewall. They are known in- 
formally as “get-away special” (GAS) canister (CAN) 
payloads. Small experiments can also be acommo- 
dated in the middeck section of the pressurized 
cabin. Larger experiments are mounted on a carrier 
in the payload bay, making orbiter power and teleme- 
try services available. 

Experiments requiring precision pointing can be 
mounted on a free-flying carrier that is deployed and 
retrieved with the robotic arm. The carrier platform 
provides highly accurate pointing and a microgravity 
environment. 

Figure 3 shows a typical collection of payloads 
and experiments to illustrate the diversity of pur- 
poses which have been served on each shuttle flight. 
Specifically, it shows the STS 61-B cargo configu- 
ration, which was made up of three communica- 
tions satellites: the Australian AUSSAT 2; the Mexi- 
can MORELOS-B; and the United States commercial 
SATCOM KU-2. The attached payloads are Experi- 
ment Assembly of Structure in Extravehicular Activ- 
ity (EASE), Assembly Concept for Construction of 
Erectable Space Structure (ACCESS), IMAX Cargo Bay 
Camera (ICBC), and G-479 Telesat-Canada Getaway 
Special (GAS). In the crew compartment are the Con- 
tinuous Flow Electrophoresis System (CFES), Diffu- 
sive Mixing of Organic Solutions (DMOS), Morelos 
Payload Specialist Experiments (MPSE), and Orbiter 
Experiments (OEX). 
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Fig. 3. STS 61-B cargo configuration. Acronyms are explained in the text. 


It should be noted that, were this mission (which 
was launched in November 1985) planned after 
1986, probably each of the communications satel- 
lites would be handled by an expendable launcher, 
and after 2000 many of the experiments would be 
done on the JSS. 

Challenger disaster. Shortly before noon on Jan- 
uary 28, 1986, Challenger lifted off from Kennedy 
Space Center on the twenty-fifth shuttle mission. At 
72 s into the flight, with no apparent warning from 
real-time data, the external tank exploded and Chal- 
lenger was destroyed. 

All seven crew members perished, the first to 
die in NASA’s human space-flight program spanning 
25 years and 56 crewed launches. Further flight op- 
erations were suspended while a Presidential (Blue- 
ribbon) Commission reviewed the circumstances 
surrounding the accident, determined its probable 
cause, and developed recommendations for correc- 
tive actions. 

Recovery. Nearly 5 months later the Commission 
submitted its report which contained recommenda- 
tions to be implemented before returning the space 
shuttle to flight status. 

The cause of the accident was determined to be in- 
adequate design of the solid rocket motor field joint. 
“Field joint” refers to the assembly of the solid rocket 
motor at Kennedy rather than at the manufacturer’s 
factory, because the massive motor must be shipped 
in cylindrical sections. Deflection of the joint with 
deformation of the seals at cold temperature allowed 
hot combustion products to bypass both O-rings, re- 
sulting in erosion and subsequent failure of both pri- 
mary and secondary seals. 
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The redesigned field joint (Fig. 4) has an over- 
lapping tang that helps prevent separation during 
the startup phase of motor operation when the case 
is responding dynamically to the internal pressure 
buildup. A third O-ring seal, in the overlapping tang, 
was also included in the new design, along with the 
ability to verify the seal’s integrity after installation. 
Other improvements were also made in the design 
of the solid rocket motor. 

NASA completely reviewed all activities associ- 
ated with launch aborts and established an additional 
transatlantic abort landing site at Ben Guerir, Mo- 
rocco. An escape system was devised that allows 
the crew to parachute out of the orbiter once the 
vehicle has achieved controlled gliding flight. The 
system uses a pole extended out of the orbiter side 
hatch (Fig. 5). 

NASA also modified its shuttle program manage- 
ment structure and established an Office of Safety, 
Reliability, Maintainability, and Quality Assurance, re- 
porting directly to the NASA Administrator. 

Return-to-flight actions. Before resuming space shuttle 
flights, in addition to actions mandated by the Pres- 
idential Commission, NASA initiated a Systems De- 
sign Review (SDR), a detailed review of all shuttle 
program hardware and software. Its objective was 
to establish a series of changes that would result 
in improved safety, with reliability and efficiency 
being of second and third priority, respectively. As 
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Fig. 4. Solid-fueled rocket motor field joint. (a) Original design. (6) New design. 


a result, 254 modifications were made to the or- 
biter, 24 to each main engine, 30 to each solid rocket 
booster, and 13 to the external tank. 

Because the solid rocket cases are recovered and 
refurbished, it was known that the primary (but not 
the secondary) O-ring had failed on several earlier 
flights. For human space flight systems, a basic re- 
quirement of redundancy for safety, which includes 
the field joint O-rings, is that should the designed 
redundancy be known or suspected of being com- 
promised, the mission must be delayed until the re- 
dundancy is known to be restored. This requirement 
was waived by NASA for several shuttle missions, in- 
cluding Challenger’s final flight. 

On September 29, 1988, 32 months after Chal- 
lenger’s final flight, space shuttle Discovery success- 
fully resumed shuttle operations. During the next 
14 years, more than 80 missions were flown with 
no anomalies more serious than several foreshort- 
ened missions because of the failure of a redundant 
component or subsystem. 

Columbia disaster. On February 3, 2003, while the 
shuttle Columbia was reentering the atmosphere, 
it disintegrated. All seven crew members perished, 
and pieces of the orbiter fell in a long swath over 
East Texas. As much of the debris as possible was 
collected and taken to Kennedy Space Center, where 
pieces were placed in their appropriate position on 
a hanger floor. The Columbia Accident Investigation 
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Fig. 5. Orbiter crew escape system. (a) Typical profile. 1 ft = 0.3 m. (b) U.S. Navy jumper demonstrating use of the crew 


escape system pole in a test aircraft. 


Board (CAIB) was formed to determine the cause of 
the accident and to present recommendations. As 
during the Challenger investigation, all flights were 
suspended. 

Recovery. Seven months after the disaster, the CAIB 
issued its report. The cause of this accident was 
a breach in the thermal protection sytem on the 
leading edge of the left wing, caused by a piece of 
insulating foam that separated during launch from 
the left bipod ramp section on the external tank. 
This breach in the reinforced carbon-carbon panel al- 
lowed superheated air to penetrate through the lead- 
ing edge of the left wing, which melted the aluminum 
structure of the wing. Aerodynamic forces caused 
loss of control of the shuttle and its subsequent 
breakup. 

The CAIB made 15 recommendations. The inde- 
pendent NASA Return to Flight Task Group agreed 
that NASA had met the intent of 12 of the recom- 
mendations. NASA could not meet the major recom- 
mendation, to eliminate foam from leaving the ex- 
ternal tank, due to engineering problems. NASA did 
remove the foam on the bipod ramp and made im- 
provements in foam application to lessen the foam 
debris. NASA also met other recommendations for 
better ground-based imagery of the vehicle, and bet- 
ter high-resolution imagery of the orbiter and the 
external tank. 

STS 114 return to flight. When STS 114 launched in 
July 2005, the new cameras on the external tank, in- 
stalled for high resolution of the orbiter, recorded a 
large piece of foam coming off the tank. On orbit, 
the crew used the new Orbiter Boom Sensor Sys- 
tem (OBSS) to inspect the reinforced carbon-carbon 
and the tiles for damage. Only two gap fillers were 
found, extending between the tiles. NASA decided 
that the fillers had to be removed or cut off. An as- 
tronaut on the station arm pulled them out easily. On 
another spacewalk, NASA also tested several meth- 
ods to repair the tiles and the reinforced carbon- 


carbon, should further damage be detected in the 
future. 

Six in-flight anomalies were taken due to the un- 
expected foam loss. The analysis and wind tunnel 
testing required to understand and eliminate the 
loss led to further delays in the launch of the next 
mission. Worse, when hurricane Katrina struck the 
Gulf Coast, it damaged the Michoud Assembly Facil- 
ity, which builds the external tanks, causing further 
delays. The next mission, STS 12/, did not launch 
until July 4, 2006, almost exactly one year after 
STS 114. The most significant change in the tank 
was the elimination of the liquid hydrogen and liq- 
uid oxygen protuberance air load (PAL) ramps. The 
tank was shipped from Kennedy Space Center back 
to Michoud for the modifications. 

Technical achievements. The first operational shut- 
tle accomplished the first commercial satellite de- 
ployments in 1982. In 1984 the first in-orbit satellite 
repair was accomplished on the Solar Maximum 
Mission’s control system and a primary experiment 
sensor. Later that same year Palapa and Westar, two 
communications satellites in useless orbits, were re- 
covered and returned to Earth on the same space 
shuttle mission. These repair and recovery missions 
demonstrated the usefulness of humans in these new 
space tasks. 

In September 1988, a Tracking and Data Relay 
Satellite (TDRS) was launched by the first space shut- 
tle to fly after the Challenger accident. The initial 
TDRS network was completed 6 months later by 
the second shuttle launch of a TDRS, which enabled 
the closing of many Spaceflight Tracking and Data 
Network (STDN) ground stations and a major im- 
provement in the duration and bandwidth of com- 
munication with robotic and crewed spacecraft. See 
SPACE COMMUNICATIONS; SPACECRAFT GROUND IN- 
STRUMENTATION. 

The year 1989 was a banner one for solar sys- 
tem exploration, thanks to space shuttle launches of 
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Magellan to Venus and Galileo to Jupiter. In 1990 
the first mission, Ulysses, was undertaken to inves- 
tigate the magnetic field configuration and varia- 
tions around the Sun’s poles, after the craft’s shuttle 
launch. STS 32 launched the long-awaited Hubble 
Space Telescope (HST) in 1990; however, this tele- 
scope’s most important work was not accomplished 
until later that decade after two shuttle EVA visits— 
the first in 1993 to correct a bad case of astigmatism 
produced by the HST mirror manufacturer, and the 
second in 1997 to update the sensors and replace 
major spacecraft subsystem components that were 
failing. Additional servicing missions were carried 
out in 1999 and 2002. In 2006, after much debate, 
NASA decided to perform one more Hubble servic- 
ing mission, enhancing its capabilities and extending 
its life by replacing failed components and updating 
some sensors and data processors. 

Science accomplishments. Successful launches of 
Magellan, Galileo, Ulysses, and HST (together with 
the two in-orbit servicing calls) constitute a major 
contribution to the space sciences. The deployment 
of the Long Duration Exposure Facility (LDEF) as a 
free-flying satellite by a shuttle in 1984 and its shuttle 
recovery in 1990 alerted materials researchers to the 
rapid degradation of many materials used in space, 
even though some data were lost due to the delay 
of the LDEF recovery because of the hiatus in shut- 
tle operations in 1986-1988. Finally, the experience 
of the bioscientists and materials researchers devel- 
oping research methods and equipment for experi- 
ments aboard JSS is helping to get this work off to a 
good start. See REMOTE MANIPULATORS; SPACE BIOL- 
OGY; SPACE PROCESSING. 

Safety upgrades. Until 2001, official NASA state- 
ments projected the availability of a single-stage-to- 
orbit space shuttle replacement by 2012. Cancella- 
tion or suspension in 2001 of the X-33 and other 
steps toward a shuttle replacement led to specula- 
tion that the space shuttle might be the major crewed 
link with JSS until 2020 or later. Consequently, the 
Space Flight Advisory Committee advised NASA to 
give higher priority and funding to space shuttle 
high-priority safety upgrades, which NASA had al- 
ready identified and was investigating. 

In 2001, NASA planned to develop four high- 
priority safety upgrades for the shuttle fleet. An elec- 
tric auxiliary power unit would replace the haz- 
ardous hydrazine-fueled high-speed turbine-driven 
hydraulic systems. New engine sensors in the Main 
Engine Advanced Health Monitoring system could 
cut the risk of a catastrophic engine failure by about 
25%. The Solid Rocket Booster Advanced Thrust Vec- 
tor Control project would shift away from hydrazine 
in the turbines that drive the nozzle hydraulics. Cock- 
pit upgrades would install advanced avionics in the 
shuttle fleet, including the flat-panel cockpit displays 
already available on Atlantis to ease the crew work- 
load during ascent. 

In 2004, President Bush announced the Vision 
for Space Exploration. Part of the Vision required 
the shuttle to be retired no later than 2010, only 
5 years after the first return-to-flight mission. Conse- 


quently, some of the safety upgrades were canceled, 
and other upgrades were added. The electric auxil- 
iary power units and Solid Rocket Booster Advanced 
Thrust Vector Control programs were canceled. The 
Advanced Health Monitoring System was flown in 
monitor-only mode on STS 116 in December 2006. 
The results will be examined before it is placed in op- 
erational mode. All three orbiters received the Cock- 
pit Avionics Upgrade. 

A new upgrade was added, the Station/Shuttle 
Power Transfer System (SSPTS). Since the ISS is pow- 
ered from an inexhaustible supply of sunlight and the 
shuttle is limited in power by the amount of hydro- 
gen and oxygen carried onboard, a system was de- 
signed to power the shuttle from the /SS. The SSPTS 
converts the 125-volt DC JSS power to 28-volt DC 
shuttle power. This will allow the shuttle to stay at 
the JSS for longer periods of time. The shuttle Edeav- 
our has been modified with the SSPTS and is sched- 
uled to fly in June 2007 to test it. Installation of the 
SSPTS is also planned on the shuttle Discovery, but 
not on Atlantis, due to the retirement of the shuttles. 

John F. Clark; Jeffrey C. Mitchell 
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A complex physical structure specifically designed to 
serve as a multipurpose platform in low Earth orbit. 
Functioning independently and often without a crew 
actively involved in onboard operations, a space sta- 
tion contains the structures and mechanisms to oper- 
ate and maintain such support systems as command 
and data processing, communications and tracking, 
motion control, electrical power, and thermal and 
environmental control. Evolving together with tech- 
nology and increasing in scope and complexity, the 
space station has a history in which each program 
was based upon the developments and achievements 
of its predecessor. 

Salyut. The Soviet Union began construction of 
the world’s first space station, called Salyut, in 1970. 
It was the primary Soviet space endeavor for the next 
15 years. The design was retained not only through 
a series of Salyuts that were launched within that 
decade and later, but also through the development 
of Mir, the most famous, long-lived Russian achieve- 
ment in space. The Salyuts were cylindrical in shape 
and contained compartments with specialized func- 
tions for operating a space station. The docking com- 
partment was designed to accept the Soyuz space- 
craft that transported the cosmonauts. The transfer 


compartment gave the cosmonauts access to the var- 
ious work compartments. The work compartments 
contained the mechanisms that operated and con- 
trolled the station, as well as the laboratories in 
which the cosmonauts performed experiments 
while they were onboard. Salyut 1 initially carried a 
crew of three, but after three cosmonauts died when 
a valve in the crew compartment of their descent 
module burst and the air leaked out, the crews were 
reduced to two cosmonauts, and both crew mem- 
bers were outfitted with pressurized space suits. 

During the Cold War, Soviet Salyuts were launched 
as missions to serve both civilian and military 
purposes. These missions never were linked in a 
flight, and many of the details about the flights were 
shrouded in secrecy. Since the end of the Cold War 
those details have been declassified, revealing the 
extent and scope of Soviet activity in space, which 
included astronomical and medical experiments; 
testing of a broad range of communications and 
telemetry, guidance, and navigation systems for or- 
biting spacecraft; Earth observation using cameras 
and satellites; manufacture of space-processed prod- 
ucts; and, perhaps most importantly, evaluations 
such as the effects of zero gravity over an extended 
period of time on a human body. 

The Soviet Salyuts set the record of 8 months 
for extended duration of human space flight. At 
the end of their usefulness, all Salyut spacecraft 
were deorbited by Mission Control Moscow and per- 
mitted to burn up during reentry into the Earth’s 
atmosphere. 

Skylab. Starting later than the Soviets, the United 
States also had plans for a space station. The pro- 
gram, known as Skylab, was developed by the Na- 
tional Aeronautics and Space Administration (NASA) 
building on the success of its heavy-lift rocket, the 
Saturn, which had boosted the Apollo rockets and 
helped place the first human being on the Moon. Sky- 
lab weighed just less than 100 tons (90 metric tons). 
It was launched on May 14, 1973, from the Kennedy 
Space Center aboard a Saturn 5 rocket, which en- 
joyed a reputation for never having failed. Although 
the launch was flawless, a shield designed to shade 
Skylab’s workshop deployed about a minute after 
liftoff and was torn away by atmospheric drag. That 
began a series of problems, most involving overheat- 
ing, that had to be overcome before Skylab was safe 
for human habitation. Eventually three crews served 
aboard Skylab throughout 1973 for periods of 28, 59, 
and 84 days, respectively. The single greatest contri- 
bution made by each crew was the extravehicular 
activities that restored Skylab’s ability to serve as a 
space station. 

Apollo-Soyuz Test Program. In 1975, during the pe- 
riod of detente, plans were made for a joint United 
States-Soviet space venture, known as the Apollo- 
Soyuz Test Program (ASTP). For the first time, United 
States astronauts and Soviet cosmonauts became 
members of each other’s flight teams, training to- 
gether, touring their launch sites, and working in 
their mission control rooms. For 15 months, they 
trained and prepared for their historical joint mission 


in space. In July 1975, the astronauts of Apollo 18 
linked with the cosmonauts of Soyuz 22; their 
docked configuration functioned as a miniature in- 
ternational space station. 

Mir. Throughout the 1970s, the goal of the 
Soviet space program appeared to be the develop- 
ment of a permanently deployed space station. It 
would have both civilian and military applications, 
and it would be based upon the technology gleaned 
from the experience with the Salyut and the Soyuz 
vehicles. A major task that the Soviets sought to ac- 
complish in their quest for a permanent space sta- 
tion was the optimization of conditions that would 
enable cosmonaut crew members to live in space 
for extended periods of time. The crew then would 
be able to operate at maximum efficiency and effec- 
tiveness when performing space-based experiments. 
Soviet decision makers sought to take advantage of 
the unique opportunities presented by a permanent 
space station to exploit the space environment and 
benefit their economy. For example, Soviet scientists 
studied methods to direct solar power by means of 
equipment onboard a space station to Soviet facilities 
for consumption at home. 

Mir (which in Russian means either “peace” or 
“world”) was the name of the vehicle for the world’s 
first multipurpose, permanently operating crewed 
space station. Based upon the Salyut design and con- 
figuration, Mir essentially was an extension and ex- 
pansion of the shell of the basic Soviet space vehicle 
(Fig. 1). It would incorporate the standard Salyut/ 
Soyuz/Progress profile. External ports were added 
for docking of Soyuz vehicles which would carry 
crew members, and Progress (resupply) vehicles 
which would bring foodstuffs, drinking water, extra 
equipment and apparatus, sanitary requisites, medi- 
cal apparatus, and propellant. These same Progress 
vehicles would return space “junk” to Earth. Con- 
struction was based upon a modular design, per- 
mitting the Soviets to replace modules whenever 
significant improvements in technology made the 
earlier modules obsolete. Two cylindrical modules 
formed the basic shape of Mir and served as the liv- 
ing and central control compartments for the crews. 
Additional modules were used for scientific experi- 
ments. 

The Mir core station was a 49-ft (15-m) module. 
Its launch aboard a Soviet Proton rocket on February 
20, 1986, was televised internationally for the first 
time in Soviet space history. Mir was assembled in 
space and was composed of six modules. In addition 
to the core, the Kvant-1 module daunched in April 
1987) was a 19-ft (6-m) pressurized lab used for astro- 
physics research. The Kvant-2 module was launched 
in December 1989, and was used for biological re- 
search and for Earth observation. The Kristall mod- 
ule was launched in August 1990, and provided the 
docking port for the United States space shuttles that 
visited Mir. Between 1994 and 2001, there were nine 
dockings between space shuttle vehicles and Mir. 
The final two modules, Spektr and Priroda daunched 
in June 1995 and April 1996, respectively) were re- 
mote sensing modules used to study the Earth. Mir 
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Fig. 1. Mir complex, in orbit 1986-2001. (After Commercial Uses of Space Exploration [brochure], Glavcosmos, 1989) 


provided a platform for long-term microgravity re- 
search and increased knowledge about living and 
working in space. By the end of its function in March 
2001, the 143-ton (130-metric-ton) Mir had spent 
15 years in orbit and had orbited the Earth more 
than 87,600 times with an average orbiting speed 
of 17,885 mi/h (28,800 km/h). Its superstructure, 
109 ft 3 m) long and 90 ft (27 m) wide, burned 
up upon reentry in to the Earth’s atmosphere, and 
scraps were scattered into the Pacific Ocean just east 
of New Zealand. See AEROSPACE MEDICINE; SPACE BI- 
OLOGY; SPACE PROCESSING; SPACE SHUTTLE. 
International Space Station. Through the late 
1980s, the United States continued to compete with 
the Soviet Union for dominance in space. The Soviet 
successes with Mir prompted the United States to 
respond with what became known as Space Station 
Freedom. Moved from the NASA drawing board in 
1993, components of the space station were tested 
by shuttle crews during a variety of missions. Sup- 
porters of this space station recognized that the ben- 
efits of space exploration are far-reaching and long- 
term, and they advocated such unique opportunities 
as manufacturing drugs, scientific materials research 
in microgravity, and studying the health and status 
of the Earth’s environment from outer space. At 
the direction of President Clinton, the United States 
transformed the single-nation concept for Space Sta- 
tion Freedom into a multinational partnership with 
the European Space Agency and the Russian Space 
Agency to create what is now known as the Interna- 


tional Space Station (ISS). See SPACE PROCESSING. 

Five space agencies [NASA in the United States, 
the Canadian Space Agency (CSA), the European 
Space Agency (ESA), the Japan Aerospace Explo- 
ration Agency AXA), and the Russian Federal Space 
Agency (RSA)] and sixteen nations have united to 
build the International Space Station. More com- 
plex than any ground-based civil engineering project 
built to date, the JSS will take approximately 12 years 
to construct (Fig. 2). 

The design evolved over more than a decade. 
When completed, the JSS will be the largest human- 
made object ever to orbit the Earth. It will include 
six laboratories built by a consortium of nations. 
The internal research space will be equivalent to 
the cabin space inside a 747 aircraft. Approximately 
80 United States space shuttle, Russian Soyuz, and 
partner-nation heavy transport vehicle flights will 
transport the structures and mechanism necessary to 
construct the station over those 12 years. The United 
States space shuttle must deliver most JSS modules 
and major components. The ISS will weigh approxi- 
mately 1 million pounds (almost 450,000 kilograms); 
it will orbit the Earth at an inclination of 51.6° to 
either side of the Equator; it will fly at an altitude 
between 230 and 285 mi 370-460 km); and it will 
move at a speed of 17,500 mi/h (28,000 km/h), orbit- 
ing the Earth every 90 min. It will host a maximum 
crew of six. 

As of January 2007, there had been 18 United 
States space shuttle, 14 Russian Soyuz, and 





Fig. 2. Artist’s rendering of how the International Space Station may look when its assembly sequence is completed in 2010. 
(Changes to the final station configuration will likely occur before then.) The 1-million-pound (450,000-kg) station will have a 
pressurized volume equal to two jumbo jets, and an acre of solar panels. (NASA) 


25 Russian Progress flights to build and support 
the International Space Station. The first flight, 
from the Baikonur Cosmodrome (a launch facil- 
ity in Kazakhstan), was a Russian Proton rocket in 
November 1998, which carried the Zarya module 
or Functional Cargo Block (FGB). This was a self- 
contained, independently operating power and com- 
munications module. It formed the base to which 
further components were added. A month later, the 
space shuttle Endeavour carried up the Unity mod- 
ule, which was attached to Zarya. Additional flights 
added components and conducted repairs. The 
Russian service module (SM), Zvezda, was launched 
and docked with the FBG in July 2000. It provided 
the station’s early living quarters and systems to sup- 
port power, data processing, flight control, propul- 
sion, and life support. Consistent with the Russian 
historical design and architecture of the Salyut and 
Mir platforms, Zarya and Zvezda featured automated 
operation. 

Expedition flights support the ISS. Astronauts and cosmo- 
nauts assigned to flights to the ISS compose an Ex- 
pedition crew. As of January 2007, there had been 
14 Expedition crews posted to the ISS. Expedition 
crews are composed of either two or three mem- 
bers: a Commander and either one or two Flight En- 
gineers. The primary missions of each Expedition 
crew are to assemble the station and to learn how 
to live and work in space. These are essential tasks 


if crewed space programs are to venture deeper into 
the solar system for the purposes of human explo- 
ration and scientific discovery. To this end, Expedi- 
tion crew members serve as research scientists. They 
use the JSS as an orbiting laboratory and as a test plat- 
form for future technologies. 

Expedition One begins building the ISS. The first per- 
manent crew of the International Space Station 
also launched from the Baikonur Cosmodrome, on 
October 31, 2000, aboard a three-stage, 310-ton 
(280-metric-ton) Soyuz rocket. Known as Expedi- 
tion One, the crew was composed of one American 
and two Russians. U.S. Navy Captain William (Bill 
Shepherd served as the commander in what was his 
fourth trip into space. The Russians were Air Force 
Lieutenant Colonel Yuri Gidzenko, who served as 
Soyuz commander, and Sergei Krikalev, who served 
as flight engineer. This was the second flight for 
Gidzenko, whose first flight was as the commander 
for Mir on which he served for 179 days. Krikalev 
had been in space four times previously, logging 
484 days in space. One of his flights was to the In- 
ternational Space Station aboard space shuttle En- 
deavour for the mission which mated the Zarya and 
Unity modules. Shortly after their arrival, the Expedi- 
tion One crew named the International Space Sta- 
tion the “Space Station Alpha.” 

During flight the Expedition One crew used 
Greenwich Mean Time (GMT) as its reference. This 
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was designed to provide the best combination of 
communications coverage for the American and 
Russian flight control teams. 

While aboard the International Space Station, the 
Expedition One crew hosted three visiting space 
shuttle crews. Those crews arrived with the mis- 
sion of continuing the construction of the station. 
The vast array of photovoltaic cells that provide the 
bulk of the power for the station arrived and was in- 
stalled by one crew. An additional crew brought the 
American laboratory, Destiny, which will be the cen- 
ter for scientific research. Within the laboratory are 
housed a variety of scientific racks, containing hard- 
ware and equipment for research and study. The pri- 
mary mission of the Expedition One crew was to pre- 
pare the station for long-term habitation, and there 
were several significant tasks that had to be accom- 
plished. For crew command and control, all onboard 
computer control systems and all means to identify 
emergencies, warnings, and cautions were inspected 
and tested for function and reliability. For crew com- 
fort and safety, all air purification and revitalization 
devices, heaters, and water supply systems were 
inspected and made serviceable. For communica- 
tions within the modules and down to Mission Con- 
trols in Houston and Moscow, all internal and exter- 
nal communications, command and data handling, 
and telemetry systems were tested and made oper- 
ational. In addition, the Expedition One crew was 
to refurbish the International Space Station. That 
was accomplished by unloading three crewless 
Russian Progress resupply vehicles, all of which ar- 
rived and docked automatically to the ports available 
on the Russian Zarya and Zvezda modules. 

Because of the amount of time required by the Ex- 
pedition One crew to prepare the station for nomi- 
nal operations, there was not much time planned for 
scientific activities and experiments. However, some 
of the experiments planned for the first expedition 
included research into protein crystal growth, anal- 
ysis of seed growth, study of the Earth by means of 
high-resolution images taken from the space station, 
spatial differences in carbon dioxide concentration, 
physiological tests involving a resistive exercise de- 
vice, and treadmill and medical evaluations of loco- 
motion and heart rate. Scientific activities prepared 
independently by the Russian Space Agency involved 
research in the fields of biotechnology, biomedical 
research, geophysical research, and engineering. 

Expedition Two continues construction. Expedition One 
was succeeded by Expedition Two, which launched 
aboard space shuttle Discovery on March 8, 2001, 
and docked with the International Space Station 
the following day. The Expedition One crew re- 
turned to Earth aboard Discovery, which undocked 
from the International Space Station on March 
18, 2001, and landed at Edwards Air Force Base 2 
days later. Their highly successful mission had lasted 
138 days 17 hours 39 minutes. 

Expedition Two also had a three-member crew. 
The commander was Russian cosmonaut Yuri 
Usachev, and the two flight engineers were U.S. Air 


Force Lieutenant Colonel Susan Helms and U.S. Army 
Colonel James Voss. On their arrival flight, the space 
shuttle Discovery carried the first multipurpose lo- 
gistics module, which was built by the Italian Space 
Agency and named Leonardo. The logistics modules 
are reusable cargo carriers designed specifically for 
space-flight operations. The space shuttle Endeav- 
our with a crew of seven launched from Kennedy 
Space Center on April 19, 2001, to deliver additional 
station-building materials and instrumentation. 
Unique among the cargo in this flight was the space 
station’s remote manipulator system or robotic arm, 
and another multipurpose logistics module, called 
Rafaello. 

Follow-on Expeditions. Through Expedition Fourteen 
and continuing with Expeditions until the JSS is no 
longer viable as a platform for exploration and re- 
search, the Expedition crews will serve onboard the 
ISS for stays between 4 and 6 months. In addition to 
the tasks of helping build, maintain, and sustain the 
ISS, the Expedition crews conduct science activities 
daily in a variety of fields such as human research, 
physical and life sciences, technology demonstra- 
tions, Earth observations, and education activities. 
These efforts were interrupted briefly after the loss 
of the space shuttle Columbia in February 2003, 
and United States space shuttle flights to the Inter- 
national Space Station were suspended until July 
2005. 

Given the JSS status as an advanced orbiting lab- 
oratory operating in zero gravity, the opportunities 
to study human endurance in space and to test new 
technologies and techniques for space exploration 
are endless. Human medical research, which devel- 
ops the knowledge essential to send space explorers 
beyond Earth’s orbit and deep into the solar system, 
focuses both on the effects of living in space and zero 
gravity and the countermeasures necessary to reduce 
the health risks associated with space travel. The goal 
is to defeat the harmful effects of space on human 
health. Research in the physical and life sciences 
is intended to use the environment of microgravity 
to study the growth of proteins, cells, crystals, and 
other biological structures. Technology demonstra- 
tions help develop new technologies used in space- 
craft materials and systems that function in the mi- 
crogravity environment. Finally, Earth observations 
and education activities link the global population 
with the work and mission on board the station, in- 
spiring humankind to continue to explore and use 
space as a resource for improving life on Earth. 

Future development. In addition to serving as a test 
bed and springboard for future space exploration, 
the JSS will serve as a research laboratory for inno- 
vation in science and technology. Early research on 
the JSS has focused on such topics as biomedical re- 
search and countermeasures to understand and con- 
trol the effects of space and zero gravity on crew 
members; biological study of gravity’s influence on 
the evolution, development, and internal processes 
of plants and animals; biotechnology to develop su- 
perior protein crystals for drug development; and 


fluid physics. Advanced research will be oriented 
toward topics such as human support technology, 
materials science, combustion science, and funda- 
mental physics. 

Construction of the International Space Station is 
projected to be completed in 2010. The ISS Program 
will continue to coordinate the training and opera- 
tions of international flight crews, use vehicles built 
in multiple countries and launched at multiple sites, 
integrate a worldwide communications network that 
extends into deep space, and unite the international 
scientific and engineering communities. 

When the International Space Station is com- 
pleted, it will host an international crew of six mem- 
bers. Training for these crews is projected for a pe- 
riod between 3 and 5 years. Expedition crews will 
live and work aboard the station for a period of 3 to 
6 months. The flight records of these expeditions 
and the results of the research conducted by the 
crews will be available to the international commu- 
nity. See SPACE FLIGHT; SPACE TECHNOLOGY; SPACE- 
CRAFT STRUCTURE. James J. Pelosi 
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Space technology 


The systematic application of engineering and scien- 
tific disciplines to the exploration and utilization of 
outer space. Since 1957, space satellites and probes 
have been launched beyond the Earth’s atmosphere 
to orbit the Earth (velocity of 7 km/s or 4.4 mi/s) or to 
escape from it (velocity 11 km/s or 6.9 mi/s). Due to 
the severe environmental conditions in space, such 
as high vacuum, intense solar radiation, and extremes 
of temperature, a variety of disciplines are required 
to design, build, test, and launch a successful space- 
craft. See ESCAPE VELOCITY. 

Space technology developed so that spacecraft 
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and humans could function in this environment that 
is so different from the Earth’s surface. Conditions 
that humans take for granted do not exist in outer 
space. Objects do not fall. There is no atmosphere 
to breathe, to keep people warm in the shade, to 
transport heat by convection, or to enable the burn- 
ing of fuels. Stars do not twinkle. Liquids evaporate 
very quickly and are deposited on nearby surfaces. 
The solar wind sends electrons to charge the space- 
craft, with lightninglike discharges that may damage 
the craft. Cosmic rays and solar protons damage elec- 
tronic circuits and human flesh. The vast distances 
require reliable structures, electronics, mechanisms, 
and software to enable the craft to perform when 
it gets to its goal—and all of this with the design 
requirement that the spacecraft be the smallest and 
lightest it can be while still operating as reliably as 
possible. 


Basic Technology 


All spacecraft designs have some common features: 
structure and materials, electrical power and storage, 
tracking and guidance, thermal control, and propul- 
sion. 

Structure and materials. The spacecraft structure 
is designed to survive the forces of launch G g for a 
space shuttle launch) and to be lifted by cranes, at- 
tached to the vehicle, and otherwise handled during 
ground processing. The structure is made of metals 
(aluminum, beryllium, magnesium, titanium) or a 
composite (boron/epoxy, graphite/epoxy). It must 
also fit the envelope of the launcher. For the space 
shuttle this is 5 m (16.5 ft) in diameter and the length 
of a bus. For expendable launchers, the spacecraft 
is enclosed in a fairing, a cylindrical volume that 
varies with the launcher. The volume of the fairing 
and shuttle is not large enough to permit fully de- 
ployed solar arrays and other devices during launch. 
The spacecraft must also have structures that extend 
out so that sensors will be far enough away from 
sources of interference, such as radioisotope thermal 
electric generators (RTGs), to be useful, or outside 
the magnetic fields of the spacecraft. See SPACE 
SHUTTLE. 

The use of fluids and gases presents unique prob- 
lems. In the case of the Hubble Space Telescope, 
the optics are so sensitive that traditional thrusters 
cannot be used—they would deposit contaminants 
on the mirrors and ruin the clarity of the optics. 
Batteries, heat pipes, ammonia coolant loops, and 
thrusters must be analyzed to determine how their 
leakage would affect the mission. For long space mis- 
sions, care must be taken to have exterior surfaces 
that are resistant to atomic oxygen, high solar inten- 
sity, or extremes of heat and cold. Sliding surfaces or 
mechanical bearings requiring lubricants are usually 
placed inside a sealed container. When they are not, 
liquid lubricant replenishment is provided, or special 
solid lubricants may be used. Graphite, which is an 
excellent lubricant on Earth, becomes a harsh lubri- 
cant in space when absorbed gases leave its surface. 
Special dry lubricants or magnetic bearings need to 
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be used. See SPACE TELESCOPE, HUBBLE; SPACECRAFT 
STRUCTURE. 

Thermal control. The average temperature of a 
spacecraft is determined by equating the heat input 
with the heat radiated away. The major power input 
for spacecraft located between Earth orbit and the 
orbit of Mercury is solar radiation. This is equal to 
the product of the solar constant (approximately 
1370 W/m? near the Earth and 5900 W/m? near 
Mercury), the area facing the Sun, a, and the 
average absorptivity of the surface, a (0 to 1). The 
heat radiated away is the product of the emissivity of 
the surface, € (0 to 1), the surface area, A, the Stefan- 
Boltzmann constant o [5.67 x 1078 W(m?/K5], and 
the fourth power of the absolute temperature. For a 
sunlit spacecraft that does not generate heat, the tem- 
perature can be obtained from Eq. (1). Note that the 
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temperature varies with the properties of the surface 
(a/e), the orientation of the spacecraft (a/A), and the 
intensity of the solar radiation (S). As the spacecraft’s 
time in space increases, the surface characteristics 
change due to ultraviolet exposure, atomic oxygen 
attack, and micrometeoroid impact. See SOLAR CON- 
STANT. 

Spacecraft generally radiate heat internally; thus 
the above equation is a simplification. Radio trans- 
mitters generate many watts of heat, as do other 
electrical devices on board. Externally, the reflected 
light from the Earth or other celestial bodies adds 
to the external heat absorbed. The spacecraft’s tem- 
perature will drop radically when these power inputs 
drop, as when the spacecraft passes into the shadow 
of the Earth and the solar radiation is eclipsed. For 
example, the temperature of an eclipsed deployed 
solar array of a spacecraft in geostationary orbit may 
drop to —100 K (—280°F). 

Temperatures inside the spacecraft depend on 
thermal radiation between surfaces, thermal conduc- 
tion through solid components, and heat dissipation 
of electrical components at various locations. These 
in turn depend on the surface coating (paint, metallic 
surface, or other coating). To maintain temperatures 
at acceptable limits, various active and passive de- 
vices are used: coatings or surfaces with special ab- 
sorptivities and emissivities, numerous types of ther- 
mal insulation, such as multilayer insulation (MLD 
and aerogel, mechanical louvers to vary the heat radi- 
ated to space, heat pipes, electrical resistive heaters, 
or radioisotope heating units (RHUs). 

Multilayer insulation has been used extensively on 
spacecraft. It consists of layers of insulation between 
layers that reflect radiation. The atmosphere be- 
tween the layers is removed in the vacuum of space, 
which prevents the conduction of heat. It is tens to 
hundreds of times more effective than foam or fiber- 
glass batting. 

Heat pipes are sealed tubes or rectangular pipes 
in which a liquid is evaporated at the hot end and 
the vapor flows to the cold end where it condenses 


and travels back to the hot end via a wick. They have 
the advantage of no moving parts and transferring 
200 times the heat of a solid pipe of the same cross- 
sectional area. See HEAT PIPE; TEMPERATURE. 

Tracking and positioning. The location of a space- 
craft can be measured by determining its distance 
from the transit time of radio signals or by mea- 
suring the direction of received radio signals, or by 
both. Data are usually taken over a period of hours 
or days, and the best orbit fit is determined numer- 
ically by computer. Distances are often measured 
by transmitting tones of different frequencies and 
finding the phase relations of the returned signal. 
The Intelsat ranging system uses four frequencies 
from 35 to 27,777 Hz, while the Goddard Space 
Flight Center range and range rate (GRARR) system 
transmits tones from 8 to 500,000 Hz. The direc- 
tion of a spacecraft can be determined by turning 
the Earth station antenna to obtain the maximum 
signal, or by other equivalent and more accurate 
methods. See ASTRONAUTICS; CELESTIAL MECHANICS; 
GUIDANCE SYSTEMS; ORBITAL MOTION; SPACE NAVI- 
GATION AND GUIDANCE. 

Propulsion. The velocity of a spacecraft is changed 
by firing thrusters. Solid propellant thrusters are 
rarely used. Liquid propellant thrusters are either 
monopropellant or bipropellant. Monopropellant 
thrusters use a fuel such as hydrazine or hydrogen 
peroxide that decomposes when it comes in con- 
tact with a catalyst. Bipropellant thrusters use a fuel 
and oxidizer that ignite when the two combine in 
the thruster. Electric thrusters, such as mercury or 
cesium ion thrusters, have also been used. Electric 
thrusters have the highest efficiency (specific im- 
pulse) but the lowest thrust. The Deep Space 1 space- 
craft, which visited an asteroid and was expected 
to visit Comet Borrelly in late September 2001, 
uses electric thrusters to accelerate xenon ions to 
109,435 km/h (68,000 mi/h). See HYDRAZINE; INTER- 
PLANETARY PROPULSION; ION PROPULSION; PROPEL- 
LANT; SPACECRAFT PROPULSION; SPECIFIC IMPULSE. 

Attitude control. Most spacecraft are spin-stabilized 
or are three-axis body-stabilized. The former uses the 
principles of a gyroscope; the latter uses sensors 
and thrusters to maintain orientation. Some body- 
stabilized spacecraft (such as astronomical observa- 
tories) are fixed in inertial space, while others (such 
as Earth observatories) have an axis pointed at the 
Earth and rotate once per orbit. A body-stabilized 
spacecraft is simpler than a spinner but requires 
more hardware. The orientation of a spacecraft is 
measured with Sun sensors (the simplest method), 
star trackers (the most accurate), and horizon (Earth 
or other body) or radio-frequency (rf) sensors (usu- 
ally to determine the direction toward the Earth). 
Attitude corrections are made by small thrusters or 
by reaction or momentum wheels; as the motor ap- 
plies a torque to accelerate or decelerate the rota- 
tion, an equal and opposite torque is imparted to 
the spacecraft. Torques can also be exerted by 
turning the axis of a momentum wheel. When a 
wheel reaches its mechanical limit of rotation speed, 


momentum is “dumped” by firing thrusters; that is, 
as the wheel decelerates, the thrusters are fired to 
exert a torque equal and opposite to the torque of the 
decelerating wheels. Momentum wheels are used 
on the Hubble Space Telescope because thrusters 
would deposite contaminants on its mirror. To re- 
duce size and weight, momentum wheels are being 
considered for both attitude control and power stor- 
age. See STAR TRACKER. 

Electrical power. Primary electrical power is most 
often provided by solar cells made from a thin sec- 
tion of crystalline silicon protected by a thin glass 
cover. Gallium arsenide solar cells are beginning to 
replace silicon, despite their increased weight, be- 
cause they are more efficient. The crystal contains 
dopants to produce extra electrons on one side and 
holes on the other side, forming a pn junction. The 
n-type material is on the sunlit surface, since it is 
more resistant to the damage caused by the radiation 
in the Van Allen belts. Sunlight creates hole-electron 
pairs in the junction, and the electric field caused by 
the space charge layer in the junction sends the elec- 
trons toward a metal grid, and the holes toward the 
base electrode. A 2 x 4 cm (0.8 x 1.6 in.) cell pro- 
vides about 150 mW of power at 10-18% efficiency 
for silicon cells and up to 23% for gallium arsenide 
cells. See SOLAR CELL. 

Excess power from the solar cells is stored in 
rechargeable batteries so that when power is in- 
terrupted during an eclipse, it can be drawn from 
the batteries. Nickel-cadmium batteries were used 
widely but are being replaced by nickel-hydrogen 
batteries, which provide higher energy per weight 
and higher energy per volume, properties that are 
very desirable for spacecraft. Other batteries with 
even better properties are being tested, such as 
lithium ion batteries, which up to now have been 
considered too dangerous, since they can explode 
or catch fire. Electrical power can be stored with 
momentum wheels. On the International Space Sta- 
tion, a flywheel energy storage system was being 
designed to replace the nickel-hydrogen batteries. 
The lifetime of batteries is limited by the tempera- 
ture control maintained and the depth of discharge 
during operation. The latter is the fraction of power 
taken from the battery during each charge-discharge 
cycle. Flywheels are not limited by temperature or 
depth of discharge. They can be charged and dis- 
charged indefinitely. See BATTERY. 

Other sources of power generation include fuel 
cells, radioisotope thermoelectric generators 
(RTGs), tethers, and solar dynamic power. Fuel cells 
have been used on the Apollo and space shuttle pro- 
grams and produce a considerable amount of power, 
with drinkable water as a by-product. Fuel cells use 
the same principles as a battery, but employ a fuel 
and oxidizer to produce electricity and water at very 
high efficiencies (70%). RTGs use the heat of decay 
of radioisotopes to produce electricity. They are 
very useful for spacecraft at distances of Mars and 
beyond where the solar flux would require imprac- 
tically huge arrays of solar cells to produce the same 
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amount of power. RTGs have low efficiencies (9%), 
and the radioisotope of choice, plutonium, is very 
expensive. Tethers use a kilometers-long conducting 
wire moving through the magnetic field of the Earth 
to generate electricity, the same principle as a 
generator. They have been tested on the 7SS-1R 
mission and produced voltages as high as 3500 V 
and current levels of 480 mA. Solar dynamic power 
is being considered for the space station. It uses the 
Sun’s energy to heat a fluid to turn a turbine, much 
like terrestrial power plants. These devices have 
the advantage of high power and high efficiency 
(30%) while being small and not producing as 
much atmospheric friction as the large solar arrays. 
See ELECTROMAGNETIC INDUCTION; FUEL CELL; 
NUCLEAR BATTERY; SOLAR ENERGY; SPACE POWER 
SYSTEMS. 

Telemetry and command. The status and condition 
of a spacecraft are determined by telemetry. Tem- 
peratures, voltages, switch status, pressures, sensor 
data, and many other measurements are transformed 
into voltages, encoded into pulses, and transmitted 
to Earth. This information is received and decoded 
at the spacecraft control center. Desired commands 
are encoded and transmitted from the control cen- 
ter, received by the satellite, and distributed to the 
appropriate subsystem. Commands are often used 
to turn equipment on or off, switch to redundant 
equipment, make necessary adjustments, and fire 
thrusters and pyrotechnic devices. See SPACE COM- 
MUNICATIONS; SPACECRAFT GROUND INSTRUMENTA- 
TION; TELEMETERING. 


Special-Purpose Technology 


Many spacecraft missions have special requirements 
and hence necessitate special equipment. 

Space probes. Satellites that leave the Earth’s grav- 
itational field to travel around the Sun and visit other 
planets have special requirements. Distances are 
greater (on the order of 1.8 x 10° mi or 3 x 10% km), 
mission times are longer (years), and the intensity of 
the Sun’s radiation may change markedly (inversely 
proportional to the square of the distance from the 
Sun). With greater distances, communications are 
more difficult, and information must be transmitted 
at a slower rate. For probes that go closer to the Sun, 
the higher intensity of sunlight poses a severe ther- 
mal problem. For probes that go to the outer planets 
(Jupiter, Saturn, and beyond), the decrease in solar 
intensity makes solar cells less effective, requiring 
other power sources such as radioisotope thermo- 
electric generators. See SPACE PROBE. 

Reentry vehicles. Spacecraft that return to Earth 
require special protection. Retro-rockets are used to 
reduce the orbital velocity so that the satellite falls 
and enters the Earth’s atmosphere. Most of the de- 
crease in velocity is produced by atmospheric fric- 
tion; in the last stage this may be augmented by de- 
ploying parachutes. On entering the atmosphere, the 
velocity is extreme (4.4-6.9 mi/s or 7-11 km/s), and 
the friction generates high temperatures (thousands 
of degrees). One type of protection is an ablative 
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shield. This shield, which was used on Apollo cap- 
sules, was designed to melt and vaporize, dissipating 
heat in the process. On the shuttle, a special thermal 
shield made of glass fibers can withstand the high 
temperatures, with enough insulation to protect the 
interior. 

The shape of a reentry spacecraft is designed to 
maintain a stable and desired attitude during reentry. 
Communications during part of the reentry are cut 
off by the high-density plasma sheath formed around 
the spacecraft, through which radio-frequency sig- 
nals will not penetrate. Final recovery may be ac- 
complished in the air (film reconnaissance satellite, 
as in the Discoverer program), in the ocean (Apollo 
astronauts), or on land (shuttle and Russian cosmo- 
nauts). 

For missions to other planets, aerobraking has 
been used. Aerobraking uses the atmosphere of the 
planet instead of using retrorockets to slow the 
spacecraft. The Magellan spacecraft used aerobrak- 
ing in 1994 to achieve orbit of Venus, and the Mars 
Global Surveyor used aerobraking in the much thin- 
ner atmosphere of Mars to achieve orbit. See ATMO- 
SPHERIC ENTRY; NOSE CONE. 

Crewed spacecraft. Technology must be devel- 
oped to meet the many needs associated with human 
survival in space. Oxygen equivalent to one-fifth of 
an atmosphere is needed for breathing; the remain- 
ing nitrogen is desirable but not essential. The early 
Soviet cosmonauts had an atmospheric pressure es- 
sentially equivalent to that on Earth; United States 
astronauts had much less nitrogen, with a total pres- 
sure of only one-third of an atmosphere. This lower 
pressure is satisfactory for breathing, but increases 
the fire hazard and does not carry sound as far (only 
16 ft or 5 m for normal conversation). Provision must 
be made for adding oxygen and removing carbon 
dioxide from the air, for supplying food and water, 
and for disposing of human wastes. For extravehicu- 
lar activity (EVA), a special suit is required to provide 
oxygen for breathing and to maintain adequate pres- 
sure. 

The lack of gravity produces a number of techno- 
logical requirements for astronauts living in space. 
Physically, it is just as easy (or just as hard) to walk 
on the ceiling as to walk on the floor, but an astronaut 
feels more comfortable in a room with a local verti- 
cal, that is, with one surface that looks like a floor 
and another that looks like a ceiling. Objects tend 
to float in space so that restraints are necessary: me- 
chanical, elastic, or magnetic. Showers require spe- 
cial vacuum hoses to remove the water, since it does 
not run down the drain. Astronauts can be weighed 
by rocking in a special chair fitted with springs; the 
astronaut’s mass is determined from the frequency 
of rocking. See SPACE BIOLOGY; SPACE FLIGHT; SPACE 
STATION; WEIGHTLESSNESS. 

Rendezvous. In some missions one spacecraft 
must find, approach, and make contact with another 
spacecraft. One active spacecraft performs the 
maneuvering by firing thrusters, while the other 
spacecraft (target) remains passive. During the far ap- 
proach, the orbits of the two spacecraft are deter- 


mined by conventional means, and maneuvers are 
calculated to bring the spacecraft closer together. 
During the near approach, the position of the tar- 
get spacecraft is determined directly from the active 
spacecraft, sometimes by radio and sometimes by 
visual contact. For final approach, a closing veloc- 
ity is maintained with the proper attitude, so that 
contact is made with the desired surfaces. Usually a 
probe mates with an appropriate drogue, hooks hold 
the two spacecraft together, and mechanical devices 
then firmly attach one spacecraft to the other. 

For the space shuttle, the Ku-band antenna also 
serves as a radar antenna for final approach and ren- 
dezvous. When the shuttle docks with the Interna- 
tional Space Station, it uses docking hardware man- 
ufactured in Russia. 


Reliability 


Space is distant not only in kilometers but also in 
difficulty of approach. Large velocity changes are 
needed to place objects in space, which are then dif- 
ficult to repair and expensive to replace. Therefore 
spacecraft must function when they are launched, 
and continue to function for days, months, or years. 
The task is similar to that of building a car that will 
go 125,000 mi (200,000 km) without requiring me- 
chanical repair or refueling. Not only must space 
technology build a variety of parts for many missions, 
but it must achieve a reliability far greater than the 
average. This is accomplished by building inherent 
reliability into components and adding redundant 
subsystems, supported by a rigorous test schedule 
before launch. Efforts are made to reduce the num- 
ber of single points of failure, that is, components 
that are essential to mission success and cannot be 
bypassed or made redundant. 

Electronics. The failure rate for an electronic sub- 
system is often estimated by adding up the failure 
rates of its components. The failure rate of each com- 
ponent is assumed to be independent of that of other 
components, and the failure of a subsystem. While 
electronic parts are highly reliable, a subsystem has 
so many parts that there may be an appreciable 
chance of failure. Component failure rates are min- 
imized by special manufacture, quality control, and 
derating, that is, operation at voltages or tempera- 
tures which are lower than the maximum permitted. 

If an electronic subsystem does not have the de- 
sired reliability, a redundant unit can be included. If 
either of the two subsystems continues to operate, 
the mission can still be accomplished. Mathemati- 
cally, the probability that at least one of the subsys- 
tems is operating, Rj2, is given by Eq. (2), where R is 
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the reliability of the single subsystem. For example, 
if each subsystem has a reliability, R, of 0.98, then 
the probability that at least one subsystem is work- 
ing is 0.9996, which is a significant improvement. 
The equation above is based on the assumption that 
failures of the two subsystems are not related; this 
may not be true when two units are built by the 


same manufacturer at the same time. See RELIABIL- 
ITY, AVAILABILITY, AND MAINTAINABILITY. 

Mechanical systems. While there are fewer me- 
chanical parts than electrical parts on most space- 
craft, mechanical parts tend to be less reliable than 
electrical ones. Deployment mechanisms usually 
work only once, but must work correctly the first 
time. Bearings must be lubricated for their ex- 
pected lifetimes, and special lubricants must be used. 
Torque margins are increased, so that even if unusu- 
ally high torques are encountered, the motors will 
still keep the device turning. Electromechanical re- 
lays should have currents reduced before they are 
opened, to prevent arcing and welding of contacts 
in vacuum. 

Testing. The key to a reliable spacecraft is exten- 
sive testing before launch. This ensures not only that 
every spacecraft part is adequately functioning, but 
also that the spacecraft is subjected to the differ- 
ing environments expected in space, and in some 
cases is overstressed to find the weak components. 
Tests are performed first on components, then sub- 
systems, and finally, when feasible, on fully assem- 
bled spacecraft. The testing program requires many 
weeks and consists of three main parts: electrical per- 
formance, vibration, and thermal-vacuum tests. Elec- 
trical testing includes evaluation of the spacecraft’s 
ability to perform a variety of operations which will 
be required in space. Antennas may be separately 
tested on an antenna range, where the effect of reflec- 
tions from nearby objects is minimized. Steady-state 
acceleration may be evaluated with a static test, in 
which mechanical loads are placed on various parts 
of the structure, or with a large centrifuge. Vibration 
tests, simulating the launch environment, subject the 
spacecraft to low-frequency vibration (5-100 Hz), 
acoustic vibration (20-10,000 Hz), and pyrotechnic 
shock conditions. Many of these tests can be accom- 
plished with an electromechanical vibrator. Thermal- 
vacuum tests consist of placing the spacecraft in a 
high-vacuum chamber (less than 107° torr or 107° 
pascal) and simulating the temperatures that will 
occur in space. See ENVIRONMENTAL TEST; INSPEC- 
TION AND TESTING; MECHANICAL VIBRATION; QUAL- 
ITY CONTROL. 

Sometimes the first spacecraft tested (prototype) 
remains on the ground, and only subsequent (flight) 
spacecraft are launched. The prototype spacecraft is 
often less reliable because of modifications made, 
tests which must be repeated, and planned over- 
stressing in tests. Most spacecraft perform their mis- 
sions successfully, but often unexpected anomalies 
occur. Redundant components may fail, and sub- 
systems, although performing adequately, may not 
quite meet design specifications. Telemetered per- 
formance is analyzed, problems are diagnosed, and 
changes are proposed in the design and testing of 
future spacecraft. In a few cases, space technology 
improves the spacecraft performance in space, and 
appropriate rf commands are sent to modify its op- 
eration, to reset parameters, to modify software, or 
to fire thrusters which will reorient the spacecraft or 
shake it to expel a bubble in a fuel line. 


Testing must be timed correctly during the as- 
sembly process. The Tethered Satellite System 1 was 
fully tested to ensure that the satellite with its tether 
would deploy properly. A modification was then ap- 
proved that ultimately caused the tether not to de- 
ploy fully, resulting in a partially failed mission. The 
International Space Station has the unique prob- 
lem that some of the components are not connected 
to each other until they are in orbit. NASA has a 
system using state-of-the-art computer-aided-design- 
and-manufacturing (CAD/CAM) software and pho- 
tography to determine that the components will 
mate to each other mechanically. NASA also decided 
to conduct tests to ensure that the electrical connec- 
tions from one component to another were compat- 
ible and that the software would command and con- 
trol each of the elements. NASA and Boeing had to 
build emulators of the components already in orbit, 
and will continue to do so as the station is assembled. 
These multielement integrated tests add further ex- 
pense, but are necessary to prove that the space sta- 
tion will work when fully assembled. See ASTRONAU- 
TICAL ENGINEERING; COMPUTER-AIDED DESIGN AND 
MANUFACTURING; SATELLITE (SPACECRAFT); SPACE. 
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Space-time 

A term used to denote the geometry of the physical 
universe as suggested by the theory of relativity. It is 
also called space-time continuum. Whereas in new- 
tonian physics space and time had been considered 
quite separate entities, A. Einstein and H. Minkowski 
showed that they are actually intimately intertwined. 
I. Newton’s ideas on space and time are summarized 
in the following list: 

1. Given two events, each of which is clearly lo- 
calized in space and lasts only for an instant in time, 
such as two strokes of lightning striking small tar- 
gets, all observers of the events will be in agreement 
as to which of the two events took place earlier in 
time, or whether they were actually simultaneous. 

2. If the events were not simultaneous, the inter- 
val of time between them is an absolute entity, agreed 
on by all competent observers. 

3. The spatial distance between two simultaneous 
events is an absolute entity, agreed on by all compe- 
tent observers. 
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Of these three, the first assumption, concerning 
the concept of simultaneity of distant events, is 
the crucial one, the other two depending on it. Si- 
multaneity, however, can be given an unambiguous 
meaning only if there is available some instantaneous 
method of signaling over finite distances. Actually, 
according to the theory of relativity, the greatest 
speed of transmission of intelligence of any kind is 
the speed of light c, equaling about 3 x 10!° cm/s. 
Moreover, any signal traveling precisely at the speed 
c appears to travel at that same speed to all con- 
ceivable observers, regardless of their own states of 
motion. This is the only reasonable interpretation of 
the results of the Michelson-Morley experiment and 
the effect of aberration. Accordingly, the question of 
whether two given events are simultaneous or not 
can be decided only with the help of signals that at 
best have traveled from the sites of these events to 
the station of the observer at the speed of light. See 
ABERRATION OF LIGHT; LIGHT; RELATIVITY. 

Under these circumstances, Einstein showed that 
in general two observers, each using the same tech- 
niques of observation but being in motion relative to 
each other, will disagree concerning the simultaneity 
of distant events. But if they do disagree, they are also 
unable to compare unequivocally the rates of clocks 
moving in different ways, or the lengths of scales 
and measuring rods. Instead, clock rates and scale 
lengths of different observers and different frames 
of reference must be established so as to assure the 
principal observed fact. Each observer, using his or 
her own clocks and scales, must measure the same 
speed of propagation of light. This requirement leads 
to a set of relationships known as the Lorentz trans- 
formations. See LORENTZ TRANSFORMATIONS. 

In accordance with the Lorentz transformations, 
both the time interval and the spatial distance be- 
tween two events are relative quantities, depending 
on the state of motion of the observer who carries 
out the measurements. There is, however, a new ab- 
solute quantity that takes the place of the two former 
quantities. It is known as the invariant, or proper, 
space-time interval t and is defined by Eq. (1), where 
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T is the ordinary time interval, R the distance be- 
tween the two events, and c the speed of light in 
empty space. Whereas 7 and R are different for dif- 
ferent observers, t has the same value. In the event 
that Eq. (1) would render t imaginary, its place may 
be taken by o, defined by Eq. (2). If both t and o are 
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zero, then a light signal leaving the location of one 
event while it is taking place will reach the location 
of the other event precisely at the instant the signal 
from the latter is coming forth. 

The existence of a single invariant interval led the 
mathematician Minkowski to conceive of the totality 
of space and time as a single four-dimensional contin- 
uum, which is often referred to as the Minkowski uni- 


verse. In this universe, the history of a single space 
point in the course of time must be considered as 
a curve (or line), whereas an event, limited in both 
space and time, represents a point. So that these ge- 
ometric concepts in the Minkowski universe may be 
distinguished from their analogs in ordinary three- 
dimensional space, they are referred to as world 
curves (world lines) and world points, respectively. 

Minkowski geometry. The geometry of the Min- 
kowski universe in some respects resembles the ge- 
ometry of ordinary (euclidean) space but differs from 
it in others. The Minkowski universe has four dimen- 
sions instead of the three dimensions of ordinary 
space; that is to say, for a complete identification a 
world point requires four pieces of data, for instance, 
three space coordinates and a time reading. But there 
are in the Minkowski universe world points, world 
lines, two-dimensional surfaces (including planes), 
three-dimensional surfaces (often called hypersur- 
faces), and four-dimensional domains. A hypersur- 
face may, for instance, be a spatial domain (volume) 
at one instant in time, or it may be a two-dimensional 
(ordinary) surface for an extended period of time. 
Thus one may form all the geometric figures that are 
also possible in a four-dimensional euclidean space. 

The indefinite metric. In a euclidean space there are 
the cartesian coordinate systems, those rectilinear 
systems of coordinates that are mutually perpendic- 
ular and whose coordinate values correspond to real 
lengths. The distance S$ between two points whose 
coordinate differences are X, Y, and Z, respectively, 
is given by Eq. (3). The coordinate transformations 


SC=X +Y +Z 6) 


that lead from one cartesian coordinate system to 
another are called orthogonal coordinate transfor- 
mations. Formally, they are the coordinate transfor- 
mations that preserve the precise form of Eq. (3). 
Likewise, the Lorentz transformations preserve the 
precise form of Eqs. (1) and (2), respectively. A co- 
ordinate system in which these two equations hold 
is called a Lorentzian frame of reference. 

Whereas the form on the right of Eq. G) is pos- 
itive definite, that is, always greater than or equal 
to zero, the right-hand sides of Eqs. (1) and (2) are 
indefinite; that is, they may be positive or negative. 
This fact represents the single but all-important dif- 
ference between a four-dimensional euclidean space 
and a Minkowski universe. The forms (1), (2), and 3) 
are called metrics. Hence the Minkowski universe is 
said to possess an indefinite metric. 

As a result of the indefinite character of the met- 
ric, a triangle in the Minkowski universe may possess 
one side that is longer than the sum of the two oth- 
ers; conversely, one of the three sides may have the 
length zero. Depending on whether t or ø is real, 
or both vanish, an interval is classified as timelike, 
spacelike, or a null-interval (see illus.) 

Improper Lorentz transformations. In the Minkowski uni- 
verse there are three different types of Lorentz trans- 
formations involving some kind of reflection, in 
addition to the more usual Lorentz transformations 


(called proper Lorentz transformations). The first 
type of improper Lorentz transformation changes the 
sign of all three spatial coordinates but leaves the 
sense of the time axis unchanged. This transforma- 
tion changes a right-handed screw into a left-handed 
screw; it is also called a parity transformation. The 
second type of improper Lorentz transformation in- 
terchanges the future with the past but leaves the 
space coordinates unchanged. This transformation 
is called time reversal. The third improper Lorentz 
transformation reflects both the space and the time 
coordinates; it bears no special name of its own. The 
original arguments which led to the formulation of 
the special theory of relativity all support the propo- 
sition that the laws of nature are invariant under 
proper Lorentz transformations. They are inconclu- 
sive as to whether the laws of nature should also 
be invariant under the improper Lorentz transforma- 
tions. The laws of mechanics and of electrodynam- 
ics have this property; the second law of thermody- 
namics distinguishes between past and future in a 
nonsymmetric manner, but it is usually assumed that 
this feature of thermodynamics is to be explained 
by its statistical nature and that it has no bearing on 
the properties of the underlying basic dynamics. See 
SYMMETRY LAWS (PHYSICS). 

Until recent times, it had therefore been taken for 
granted that the basic laws of nature should make 
no distinction between right-handed and left-handed 
screws, and that they should not discriminate be- 
tween future and past. Certain difficulties in the in- 
terpretation of the decay of K mesons led T. D. Lee 
and C. N. Yang to suspect that these assumptions 
might not be tenable, and they suggested some ex- 
periments on meson decay and on radioactive beta 
decay, which showed that in these particle transfor- 
mations nature certainly discriminates between left- 
handed and right-handed screws. See PARITY (QUAN- 
TUM MECHANICS). 

Neutrinos are electrically neutral particles which 
travel at the speed of light and are endowed with 
an intrinsic spin equal to that of electrons. It ap- 
pears that the neutrino always spins counterclock- 
wise when viewed in the direction of its travel, while 
its antiparticle, the antineutrino, always spins clock- 
wise. (As far as is known, this holds true of each of 
the presently known varieties of neutrinos.) The ex- 
act role of the improper Lorentz transformations in 
nature is under intensive theoretical and experimen- 
tal investigation. See NEUTRINO. 

Curved space-time. Whereas the Minkowski uni- 
verse is the appropriate geometric model for the 
special theory of relativity, the general theory of 
relativity makes use of a further generalization. In 
the Minkowski universe a particle that is not sub- 
ject to external forces and which therefore travels 
along a straight line Gin ordinary space) and at a uni- 
form speed is represented by a straight world line. 
In general relativity, an external gravitational force 
is indistinguishable from an inertial force, which in 
special relativity would arise if a non-lorentzian frame 
of reference were to be employed. Accordingly one 
requires a four-dimensional space in which it is im- 
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Two-dimensional sketch of three possible classes of intervals. The null directions actually 


form a three-dimensional surface having the shape of a double cone, called the light 
cone. An alternative Lorentz frame is indicated by broken lines. 


possible to distinguish between a lorentzian and a 
non-lorentzian frame of reference whenever a gravi- 
tational field is present. Such a space cannot be flat, 
as the Minkowski universe is, but must be curved. 
Such an aggregate is called a riemannian space. In a 
general riemannian space there are no straight lines 
but only curves. See FRAME OF REFERENCE; GRAVITA- 
TION; RIEMANNIAN GEOMETRY. Peter G. Bergmann 
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Instrumentation located on the Earth for monitoring, 
tracking, and communicating with satellites, space 
probes, and crewed spacecraft. Radars, communica- 
tion antennas, and optical instruments are classified 
as ground instrumentation. They are deployed in net- 
works and, to a lesser extent, in ranges. Ranges are 
relatively narrow chains of ground instruments used 
to follow the flights of missiles, sounding rockets, 
and spacecraft ascending to orbit. Some ranges are 
a few miles long; others, such as the U.S. Air Force’s 
Eastern Test Range, stretch for thousands of miles. 
Networks, in contrast, are dispersed over wide geo- 
graphical areas so that their instruments can follow 
satellites in orbit as the Earth rotates under them at 
15° per hour, or space probes on their flights through 
deep space. 

Networks are of two basic kinds: networks sup- 
porting satellites in Earth orbit, and networks sup- 
porting spacecraft in deep space far from Earth. A 
third concept was added in the 1980s with the Track- 
ing and Data Relay Satellite System (TDRSS), also 
called the Space Network (SN). TDRSS replaced most 
of the ground stations used for Earth orbital support. 
The Tracking and Data Relay Satellites (TDRS) are 
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placed in geosynchronous orbits to relay signals to 
and from other orbiting spacecraft during more than 
85% of each orbit, to and from a single ground sta- 
tion. 

Functions. Ranges and networks have various tech- 
nical functions: 

1. Tracking: determination of the positions and 
velocities of space probes and satellites through 
radio and optical means. 

2. Telemetry: reception of telemetered signals 
from scientific instruments and spacecraft house- 
keeping functions. 

3. Voice reception and transmission: provision for 
communication with the crew of a spacecraft, such 
as the space shuttle. 

4. Command: transmission of coded commands 
to spacecraft equipment, including scientific instru- 
ments. 

5. Television reception and transmission: provi- 
sion for observation of the crew, spacecraft environ- 
ment, and so on. 

6. Ground communications: telemetry, voice, 
television, command, tracking data, and spacecraft 
acquisition data transmission between network sites 
and the central mission control center, and payload 
information to user facilities. 

7. Computing: calculation of orbital elements and 
radar acquisition data prior to transmission to users; 
also, computation of the signals that drive visual dis- 
plays at a mission control center. See TELEMETERING. 

Mission control. From the mission control cen- 
ter, highly trained mission personnel, observing 
computer-generated displays, dispatch commands 
and analyze acquired data through ground communi- 
cation with the appropriate network stations. From 
these stations, command data are dispatched to or- 
biting satellites. 

Tracking can be accomplished by a great variety 
of instruments, broadly classified as optical or radio- 
frequency (rf). In the optical class are cinetheodo- 
lites, ballistic cameras, Baker-Nunn cameras, tele- 
scopes, and laser ranging systems. Radio-frequency 
tracking systems are even more varied and include 
radar, active and passive radio interferometers, radio 
Doppler systems, and range-and-range-rate systems, 
which resemble Doppler radars except that they 
do not measure spacecraft direction. See DOPPLER 
RADAR; OPTICAL TRACKING SYSTEMS; RADAR. 

Ranges. The best-known range in the United States 
is the Eastern Test Range (ETR; formerly called 
the Atlantic Missile Range), which begins at Cape 
Canaveral, Florida. In addition, the Department of 
Defense operates the Western Test Range (WTR) and 
several others. The National Aeronautics and Space 
Administration (NASA) launches most of its space- 
craft from the Eastern Test Range, but also launches 
some sounding rockets and small satellites from its 
Wallops Island facility on the Virginia east coast. Dur- 
ing the shuttle era, the Eastern Space and Missile 
Center (ESMC) within the Eastern Test Range per- 
forms the functions of both the Eastern Test Range 
and the Western Test Range for the shuttle. Each of 
the United States ranges possesses tracking equip- 


ment, telemetry receivers, and electronic equipment 
to perform its functions. Many other countries have 
missile and sounding rocket ranges. 

The Eastern Test Range is one of the longest ranges 
in the world. The range head begins with the NASA 
and military launch pads and blockhouses, on Cape 
Canaveral proper, and those of NASA on nearby Mer- 
ritt Island, Florida. A missile or spacecraft launched 
from one of these areas climbs vertically, then pitches 
toward downrange tracking and communication an- 
tennas. As the rocket moves over the Atlantic, it 
passes within range of stations instrumented to track, 
command, and receive telemetry from the rising 
launch vehicle and its payload. See LAUNCH COM- 
PLEX. 

Cape Canaveral itself is well supplied with instru- 
ments such as the C-band AN/FPS-16 radar, many 
tracking telescopes, and several large Doppler and 
interferometer tracking systems. Similar instruments 
along the range watch each flight. The range stations 
are connected to each other by submarine cable and 
radio and satellite communication links. Geodetic 
surveys, timing signals from radio station WWV of 
the National Institute of Standards and Technology, 
and standardized operating procedures also unify the 
range stations. 

Spacecraft networks. NASA operates two ground- 
based networks (Fig. 1) and the TDRSS. The ground- 
based networks are the Spaceflight Tracking and Data 
Network (STDN), which tracks, commands, and re- 
ceives telemetry from United States and foreign satel- 
lites in Earth orbit; and the Deep Space Network 
(DSN), which performs the same functions for all 
types of spacecraft sent to explore deep space, the 
Moon, and solar system planets. The TDRSS pro- 
vides the same support to Earth orbital spacecraft 
as the STDN. The U.S. Department of Defense op- 
erates two classified networks: the Satellite Control 
Facility (SCF); and the National Range Division Sta- 
tions, which include those of all United States mili- 
tary ranges. Russia and European Space Agency (ESA) 
also maintain similar networks. 

Spaceflight Tracking and Data Network (STDN). The STDN is 
a complex of ground-based stations designed to sup- 
port NASA’s Earth-orbiting scientific and applications 
spacecraft and crewed spaceflight programs such as 
the space shuttle. The operations control center and 
the central computing facility for operation and anal- 
ysis are located at the Goddard Space Flight Center 
in Greenbelt, Maryland (Fig. 1). 

The STDN, operational since May 1971, combined 
the former Space Tracking and Data Acquisition Net- 
work (STADAN) and the Manned Space Flight Net- 
work (MSFN). The crewless STADAN tracked space- 
craft in Earth orbits from ground facilities distributed 
over much of the globe. The minimum tracking 
(minitrack) facilities, implemented in 1958, used an- 
tennas composed of steel rails parallel to the ground. 
The minitrack’s narrow fan pattern received radio 
transmissions from the spacecraft as it passed over- 
head. The STADAN stations used parabolic (dish) an- 
tennas whose conical beams swept the sky to follow 
the orbiting spacecraft. 
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Fig. 1. World map showing the tracking networks for space operations of the National Aeronautics and Space Administration 


(NASA). 


The MSFN provided two-way contact between the 
ground and space for the astronauts in the Mercury, 
Gemini, and Apollo programs. It consisted of both 
ground stations and tracking and support ships that 
could be positioned where ground stations were un- 
available or impractical. The stations were config- 
ured to support the individual requirements of each 
mission whether the mission involved Earth orbit or 
Moon landing. 

In general, the systems needed at each STDN sta- 
tion are determined by the support that the station 
provides and whether the support is directed primar- 
ily toward crewed or crewless space efforts. Within 
the STDN, the unified S-band (USB) system is the 
prime equipment dedicated to tracking, telemetry, 
and two-way communications. It is so named be- 
cause voice and data can be handled on one S-band 
carrier. Stations that use the USB system are further 
identified by the size of their USB antenna systems: 
85-ft (26-m) or 30-ft (9-m) dish diameter. 

The significant disadvantage of the STDN is that 
the antennas are bound to the surface of the Farth. 
This creates a limited field of view because the an- 
tenna can usefully transmit and receive only when a 
spacecraft is overhead. Thus, the average coverage 
that a low-altitude spacecraft may expect to receive 
from the tracking stations is limited to approximately 
15% of its orbit. This amount of coverage is insuffi- 
cient to support newer, more sophisticated space- 
craft with instruments that generate much more 
information. It became impractical to store this in- 
formation on board the spacecraft during the long 
periods when the spacecraft is out of contact with 
ground stations. The most practical solution was to 
construct the TDRSS. 

The TDRSS network has replaced most of the 
STDN stations, substituting the space segment and 
two ground stations for most of the ground stations 
within the STDN. The remaining STDN stations pro- 
vide support to older spacecraft, which are not com- 
patible with the TDRSS, and the special support, 


such as launch and landing support to the shuttle. 

Deep Space Network (DSN). NASA, through the Jet 
Propulsion Laboratory of the California Institute of 
Technology in Pasadena, has established a Deep 
Space Network (DSN), consisting of three Deep 
Space Communication Complexes (DSCC) located 
at Goldstone, California; Madrid, Spain; and Can- 
berra, Australia. The complexes are approximately 
120° apart in longitude, which makes it possible to 
provide continuous communications for planetary 
spacecraft as the Earth rotates. From its beginning 
in 1959, the DSN has supported all the deep space 
probes such as Mariner, Pioneer, Viking, and Voy- 
ager, as well as assisting during the Apollo crewed 
lunar missions. While TDRSS began replacing STDN 
in 1985, DSN services continue to be used for Earth 
orbiters above 7500 mi (12,000 km) because TDRSS 
antenna pointing is designed only for spacecraft up 
to this altitude. 

The Network Control Center at the Jet Propulsion 
Laboratory is the single point of control for the DSN. 
The Network Control Center handles messages trans- 
mitted to and from the spacecraft, and validates DSN- 
generated radio navigation data. The data are then 
sent to Mission Operations for flight path control. 
Additionally, the Network Control Center performs 
research, development, and engineering for the DSN. 

Each DSCC has three or four antennas. A 
main data-capture instrument is the 230-ft 
(70-m) transmit-receive antenna with a parabolic 
surface. The precisely contoured parabolic sur- 
face permits the antenna to operate efficiently at 
short-wavelength radio frequencies (for example, at 
the X-band frequency, principally used to receive 
pictures from the outer planets, the wavelength 
is 3.5 cm). The combination of large area and 
high-frequency operations gives the 230-ft (70-m) 
antenna an extremely narrow field of view (one 
ten-millionth of the sky). The receiver system that 
receives such weak signals from deep space has a 
minuscule noise temperature (20 K), hundreds of 
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times less than that in the best high-fidelity home 
FM radio receivers. The total power received by 
each 230-ft (70-m) antenna in a typical outer planet 
encounter, such as Voyager at Uranus, can be as 
weak as 107 W. Yet, the noise-combating codes 
allow high-resolution pictures to be reconstructed 
with few errors. For example, in January 1986, 
the NASA Voyager 2 spacecraft flew by Uranus. 
A good data rate was maintained by arraying or 
simultaneously combining signals from the DSN 
230-ft (70-m) antenna and the two 112-ft G@4-m) 
DSN antennas at Canberra and the Australian Parkes 
Observatory 210-ft (64-m) radio astronomy antenna 
200 mi (320 km) away. This provided 29,000 bits 
of information per second, equivalent to a color 
picture every 5 min. The result was hundreds of 
color pictures during the brief encounter. Because 
Uranus was in the southern part of the sky in 1986, 
the Australian stations viewed the spacecraft longer 
than the Goldstone or Madrid stations. 

In addition to receiving data, stations must send 
commands to the distant spacecraft to instruct the 
on-board computers what maneuvers to perform and 
what experiments to conduct. The antennas at each 
DSCC can simultaneously send commands to and re- 
ceive data from the spacecraft. Different frequencies 
are used for transmitting and receiving; therefore, 
even though hundreds of kilowatts can be transmit- 
ted, the effect on the extremely weak signals being 
received from the spacecraft is negligible. 

While the antenna is receiving data and transmit- 
ting commands, radio measurements of spacecraft 
paths are conducted for navigation. Because the very 
distant spacecraft often must be targeted to pass 
within small (5-10-mi or 8-16-km) regions near the 
target planets or their satellites or rings, extreme di- 
rectional accuracy is required, in some cases near a 
ten-millionth of a degree. By receiving signals from 
the spacecraft at two DSCCs simultaneously, the time 
delay between reception at the two stations can be 
measured. Since the speed of light and the distance 
and direction between the two stations are known, 
the position of the spacecraft can be determined 
from trigonometry. 

The DSN is also used as a scientific facility in 
radio science experiments. In these experiments, the 
changes in the received signal are carefully measured 
as they pass through a planet’s atmosphere or iono- 
sphere, interplanetary space, the solar corona, or 
the rings around planets. Using these measurements, 
radio scientists can determine the atmospheric pres- 
sure as a function of altitude above the surface. Cou- 
pled with spacecraft instruments and Earth-based 
measurements, such radio science has been a very 
powerful tool for the exploration of the solar system. 

The DSN planetary radar is also used to char- 
acterize the bodies in the solar system. Its note- 
worthy achievements include calibrating the length 
scale of the solar system (the astronomical unit) to 
greatly improve radio navigation of planetary space- 
craft, discovering the slow backward rotation of the 
planet Venus, finding craters on Venus, measuring 
altitudes on Mars, selecting Martian landing sites for 


the Viking Lander, determining that Saturn’s rings 
were composed of large icelike particles, and study- 
ing asteroids and comets to determine surface prop- 
erties and rotation. 

By using techniques similar to two-station preci- 
sion radio navigation, changes in relative positions 
of particular locations on Earth are measured. This 
provides a deeper understanding of crustal plate mo- 
tion and, ultimately, of earthquake mechanisms. The 
DSN is also used as a radio astronomy observatory, 
either alone or in cooperation with other observato- 
ries. See GEODESY; RADIO ASTRONOMY. 

Much advanced technology has arisen from the 
DSN, including the first field-operational amplify- 
ing masers for low-noise, weak-signal reception; the 
first field-operational hydrogen maser clocks for fre- 
quency and timing in navigation; power-efficient 
coded communications, including the first use of 
block and convolutional coding; on-line signal pro- 
cessing by general-purpose digital computers; and 
high-speed determination of signal strength as a func- 
tion of frequency across a very wide frequency band. 
See ATOMIC CLOCK; MASER. 

In addition to providing communications for 
United States deep-space craft, joint deep-space mis- 
sions with other countries are supported on a co- 
operative basis. As the only worldwide network that 
can work with extremely weak signals, the DSN is a 
valuable resource for other countries. 

Laser Tracking Network. The Laser Tracking Network 
consists of a series of laser systems used for ranging 
to retroreflector-equipped satellites in highly stable 
orbits. Laser stations obtain ranging data for these 
satellites by bouncing a highly concentrated pulse 
of laser light off the retroreflector corner cube in- 
stalled on the spacecraft exterior. The exact position 
of the spacecraft in orbit can then be mathematically 
determined for a given point in time. By comparing 
several ranging operations, orbital predictions can 
be interpolated and extrapolated. 

The resultant data have a variety of applications, 
such as precise prediction of satellite orbit and mea- 
surement of the Earth’s gravitational field, polar mo- 
tion, earth tides, Earth rotation, tectonic plate mo- 
tion, and crustal motion to accuracies within the 
centimeter range. The Laser Tracking Network is a 
multinational cooperative effort with over 30 laser 
sites located in North and South America, Europe, 
China, Japan, and Australia. 

The Laser Tracking Network consists of both fixed 
and mobile laser ranging systems. The mobile sta- 
tions can be deployed worldwide to prepared sites. 
A typical laser mobile system consists of a laser trans- 
mitter, a receiver subsystem, the optical equipment, 
a computer, a timing subsystem, and a surveillance 
radar. The total system is housed in mobile vans, with 
a power-generation van required at some sites. 

TDRSS. In the plan developed for 1990 and beyond, 
the DSN provides support to high-altitude Earth- 
orbiting spacecraft. Other than for launch support 
from the stations at Merritt Island, Florida; Bermuda; 
and Dakar, Senegal, all support to low-altitude Earth- 
orbiting satellites is from the TDRSS. The TDRSS 
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Fig. 2. Tracking and Data Relay Satellite System (TDRSS) typical user configuration. 


(Fig. 2) consists of two operational geostation- 
ary satellites, an in-orbit spare, and two colocated 
ground terminals with associated operations control 
facilities at White Sands, New Mexico. The two ac- 
tive satellites, called TDRS-East and TDRS-West, are 
in geosynchronous orbit roughly above the Equator 
and at 41 and 171°W longitude, respectively. The 
TDRSS spare is located at 83°W longitude. The TDRSs 
relay signals between the ground station and satel- 
lites orbiting generally below 7500 mi (12,000 km) 
above Earth during nearly all of each orbit, except 
for a zone of exclusion which varies with spacecraft 
altitude. 

Communication between the ground terminals 
and TDRSS is at Ku-band frequencies via ground ter- 
minal antennas. TDRS on-orbit tracking, telemetry, 
and command are provided by the same facilities. 
A separate antenna is provided for S-band operations 
during transfer orbits and as a backup to the Ku-band 
capability. Each TDRS can support two high-data-rate 
users simultaneously at either S- or Ku-band with the 
steerable single-access antennas. Multiple access per- 
mits up to 20 users at a time, but with much lower 
data rates. 

The advantage of the TDRSS is that support of 
low-orbiting spacecraft operations is no longer re- 
stricted to short and infrequent periods of mutual 
STDN-spacecraft visibility. Orbital coverage for low- 
altitude Earth-orbiting spacecraft is increased from a 
maximum of 15% of each orbit, using conventional 


tracking stations, to 85% with TDRSS. This increase, 
in turn, made possible the elimination of the major- 
ity of the STDN stations and consolidation of their 
remaining functions with those of the DSN, which 
can take over responsibilities of both. 

The space shuttle Challenger carried the first of 
the three satellites into space in April 1983. Since that 
time the two other satellites required to complete 
the constellation were launched, and the system is 
operational. 

Worldwide network. The common element con- 
necting all networks is the worldwide NASA com- 
munication system (NASCOM), which links all NASA 
stations, on all continents except Antarctica, through 
the use of submarine cables, land lines, microwave 
data links, and satellite and fiber-optic transmis- 
sions. The trend is progressively toward satellite com- 
munications as the data rates to be handled con- 
tinue to increase. The use of satellites permits an 
Earth terminal to be placed at the user’s facility and 
thus permits the transfers of data in fewer hops. 
Ground instrumentation provides the vital link be- 
tween the spacecraft and mission controllers and sci- 
entists on the ground. See SATELLITE (SPACECRAFT); 
SPACE COMMUNICATIONS; SPACE FLIGHT; SPACE NAV- 
IGATION AND GUIDANCE; SPACE PROBE; SPACE STA- 
TION. H. William Wood 

Bibliography. M. Gerard and P. W. Edwards (eds.), 
Space Information Systems in the Space Station 
Era, 1988. 
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A system that provides control of location and atti- 
tude of spacecraft by using rocket engines to gen- 
erate motion. Spacecraft propulsion systems come 
in various forms depending on the specific mis- 
sion requirements. Each exhibits considerable varia- 
tion in such parameters as thrust, specific impulse, 
propellant mass and type, pressurization schemes, 
cost, and materials. All of these variables must be 
considered in deciding which propulsion system 
is best suited to a given mission. Typical space- 
craft applications include communications satellites, 
science and technology spacecraft, and Earth- 
monitoring missions such as weather satellites. 
Orbital environments range from low-Earth to 
geosynchronous to interplanetary. See ASTRONAUTI- 
CAL ENGINEERING; ROCKET PROPULSION; SATELLITE 
(SPACECRAFT); SPACE FLIGHT; SPACE PROBE; SPECIFIC 
IMPULSE; THRUST. 

Design parameters. The two fundamental variables 
that define the design of spacecraft propulsion sys- 
tems are the total velocity change to be imparted 
to the spacecraft for translational purposes, and the 
impulse necessary to counteract the various exter- 
nal torques imposed on the spacecraft body. From 
these, the required quantity of a given propellant 
combination can be specified by Eqs. (1) and (2). 
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In Eq. (1), Mp; is the mass of propellant that is re- 
quired to be expended to perform the translational 
maneuver of velocity increment A V; Ip is the spe- 
cific impulse, which is dependent on the propellant 
combination and engine design; g is the acceleration 
of gravity; and M,/, is the mass of the spacecraft prior 
to the rocket engine burn. In Eq. (2), M,? is the mass 
of propellant required to counteract the total distur- 
bance impulse J. The total propellant requirement 
for a given mission is the sum of these results. A typi- 
cal propellant budget for a communications satellite 
is given in the table. Propellant accounts for almost 
60% of the lift-off mass of a communications satellite. 
In this case, the translational AV propellant consti- 
tutes over 95% of the spacecraft’s needs, and station- 
keeping (translational) burns use over 95% of the fuel 


required during the operational life of the spacecraft 
(over 10 years). 

Most spacecraft require translation in one or more 
axes as well as rotation about the spacecraft center of 
mass. The AV requirements are dictated by the mis- 
sion design, launch-site selection, and in-orbit pertur- 
bations. For instance, some spacecraft require repo- 
sitioning throughout the mission. A launch from, for 
example, French Guiana in South America results in 
a similar mission to a launch from Cape Canaveral, 
Florida, but the propellant requirements are substan- 
tially affected because the launch vehicles leaving 
from each site put the spacecraft in different orbits. 
The spacecraft must then perform translations of dif- 
ferent magnitude to place themselves in the final lo- 
cation. 

Importance of specific impulse. From Eqs. (1) and (2) the 
significant effect of the specific impulse Zp on the 
total propellant load that a spacecraft must carry to 
perform its assigned mission can be seen. Since a 
massive satellite must be boosted into space by the 
use of expensive launch vehicles, such as the space 
shuttle and Ariane, significant cost savings may be 
gained if smaller, less expensive launch vehicles may 
be used. The size of the required launch vehicle is di- 
rectly proportional to the mass of the payload. Since 
most of the other components that make up space- 
craft are relatively fixed in weight, it is critical to uti- 
lize propellant combinations that maximize specific 
impulse. 

Propellant combinations. For modern spacecraft the 
choices are either bipropellants, which utilize a liq- 
uid oxidizer and a separate liquid fuel; solid propel- 
lants, which consist of oxidizer and fuel mixed to- 
gether; or monopropellants, which are liquid fuels 
that are easily dissociated by a catalyst into hot, 
gaseous reaction products. High specific impulse is 
offered by bipropellants, followed by solid propel- 
lants and monopropellants. For spacecraft the spe- 
cific impulse values range from 315 s down to 220 s. 
The liquid bipropellants are nitrogen tetroxide 
(N20,) as the oxidizer and either hydrazine (N2H4) or 
monomethylhydrazine (CH3N2H;) as the fuel. Solid 
rocket motors use hydroxy-terminated polybutadi- 
ene (HTPB) as a binding agent for the ammonium 
percholorate (NH4ClO,) oxidizer and aluminum fuel. 
Monopropellants are presently limited to hydrazine, 
which is catalyzed on a bed of iridium-coated pellets. 
See PROPELLANT. 

Propulsion system designs. Spacecraft attitude con- 
trol schemes play an important role in defining the 














Typical propellant budget for a geosynchronous communications satellite 
Function AV, ft/s (m/s) Fuel, Ibm (kg) Oxidizer, Ibm (kg) 

Transfer orbit burns — 4 (2) — 
Apogee injection 4927 (1502) 1200 (544) 1320 (599) 
Preoperational = 20 (9) at 
Station acquisition 43 (13) 25 (11) — 
Station keeping 2240 (683) 900 (408) — 
Attitude control = 20 (9) — 

Total 7210 (2198) 2169 (984) 7320 (699) 








detailed characteristics of spacecraft propulsion sys- 
tems. Essentially, there are three methods for stabi- 
lizing a spacecraft: three-axis control, spin control, 
and gravity gradient. In three-axis systems the body 
axes are inertially stabilized with reference to the Sun 
and stars, and utilize rocket engines for control in all 
six degrees of freedom. Spin-stabilized spacecraft use 
the inertial properties of a gyroscope to permanently 
align one of the axes by rotating a major portion of 
the spacecraft body about this axis. This approach 
significantly reduces the number of thrusters needed 
for control. Gravity gradient control is a nonactive 
technique that relies on the Earth’s tidal forces to 
permanently point a preferred body axis toward the 
Earth’s center. See GYROSCOPE; INERTIAL GUIDANCE 
SYSTEM; SPACECRAFT STRUCTURE. 

Translation of a spacecraft, independent of its con- 
trol technique, requires thrusters aligned parallel to 
the desired translational axis. Usually, all three axes 
require translational capability. Combining the two 
requirements for attitude control and translation re- 
sults in the minimum number of rocket engines re- 
quired to perform the mission. These are supple- 
mented with additional thrusters to allow for failures 
without degrading the performance of the propul- 
sion system. Thrust levels vary and are dependent on 
spacecraft maximum and minimum acceleration lev- 
els, which in turn are dependent on allowable burn 
efficiencies, power availability, and the structural ca- 
pability of deployed appendages, such as solar arrays 
or scientific instruments. In spacecraft applications, 
bipropellant and monopropellant engines are signifi- 
cantly lower in thrust by several orders of magnitude 
compared to solid rocket motors. This implies that 
burn durations, which are inversely proportional to 
thrust, are much longer for liquid propulsion systems 
than solid systems for a given velocity increment 
AV. Solid rocket motors are almost completely lim- 
ited to performing large velocity changes. Bipropel- 
lant rocket engine thrust levels are between 100 lbf 
(445 newtons) for large-AV burns and 2 Ibf (10 N) 
for attitude and translational maneuvers. Monopro- 
pellant thrusters range from 100 Ibf (4445 N) down 
to 0.05 Ibf (0.2 N). Figure 1 shows the schematic 
thruster layout for a typical three-axis controlled 
bipropellant propulsion system for a communica- 
tions system. The propellant budget shown in the 
table is based on the same spacecraft mission. 

Minimization of propulsion system mass. As discussed 
above, a critical parameter in the ultimate design of 
a spacecraft propulsion system is the mass of the 
propellant. This is dictated by the engine specific 
impulse and the spacecraft launch weight. Simplis- 
tically then, it would be reasonable to assume that 
the propellant-engine combination with the high- 
est specific impulse would be the preferable choice. 
However, the ultimate requirement is the lowest pos- 
sible mass for the entire propulsion system. The com- 
plexity of the system is greatly influenced by, and is 
roughly proportional to, the specific impulse, since 
bipropellants require more tanks, valves, and so forth 
than either solid systems or monopropellant systems. 
This is primarily due to the differences in density 


Spacecraft propulsion 


+Z (north), 
pitch axis 









+X (anti-Earth), 
yaw axis 


apogee 
kick motor 


——— 


y propellant tank 


\ 


rocket engine 


~~ solar array 






+Y (east), 
roll axis 






Fig. 1. Typical communications satellite thruster layout. 
Thrust is provided to control rotation about all three axes 
and translation, in six degress of freedom. 


between liquids and solids, and the fact that bipro- 
pellants require high-pressure gas sources to expel 
the fluid from the tanks and into the rocket engine 
chamber. Typical three-axis stabilized bipropellant 
and monopropellant systems are compared in Fig. 2. 
The definition of the mission requirements, in the 
form of total spacecraft launch mass and the mission 
total velocity increment AV, are translated into the 
hardware elements needed to control the system, 
and the mass of these elements is estimated. Sub- 
tracting the propellant and system hardware mass 
from the total launch mass provides an estimate of 
the spacecraft mass that is available for distribution 
among all the other systems that constitute a space- 
craft (for example, power, attitude control, and pay- 
load). For communications satellites in the lift-off 
weight range of 3000 Ibm (1360 kg), the trade-off 
between specific impulse and system mass dictates 
the use of a solid rocket motor for the main-orbit 
circularizing burn and a monopropellant propulsion 
system for on-orbit attitude control and translation. 
Spacecraft launch masses above about 5000 Ibm 
(2268 kg) require the use of all-bipropellant systems. 

Blowdown versus pressure-regulated systems. From Fig. 2 
it can be seen that there are two approaches to ex- 
pelling propellant from spacecraft in a zero-g envi- 
ronment. The simple monopropellant systems use a 
pressurant gas stored within the same tank as the hy- 
drazine fuel. Operation of the rocket engine valves 
allows the fuel to flow by virtue of the pressure differ- 
ence between the tanks and the spacecraft exterior 
(vacuum). As the fluid is expended, the tank pres- 
sure decreases exponentially until all the fluid is re- 
moved. This technique is called a blowdown system. 
It is lightweight but requires a significant gas vol- 
ume to ensure that the pressure remains at some rea- 
sonable value, since the specific impulse decreases 
with feed pressure. For most monopropellant sys- 
tems the ratio of initial to final tank pressure is 4:1. 
By using the ideal gas law, it can be shown that the 
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Fig. 2. Typical propulsion system designs for communications, Earth observation, and interplanetary spacecraft missions. 


(a) Monopropellant system. (b) Bipropellant system. 


initial gas volume must be 25% of the tank volume. 
See GAS. 

In systems with larger propellant mass, which 
must be bipropellant systems, the tank volumes re- 
quired for a simple blowdown system become im- 
practical. Therefore, bipropellant systems usually 
employ separate gas reservoirs, which store the gas 
elium) required to expel the propellant at very 
high pressures; 4500 Ib/in.? absolute (31 megapas- 
cals) is typical. The storage-tank pressure is reduced 
to the engine operating pressure of 220 Ib/in.? ab- 


solute (1.5 MPa) by a pressure regulator, hence 
the name pressure-regulated system. Although the 
storage-tank mass is high, this is offset by the gain in 
propellant-tank mass compared to that in the blow- 
down approach and the ability to operate the sys- 
tem at the optimum engine pressure to maximize 
specific-impulse performance. 

At the propellant loads typical of current space- 
craft propulsion systems (2000 Ibm or 907 kg), the 
use of turbopumps, which are often used in launch 
vehicles, has not proved to be weight-competitive 


with the blowdown or the pressure-regulated sys- 
tems. In the future, as propellant mass continues 
to increase as spacecraft mass increases, pumps are 
likely to be utilized. 

Implications of zero-g. A primary feature that distin- 
guishes spacecraft propulsion technology from that 
utilized on launch vehicles is the need to cope with 
the extremely low acceleration values that exist in 
orbit. (This environment is termed zero-g, although 
this is not really accurate, since a constant accel- 
eration level of the order of 10~ g is permanently 
experienced by Earth-orbiting spacecraft.) Such a 
low-acceleration environment primarily affects the 
behavior of any fluid on board (generally propellant), 
due to the domination of surface-tension forces. In 
1-g, Earth-bound experience, fluid geometry is con- 
trolled totally by gravitational forces. Hence, water 
in a cup exhibits essentially a flat surface. Any dif- 
ference in fluid height is removed by fluid flowing 
under the influence of gravity to a common surface. 
The relationship defining whether surface tension or 
gravitational forces are dominant is provided by the 
dimensionless Bond number Bo, defined by Eq. (3). 


_ pal? 


Bo = 6) 


o 


Here, p is the density of the fluid, a is the local 
acceleration, Z is a characteristic length of the sys- 
tem (for example, the tank diameter), and o is the 
fluid surface tension. For environments where Bo is 
much greater than 1, the system is said to be gravity- 
dominated. When Bo is much less than 1, the system 
is surface-tension-dominated. In those cases where 
the Bond number is close to 1, the system is inter- 
mediate and neither force clearly dominates. See SUR- 
FACE TENSION. 

Clearly, in a spacecraft environment gravity is no 
longer significant, and the fluid internal forces define 
the steady-state liquid surface. In this situation, fluids 
attempt to minimize their total energy by reducing 
their surface area, and therefore the surface tension, 
resulting in spherical propellant shapes. The fluid 
spheres can attach anywhere on the internal tank 
surface. In particular, the propellant may not be lo- 
cated over the outlet of the tank, and hence gas may 
be expelled instead of propellant, which is clearly 
undesirable. 

To keep the propellant “puddle” from locating it- 
self in arbitrary locations, the principle of surface 
tension is often used to preferentially locate the pro- 
pellant over the outlet. One way is by using channels 
constructed of metal with a single open face cov- 
ered with a fine-weave metal screen (Fig. 3a). The 
screen, which is woven like cloth, has pores of only 
about 15 micrometers that are wetted by the pro- 
pellant. From Eq. (3) it follows that the Bond num- 
ber is very low, and the pores are surface-tension- 
dominated. Figure 3b shows schematically the liquid 
configuration within a pore. When the channel is sur- 
rounded by fluid, the liquid flows through the pores 
with little frictional loss. In this case, gas-free propel- 
lant is provided to the propulsion system since only 
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Fig. 3. Surface-tension propellant management device. 
(a) Typical device. Liquid propellant flows through the wire 
screens and through the channels to the circular device at 
the center, where it is expelled from the spacecraft. 

(b) Configuration of liqud and gas within one of the pores 
(radius = R) of the screen. 


liquid is flowing out of the tank. In the more general 
case of the channel and the screen exposed to gas, 
the surface-tension forces restrict the ability of the 
gas to penetrate the liquid surface within the pore. 
The gas pressure required to ingest bubbles across 
the fluid barrier is termed the bubble point and is 
defined by Eq. (4), where AP is the bubble point 


o 


AP = — 
2R 


(4) 
for the given configuration, o is the surface tension 
of the propellant, and R is the characteristic radius 
(in this application it is the radius of the pore). For 
nitrogen tetroxide, the bubble point is about 2 Ib/in.? 
(14 kilopascals). Hence, the flow losses in the fluid 
moving through the screen must equal this value be- 
fore the gas can break through the fluid surface and 
enter the propulsion system tubing. By proper de- 
sign, it is possible to make channel devices that can 
expel 99.9% of the fluid content of a tank before the 
gas breaks through. When the gas does enter the 
tubing, and finally the engine, the spacecraft can no 
longer be predictably controlled, and this is the defi- 
nition of the end of the mission. Typical mission du- 
rations are 10-15 years of operation for communica- 
tions satellites and 1-5 years for low-Earth missions. 
Interplanetary spacecraft life requirements often ex- 
ceed 10 years. Keith Davies 
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The supporting structure for systems capable of leav- 
ing the Farth and its atmosphere, performing a useful 
mission in space, sometimes returning to the Earth 
and sometimes landing on other bodies. Among the 
principal technologies that enter into the design of 
spacecraft structures are aerodynamics, aerothermo- 
dynamics, heat transfer, structural mechanics, struc- 
tural dynamics, materials technology, and systems 
analysis. In applying these technologies to the struc- 
tural design of a spacecraft, trade studies are made to 
arrive at a design which fulfills system requirements 
at a minimum weight with acceptable reliability and 
which is capable of being realized in a reasonable 
period of time. 

The structural aspects of space flight can be di- 
vided into six broad regions or phases: (1) transporta- 
tion, handling, and storage; (2) testing; (3) boosting; 
(4) Earth-orbiting flight; (5) reentry, landing, and re- 
covery on the Earth; (6) interplanetary flight with 
orbiting of or landing on other planets. Each phase 
has its own structural design criteria requiring de- 
tailed consideration of heat, static loads, dynamic 
loads, rigidity, vacuum effects, radiation, meteoroids, 


acoustical loads, atmospheric pressure loads, for- 
eign atmospheric composition, solar pressure, fabri- 
cation techniques, magnetic forces, sterilization re- 
quirements, accessibility for repair, and interrelation 
of one effect with the others. Heavy reliance is placed 
on computer-generated mathematical models and 
ground testing. 

The basic spacecraft structural design considera- 
tions apply equally well to both crewed and crewless 
spacecraft. The degree of reliability of the design re- 
quired is, however, much greater for crewed mis- 
sions. Also, the spacecraft structures in the case of 
crewed missions must include life-support systems, 
and reentry and recovery provisions. In the case of 
lunar or planetary missions where landing on and 
leaving the foreign body are required, additional pro- 
visions for propulsion, guidance, control, spacecraft 
sterilization, and life-support systems must be real- 
ized, and the structure must be designed to accom- 
modate them. 

Various factors must be considered when selecting 
spacecraft structural materials for the three principal 
phases of space flight—launch, space journey, and 
reentry (Table 1). 

Transportation, handling, storage. For ground trans- 
portation and storage, special load isolation sys- 
tems and hermetically sealed carriers are often built 
to control shock and prevent contamination. The 
spacecraft is monitored continuously to prove that 
it has remained within its design envelope as it 
travels to the launch site by truck, airplane, or 
barge. 





TABLE 1. Materials selections factors* 





Factors Phase I: launch 


Phase II: space journey 


Phase Ill: reentry 





Technological 
factors 


Rocket power 

Thrust alignment 

Structural stiffness 

Operation of the launcher stages 


Materials factors Elastic modulus-density ratio 

Rocket nozzle erosion, particularly 
nonuniform erosion 

Low-temperature conditions in fuel tanks 

Construction of solid-propellant 


rocket-motor casings 


Acoustic fatigue 


Materials of 
special interest 


Graphite 

Refractory oxides 

Carbides, nitrides, and borides 
Structural plastics 

Steels 

Nickel alloys 

Cobalt alloys 

Some structural low-density alloys 


Satellite all-up weight 


Impact by meteoroids, particles, atoms, 
ions, and electrons 


Electromagnetic radiation 

Radiation heating and cooling 

Behavior of solids at low pressures 

Radiation damage, effect on the physical 
properties of materials 

Transparencies 

Emissivity 


Radiation shielding 
Resistance to impact: surface damage, 
spalling 


Behavior of bearing surfaces, friction 

Seals 

Behavior of electrical materials, 
semiconductors 

Lightweight vehicle construction 

Composite assemblies 

Light alloys: aluminum, magnesium, 
titanium, beryllium 

Polymers and elastomers 


Resistance to high kinetic heating 
Dissipation and redistribution ofheat 


Compatibility of structural materials under 
thermal shock 


Operation of control devices 

Thermal stresses 

Material survival under thermal shock at 
temperatures of ~8000 K (14,000°F) 

Erosion 

Ablation 


Thermal insulation 
Heat sinks 


Protective coatings 


Graphite 

Plastics 

Ceramics, particularly reinforced ceramics 
Molybdenum 

Niobium 





*After A. J. Kennedy, Materials Background to Space Technology, George Newnes Limited Tower House, London, 1964. 








Fig. 1. Finite element model of the SPOT 5 spacecraft in its 
stowed launch configuration. (European Space Agency/ 
Matra Morconi) 


Testing. To ensure that spacecraft structures will 
meet mission requirements criteria in general re- 
quires testing to levels above the expected envi- 
ronmental conditions by a specific value. The test 
level must be set to provide for variations in mate- 
rials, manufacture, and anticipated loads. In cases 
where structures are required to perform dynamic 
functions repeatedly, life testing is required to en- 
sure proper operation over a given number of cycles 
of operation. See INSPECTION AND TESTING. 

A major function of the test program is to validate 
the structural mathematical models. A typical model 
(Fig. 1) is constructed with a powerful finite element 
computer program such as NASTRAN or PATRAN. 
The model is used to predict dynamic loads, stress 
levels, natural frequencies in both the stowed and 
deployed configurations, on-orbit jitter and stabil- 
ity, temperature extremes and deformations, and op- 
eration loads for mechanisms. See COMPUTER-AIDED 
ENGINEERING; ENGINEERING DESIGN; FINITE ELEMENT 
METHOD. 

Boost. The purpose of the boost phase is to lift 
the vehicle above the sensible atmosphere, to ac- 
celerate the vehicle to the velocity required, and to 
place the spacecraft at a point in space, heading in 
the direction required for the accomplishment of its 
mission. For space missions, the required velocities 
range from 26,000 ft/s (8 km/s) for nearly circular 
Earth orbits to 36,000 ft/s (11 km/s) for interplan- 
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etary missions. Achievement of these velocities re- 
quires boosters many times the size of the space- 
craft itself. Generally, this boosting is accomplished 
by a chemically powered rocket propulsion system 
using liquid or solid propellants. Multiple stages are 
required to reach the velocities for space missions. 
Vertical takeoff requires a thrust or propulsive force 
that exceeds the weight of the complete flight sys- 
tem by approximately 30%. An example ofa multiple- 
stage booster is the Delta launch vehicle used for the 
crewless missions (Fig. 2). The Delta II 7925 vehi- 
cle has the capability to place 4000 lb (1800 kg) 
into a geosynchronous transfer orbit. See IN- 
TERPLANETARY PROPULSION; PROPELLANT; ROCKET 
PROPULSION. 

The boost trajectory (Fig. 3) can be controlled 
to some extent by proper selection of the thrust-to- 
weight ratio and launch angle. The higher thrust-to- 
weight ratios are more efficient, but they result in 
a higher dynamic pressure which in turn results in 
increased spacecraft loading and heating. 

Boost environments. The basis for design of the pri- 
mary spacecraft structure is the ability to withstand 
the boost environment. Typical launch vehicles pro- 
duce longitudinal acceleration loadings in excess of 
five times the Earth’s gravitational field, or 5 g. Lateral 
load factors are typically less than half of the longi- 
tudinal load factor, but cause the greatest portion of 
stress induced by the spacecraft base bending mo- 
ment. A coupled-loads analysis, which is a dynamic 
simulation of the critical transient loading events, 
is performed in order to predict actual component 
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Fig. 2. Exploded view of typical Delta Il 7925 three-stage 
structure. (After Commercial Delta II Payload Planners 
Guide, McDonnell Douglas Commercial Delta Inc., 1990) 
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Fig. 3. Typical flight profile trajectory of the Ariane 4. (Ariane 4 User’s Manual) 


acceleration, force, stress, or displacement response 
for each unique launch configuration. Sophisticated 
finite element modeling techniques are used to rep- 
resent the launch vehicle and spacecraft in such anal- 
yses. 

Vibroacoustic loads. Spacecraft structures are exposed 
to an acoustic environment throughout the boost 
phase of flight until the vehicle is out of the sensible 
atmosphere. The highest acoustical sound-pressure 
levels occur immediately after liftoff because of im- 
pingement on the payload fairing of sound originat- 
ing from engine exhaust energy which is reflected 
off the launch pad. Another significant acoustic 
event occurs during transonic flight and is caused by 
aerodynamic shock waves and a highly turbulent 
boundary layer. The sound-pressure level within 
the fairing creates random vibration of the space- 
craft components. This acoustically generated ran- 
dom vibration environment is the most signifi- 
cant loading for small components which are typ- 
ically mounted on panel-type secondary structures 
sensitive to high-frequency excitation. See BOUND- 


ARY LAYER FLOW; SHOCK WAVE; SOUND; TRANSONIC 
FLIGHT; VIBRATION. 

Space phase and design considerations. The space 
phase begins after the boost phase and continues 
until reentry. In this phase, the structures that were 
stowed for launch are deployed. Important design 
considerations include control system interaction, 
thermally induced stress, and minimization of jitter 
and creaks. 

Deployment of loads. In order to fit within the tight 
confines of the boost vehicle, the spacecraft must 
be tightly folded and stowed. When in Earth orbit, it 
must extend and deploy into its operational shape 
(Fig. 4). This operation places high reliability re- 
quirements on mechanisms, motors, and joints. In 
space, the temperature changes can be extreme. 
The deployment may be complicated and involve 
pyrotechnic devices, coiled springs, redundantly 
driven motors, gearboxes, latches, and cables. 

Extensive testing of the deployment is required 
because there are probably more failures of satel- 
lites from this phase than from any other. Of course, 
some space structures will not support themselves 
in Earth’s gravitational field, so accurate deployment 
simulation can be challenging. 

If an astronaut is involved, then the deployment 
operation must meet restricted force and access re- 
quirements. To assist the task, the astronaut is pro- 
vided with a special tool that produces torque at sev- 
eral preset levels. 

Control system. The spacecraft control system im- 
parts inertial loads throughout the structure. In the 
zero gravity environment, every change in loading or 
orientation must be reacted through the structure. 

The structural stiffness of the spacecraft must be 
considered, especially as it affects the control system. 
The control system sensor location and the higher- 
order modes must be considered so that control sys- 
tem coupling, which may result in overall spacecraft 
instability, is avoided. The liquid propellant rocket 
systems, using fairly low pressurization, result in thin- 
shell spacecraft structures which are not as rigid as 
solid propellant cases, which must withstand high 
internal pressures. 

For orbiting telescopes, line-of-sight jitter and con- 
trol is just as important as stability. As larger space 
structures are developed, they are increasingly dif- 
ficult to control. This is due to two main factors: 
(1) their large size creates very low natural fre- 
quencies, which are slow to respond and are usu- 
ally lightly damped, and (2) the structures often 
change configuration when telescopes point to new 
targets or solar arrays track the Sun. See CONTROL 
SYSTEMS. 

Pressure vessels. Spacecraft structural design usually 
requires that part of the principal structure be a pres- 
sure vessel. Efficient pressure vessel design is there- 
fore imperative. An important material property, es- 
pecially in pressure vessel design, is notch sensitivity. 
Notch sensitivity refers to the material’s brittleness 
under biaxial strain. This apparent brittleness con- 
tributed to premature failure of some early boosters. 
See PRESSURE VESSEL. 
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Fig. 4. INTELSAT VII in its stowed and deployed configurations. (Loral, LM Astro) 


In pressure vessel design the variation of ultimate 
tensile strength with temperature is important. The 
exposure to elevated temperature is short. 

Tank structures for spacecraft differ with the type 
of propellant used. With solid propellants the tank is 
primarily a pressure vessel; it must be able to con- 
tain the high pressures (500-1000 lb/in.? or 3.4- 
6.9 megapascals) generated by combustion. Other 
loads are usually of secondary importance, and the 
bursting strength of the tank is usually the most sig- 
nificant factor. Care must be taken in design to avoid 
stress concentrations, particularly near joints, which 
might cause premature failure. 


Tanks for liquid propellants, on the other hand, 
need to contain relatively low pressures (10- 
100 1b/in.? or 69-690 kilopascals). However, flight 
loads in the form of thrust and bending moment be- 
come important in the design of the tank. One ap- 
proach is to use the tank pressure to assist in carry- 
ing these loads, and to stabilize the light-gage unstiff- 
ened tank shell. The skin gage is chosen to be just 
adequate for containing the tank pressure stresses; 
the pressure is chosen on the basis of providing ade- 
quate stability under flight loads. Such pressures are 
generally in excess of those required to stabilize the 
tank to the point where the buckling strength is 
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developed. Alternative approaches are to provide 
stiffening in the form of stringers or frames to sta- 
bilize the tank, and lowering tank pressures only to 
that level required by the propulsion system. 

Interstages between tanks are generally designed 
for flight loads; their construction is similar to that 
of airplane fuselages. They may be monocoque un- 
stiffened shells or may have longitudinal (stringer) or 
circumferential (frame) stiffening. 

Gravity gradient loads. Gravity gradient loads are par- 
ticularly significant in long flexible structures lo- 
cated within a significant gravity field such as the 
Earth’s. Properly considered, these loads can work 
for the spacecraft designer by providing a method 
of passive spacecraft control. An excellent example 
of the use of this technique was the radio astron- 
omy explorer spacecraft (RAE-Explorer 38) which 
gravity-stabilized by use of its 750-ft (229-m) anten- 
nas (Fig. 5). 

Solar radiation pressure. Solar radiation pressure can 
cause long flexible structural members, such as the 
RAE antennas and booms, to deflect quite signifi- 
cantly. Solar pressure can also cause significant dis- 
turbing torques on spacecraft that have large surface 
areas. This effect cannot be ignored in spacecraft 
structural design or in control systems design. 

Space material and debris. Meteoroids are natural bod- 
ies remaining from the formation of the solar sys- 
tem or from collisions of other natural bodies. These 
particles may have extremely high velocities relative 
to the spacecraft (up to 225,000 ft/s or 68 km/s). 





Fig. 5. Radio Astronomy Explorer spacecraft with antennas 
extended in comparison to the Empire State Building in 
New York City. 


Orbital debris comprises residual particles result- 
ing from human space-flight activities. Collisions in- 
volving these bodies and a space station and other 
long-duration orbiting spacecraft are inevitable. The 
worst-case effects of such collisions include the 
degradation of performance and the penetration 
of pressure vessels, including high-pressure storage 
tanks and habitable crew modules. An essential pa- 
rameter in the design of these structures is the miti- 
gation of these effects. See METEOR. 

The probability of an object 0.4 in. (1 cm) or larger 
impacting any part of a structure with a 53,820-ft? 
(5000-m?) exposed surface area (roughly one foot- 
ball field) and a 250-mi (400-km) operating altitude 
is predicted to be one impact in 71 years. How- 
ever, an impact would not necessarily cause a fail- 
ure in a space station since the core components of 
the largest space station designed cover only about 
21,500 ft? (2000 m?) and the design life is only 15 
years. Further, space-station design features shield- 
ing to protect against objects as large as 0.55 in. 
(1.4 cm). This represents protection from 99.8% of 
the objects that would otherwise penetrate an un- 
shielded component (roughly 0.04 in. or 0.1 cm). 
These factors make a catastrophic penetration ex- 
tremely unlikely. 

The great majority of the debris is far smaller than 
the millimeter scale. Impacts from these objects, 
most of which are the size of a grain of sand, too 
small to cause significant structural damage, are the 
most frequent. These impacts cause the degradation 
of sensitive surfaces such as optical components and 
solar panels. Mitigation of these effects can be dealt 
with by routine maintenance operations. 

Another important consideration is protection 
for crew members during extravehicular activities 
(EVA). Presently the risk appears very small because 
of the relatively small exposed area of the space suit 
and short duration of the exposure. However, in- 
creases in the debris environment could pose oper- 
ational and design constraints on extravehicular ac- 
tivity. 

Energetic particles. Radiation shielding is required for 
some vehicles, particularly those operating for ex- 
tended times within the Earth’s magnetically trapped 
radiation belts or during times of high sunspot activ- 
ity. The shielding may be an integral part of the struc- 
ture. Computer memories are particularly suscepti- 
ble to radiation and cosmic-ray activity and must be 
shielded to survive. This is a minor concern for scien- 
tific spacecraft but a major concern for defense satel- 
lites. Effects of radiation on most metallic structures 
over periods of 10-20 years is not severe. The dura- 
bility of composite structures in space is a major con- 
cern for long life. Based on available data, the syner- 
gistic effects of vacuum, heat, ultraviolet, and proton 
and electron radiation degrade the mechanical, phys- 
ical, and optical properties of polymers. See MAGNE- 
TOSPHERE; RADIATION HARDENING; VAN ALLEN RADI- 
ATION. 

Thermal extremes. Temperatures in space can vary 
from 212°F (100°C) on sunlit surfaces to —292°F 
(—180°C) on dark surfaces. As a result, temperature 


is a major design consideration for all moving parts, 
such as motors, hinges, and valves. When a satellite 
enters or exits the Earth’s shadow, the rapid temper- 
ature change can induce a phenomenon called ther- 
mal jitter, where parts of the structure deform so fast 
that they oscillate about their neutral positions and 
shake the entire spacecraft. This happened in the 
early days of the Hubble Space Telescope. It was cor- 
rected with updated control software and new solar 
arrays. See HUBBLE SPACE TELESCOPE. 

Temperature extremes in the structure and the en- 
closed environment are controlled by several tech- 
niques. Passive control is accomplished by surface 
coatings and multilayer thermal blankets which con- 
trol the radiation transfer from the spacecraft to 
space and vice versa. Because incident solar radia- 
tion varies inversely with the square of distance from 
the Sun, means of adjusting surface conditions are 
required for interplanetary missions. Heat generated 
by internal equipment or other sources must be con- 
sidered in the heat balance design. Other techniques 
used to actively control spacecraft temperatures are 
thermal louvers and heat pipes. See HEAT PIPE. 

Thermal gradients must be considered in space- 
craft design, especially when the spacecraft has one 
surface facing the Sun continuously. In some cases it 
is desirable to slowly rotate the spacecraft to elimi- 
nate such gradients. 

Inertial loads. An important consideration in space- 
craft structural design is the distribution of weight 
which determines the balance and inertia ratios of 
the spacecraft. In general the structural weight of 
spacecraft ranges 15-30% of the total spacecraft 
weight. By proper design, additional weight needed 
to balance the spacecraft and obtain the proper iner- 
tia ratios can be minimized. For spin-stabilized space- 
craft the spin moment of inertia should be greater 
than the principal lateral moment of inertia by ap- 
proximately 5% in order to ensure stability about the 
spin axis. Consideration must be given to expend- 
ables such as jettisoned hardware and gas depletion. 
As spacecraft become larger and the equipment con- 
tained therein becomes more massive, inertia load- 
ing is increasingly important. 

Sloshing. Sloshing in the case of liquid propulsion 
systems must be carefully evaluated because it affects 
the structural integrity of the thin shell and the over- 
all stability of the spacecraft. The fuel mass move- 
ments result in shifts of the center of gravity, causing 
pitch or yaw of the system. Control system coupling 
with either pitch or lateral sloshing can occur unless 
proper baffling or damping is provided. 

Material considerations. The materials of general in- 
terest used in spacecraft structures are alloys of 
aluminum, magnesium, and titanium; steel; stain- 
less steel; superalloys; plastics; and graphite-epoxy, 
Kevlar, and fiberglass composites. The fabrication 
and design techniques that are employed to obtain 
the most efficient structure for a given job often in- 
clude honeycomb sandwich construction (Fig. 6), 
helical wound fiberglass and metal, expanded 
plastic sandwich construction, and woven fabrics 


(Fig. 7). 
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Fig. 6. Honeycomb sandwich construction provides strong and stiff but lightweight 
structures. (Hexcel Corp.) 


Spacecraft structures usually are required to be 
lightweight and rigid, which results in the selection 
of high-modulus materials. 

Titanium and beryllium have low densities and 
relatively high modulus-density ratios. Alloys of 
these metals are relatively difficult to fabricate, and 
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Fig. 7. Woven Kevlar fabric is light and strong. 
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therefore their application is quite limited. The more 
common aluminum, magnesium, and stainless steel 
alloys are basic spacecraft structural materials. They 
are easy to fabricate, relatively inexpensive, and in 
general quite suitable for use in the space environ- 
ment. 

Plastics are used in spacecraft structures when 
radio-frequency or magnetic isolation is required. 
They are also used in situations where some struc- 
tural damping is desired. The thermal blankets, men- 
tioned earlier as passive temperature control meth- 
ods, are primarily Mylar plastic. 

Composite materials. The modern requirements for 
low weight, high strength, high stiffness, and low 
thermal expansion (for precision optical pointing) 
have prompted the use of composite materials for 
spacecraft structure. These materials consist of high- 
strength reinforcement fibers which are supported 
by a binder material referred to as the matrix (Fig. 7). 
The fibers are typically made of glass, graphite, or car- 
bon, and the matrix is an epoxy resin. The material is 
usually produced with the fibers configured in a uni- 
directional ply which has been pre-impregnated with 
the matrix. Such plies exhibit tremendous strength 
in the fiber direction but are relatively weak in the 
transverse direction. As a result, the plies are bonded 
together to form a laminate in which the fiber angle 
in each unique ply can be specified to provide the 
desired combined properties in each direction. The 
combined laminate is cured in an autoclave under 
stringent temperature and pressure conditions to en- 
sure constant properties. With knowledge of the 
loading due to a unique application, the material 
properties can be tailored by optimizing the ply an- 
gles within the laminate to produce a material which 
may be substantially less dense than an equivalent- 
strength metallic structure. 

An example of a spacecraft structure relying on 
composite materials is the Earth Observation Sys- 
tem (EOS-AM1). The graphite-epoxy tubes compos- 
ing the primary structure have been optimized for 
the substantially compressive loading induced by the 
Atlas launch vehicle. The tubes are made from a 
70-ply rolled laminate consisting of 80% plies in 
the axial direction for high tensile and compres- 
sive strength and 20% plies in the 45° orientation 
for high torsional strength. Further, the EOS-AM1 in- 
struments require precision pointing accuracies in 
the range of 0.1°. Under the extreme temperature 
gradients imposed on the spacecraft because of its 
polar orbit, the thermally induced expansion would 
preclude the use of typical metals for the primary 
structure. The thermal expansion coefficient for 
the graphite-epoxy tubes is 1.8 microstrain/°F (3.2 
microstrain/°C) in comparison to 12.0 microstrain/°F 
(21.6 microstrain/°C) for typical aluminum alloys. 
Graphite-epoxy laminate is also used in the Hubble 
Space Telescope, where the laminates are arranged 
to minimize the coefficient of thermal expansion, 
preventing deformation when temperature changes 
occur (Fig. 8). In the Hubble, only 1 milliarcsecond 
G x 107” degree) of line-of-sight angular deforma- 
tion is allowed for thermal expansion. For spacecraft 


requiring high precision in pointing angle, graphite- 
epoxy composites are the materials of choice. See 
COMPOSITE MATERIAL. 

Reentry phase. Although the atmospheric layer of 
the Farth is relatively thin, it is responsible for the 
reduction of vehicle velocity and the resulting decel- 
eration loads, as well as for the severe heating ex- 
perienced by reentering vehicles. A body entering 
the Earth’s atmosphere possesses a large amount of 
energy. This energy must be dissipated in a manner 
which allows the reentering vehicle to survive. Most 
of the vehicle’s original energy can be transformed 
into thermal energy in the air surrounding the vehi- 
cle, and only part of the original energy is retained in 
the vehicle as heat. The fraction that appears as heat 
in the vehicle depends upon the characteristics of 
the flow around the vehicle. In turn, the flow around 
the vehicle is a function of its geometry, attitude, ve- 
locity, and altitude. Several different approaches are 
used for reentry vehicles with different trajectories 
(Fig. 9). See ATMOSPHERIC ENTRY. 

Atmospheric drag. Spacecraft are seldom designed to 
reenter the Earth’s atmosphere (the space shuttle 
being an exception), but may be designed to enter 
extraterrestrial atmospheres (Fig. 10). In either case, 
the structural design is similar. 

High-speed reentry causes extreme friction and 
heat buildup on spacecraft that must be dissipated 
by using high-temperature ceramic or ablative ma- 
terials. The space shuttle has reentered the Earth’s 
atmosphere over a hundred times without accident 
(Fig. 11). It is covered with special thermal insulat- 
ing tiles that allow the structural elements to remain 
cool when the surface reaches 1200°F (650°C), and 
its leading edges are protected by a carbon-carbon 
reinforced material that can withstand temperatures 
as high as 2300°F (1260°C). 

Satellites whose orbits decay into the Earth’s upper 
atmosphere become flaming objects as they rapidly 
descend. Generally, most or all of the satellite is con- 
sumed before it reaches the surface, but there are 
exceptions such as the March 22, 2001, reentry of 
the Russian space station, Mir 

Ablation. The normal application of ablation is to 
limit the temperature of the structural shell or in- 
ternal components of the payload, in order to with- 
stand the extreme thermal environment during bal- 
listic zero-lift reentry. This temperature limitation is 
the principal criterion. The most desirable material, 
therefore, is one which has a high heat of ablation 
and low thermal conductivity. Material properties, 
however, do not necessarily possess these character- 
istics simultaneously, to the same degree (Table 2). 
Therefore, specific investigations or trade studies 
must be made to determine the minimum weight of 
ablation material required to maintain the structural 
temperature within design limits. Care must be exer- 
cised to match the ablation material to the environ- 
ment. For example, if the maximum temperature that 
would be reached without ablation is below 3000°F 
(1650°C), quartz would not be considered because 
it does not ablate appreciably below 3000°E Many 
materials will rapidly deteriorate without ablating if 
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Fig. 8. Internal telescope of the Hubble Space Telescope, whose structural members are made of graphite-epoxy. 


(NASA/Lockheed) 


exposed to temperatures below their ablation tem- 
peratures, and thus will not provide the needed pro- 
tection for the structure. 

Cooling and transpiration. Cooling the vehicle by circu- 
lation of water, lithium, or other media, or by transpi- 
ration of gas or vapor is limited by system complexity, 
reliability, and adverse weight penalties. 

Thermal protective systems are time-limited be- 
cause the heat-absorption capacity of the protection 
system is continuously expended during the process. 


lifting or -m= zero-lift 

nonlifting y \— ballistic missile 
space vehicle / \ 

reentry / 


A atmosphere 





Earth's surface 


Fig. 9. Comparison of various space reentry and ballistic 
missile trajectories. 


These systems, therefore, apply primarily to short- 
duration (5-10 min) flights. 

Lifting reentry. In crewed applications, vehicles em- 
ploying aerodynamic lift during reentry have several 
advantages over zero-lift ballistic bodies: (1) The use 
of lift allows a more gradual descent, thus reducing 
the deceleration forces on both vehicle and occu- 
pants. (2) The vehicle’s ability to glide and maneu- 
ver within the atmosphere gives it greater accuracy 
in either hitting a target or landing at a predeter- 
mined spot. (3) It can accommodate greater errors 
of guidance systems because for a given decelera- 
tion it can tolerate a greater range of entry angles. 
(4) Greater temperature control is afforded because 
aerodynamic lift may be varied to control altitude 
with velocity. 

The total time required for a lifting vehicle to de- 
scend through the atmosphere depends on the drag 
characteristics of the vehicle. With high drag coeffi- 
cients the time may be as low as 30 min, and with 
low drag it may be approximately 1-2 h. In the case 
of grazing reentries, several orbital loops may be re- 
quired to bleed off the energy, and reentry duration 
can be measured in days. The relatively long times 
required for descent make time-dependent ablation 
or cooling impractical. 
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Fig. 10. Mars Pathfinder entry, descent, and landing. (VASA/Jet Propulsion Laboratory) 


Thermal stress during reentry. One of the most severe 
structural problems is thermal stress, which arises 
as a result of temperature gradients. These tempera- 
ture gradients result from causes such as heat sinks, 
transient heating, and crewed-compartment cooling. 
Thermal stress is a function of the coefficient of ex- 
pansion, proportional to the modulus of elasticity 
and the temperature gradient. 

Thermal stresses will be produced in sandwich or 
conventional sheet-stringer construction if the ther- 
mal deformations are externally constrained. Ther- 
mal stresses will also be produced in the absence 
of external constraints, solely because of the incom- 
patible deformations of the different parts of the 
body, for example, a rectangular anisotropic sand- 
wich panel with a temperature gradient through its 
thickness. 

The elastic thermal stresses are computed for the 
typical temperature gradients associated with the 
maximum temperatures. If the vehicle is intended for 
repeated use, the compressive yield point cannot be 


exceeded. As an example, sandwich construction of 
high-nickel alloy M-252 cannot be used above 1660°F 
(900°C) because the thermal stress equals the com- 
pressive yield stress at that temperature. The air-load 
stresses should be added to the thermal stresses, thus 
reducing even further the allowable temperature for 
material and type of construction under these con- 
ditions. The same curves are drawn for a titanium- 
molybdenum alloy. Its more favorable lower coeffi- 
cient of thermal expansion and its higher allowable 
stress in the higher-temperature region result in a 
better sandwich. However, there are many practical 
problems to be solved in the brazing of molybdenum 
alloys. 

If the vehicle were designed for a single shot, 
the thermal stresses could exceed the yield point 
considerably without seriously jeopardizing the mis- 
sion. However, tremendous benefits accrue if ther- 
mal stresses are eliminated. 

High-temperature materials. The extreme thermal envi- 
ronment requires materials with higher temperature 
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Fig. 11. The space shuttle (shown with cargo bay doors open) can return to Earth with help from insulation tiles that keep it 
from burning up. (NASA/Johnson Space Center) 


capability than standard aircraft steels and aluminum 
alloys. These materials are the superalloys, refractory 
alloys, and ceramics. 

The superalloys which contain high percentages 
of either nickel or cobalt have a maximum useful 
temperature range, under low stresses, of 1600- 
2000°F (870-1090°C). These alloys are readily avail- 
able in thin gages and can be formed, riveted, brazed, 
and fusion- or spot-welded with essentially standard 
procedures. The surface of these alloys is rapidly 
oxidized above 1300°F (700°C). However, the oxide 
forms an adherent coating that protects the metal 
from further oxidation and provides a high surface 
emissivity (0.8-0.9). 

The refractory alloys (niobium, molybdenum, tan- 
talum, and tungsten) have higher densities than the 
superalloys but retain their useful strength from 
3300°F or 1820°C (niobium) to over 4000°F or 
2200°C (tungsten). All refractory metals are oxidized 
catastrophically at high temperatures, and surface 
protection is required. Coatings have been devel- 
oped which protect molybdenum for approximately 
300 h at 3000°F (1650°C) and niobium for several 
hours at 2600°F (1430°C). Niobium alloys are avail- 
able in thin gages and can be formed by standard pro- 


cedures. The quality of thin-gage molybdenum alloys 
is not consistent and requires hot forming because 
of their low ductility at room temperature. Tungsten 
alloy, available in sheet, bar, and wire form, has the 
highest melting point of all the metallic elements. 
This material offers extremely high strength but at 
a high mass density of 19 g/cm?. Tantalum is similar 
to niobium except that it has a much higher den- 
sity. The refractory alloys can be joined by riveting, 
but fusion welding requires such special procedures 
as inert atmosphere. Spot welding is extremely diffi- 
cult. 

Conventional ceramics retain fairly high strengths 
up to 4000°F (2200°C), and the more exotic ceram- 
ics are potentially usable at over 8000°F (4430°C). 
Their high melting points and relative inertness to 
oxidizing atmospheres make them attractive materi- 
als for leading edges and nose cones of glide vehicles. 
However, the lack of ductility presents serious design 
problems. 

Cost is not necessarily the criterion for mate- 
rial selection. Availability, strength, stiffness, weight, 
producibility, and cost of fabrication must also be 
considered. The final material selected will be the 
one which results in the least expensive end item 





TABLE 2. Typical ablation materials and properties 








Ablation Thermal conductivity, Specific heat, Density, 
temperature, Btu/(in.)(s)(F) Btu/(Ib)(F) Ib/in. 
Material TREC) [J/(m)(s)(°C)] [kJ/(kg)(C)] (g/cm) 
Quartz 4400 (2430) 0.1292 x 10~* (0.9660) 0.25 (1.05) 0.08391 (2.323) 
Acrylic 250* (120) 0.4033 x 107° (30.15) 0.38 (1.59) 0.04282 (1.185) 
Poly(tetrafluoroethylene) 1000* (640) 2.896 x 107 (0.2165) (0.25 (1.05) 0.07523 (2.082) 
Refrasil-11% phenolic resin 2000° (1090) 3.475 x 10~® (0.2598) 0.23 (0.96) 0.06366 (1.762) 
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Fig. 12. ATS 7. (a) Structural assemblies. (b) Side view and (c) bottom view of deployed configuration. (d) Side view and 
(e) section A-A of launch configuration. (f) View of deployed spacecraft. 1 in. = 25 mm; 1 ft = 0.3 m. 
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Fig. 13. The International Space Station is an example of a large structure of changing configuration that cannot be 
completely assembled on Earth. (NASA) 


and best meets the design requirements. See HIGH- 
TEMPERATURE MATERIALS. 

Erectable structures. These take many and varied 
shapes. They are sometimes relatively simple hinged 
booms, while on other occasions they become quite 
large and massive. One type of unique erectable 
structure is the antennas on the radio astronomy 
explorer (RAE) spacecraft (Fig. 5). The four 750-ft 
(229-m) V antennas were erected in flight, as were 
two 300-ft (91-m) damper tubes and two 60-ft (18-m) 
dipole antennas. Each of these antenna booms was 
stored as a 0.002-in.-thick (50-micrometer) beryllium 
copper alloy tape approximately 2 in. (5 cm) wide in 
a mechanism within the spacecraft. Upon commands 
from the Earth, the booms were deployed from the 
mechanism to form '/:-in.-diameter (13-mm) tubes. 
The weight of the '/,-in.-diameter (13-mm) tube is ap- 
proximately '/;-0z/ft (23 g/m); therefore the 3750 ft 
(1143 m) of elements aboard the RAE weighed only 
60 lb (27 kg). The 750-ft (225-m) antennas were de- 
signed so that upon deployment the flat tape inter- 
locked into a torsionally rigid structural member. In 
addition to serving the function of radio astronomy 


antennas, the long booms provided gravity stabiliza- 
tion so that one axis of the spacecraft always faced 
the Earth. 

Rigid spacecraft structures. Many more spacecraft 
structures are rigid than erectable or inflatable. Some 
examples of rigid spacecraft structures are the Or 
biting Astronomical Observatory (OAO), the Nim- 
bus weather spacecraft, the Telstar communications 
spacecraft, the Syncom communications spacecraft, 
the Orbiting Solar Observatory (OSO), and the lunar 
and command service modules. 

Typical of a rigid spacecraft structure is the Ap- 
plications Technology Satellite (Fig. 12), which in- 
cludes a flexible, erectable, parabolic reflector 30 ft 
(9 m) in diameter. It is used to broadcast television 
signals into remote areas. 

Another spacecraft structure is the space shut- 
tle (Fig. 11) or Space Transportation System (STS), 
which can carry 65,000 Ib 0,000 kg) of cargo to 
and from low Earth orbit. 

The International Space Station (ISS) is a coop- 
erative, 16-nation effort (Fig. 13). It will include 
six laboratories and weigh a million pounds when 
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assembled as currently projected in 2010. See SPACE 
STATION. 

The Viking lander is a prime example of an inter- 
planetary spacecraft designed for orbiting and soft 
landing on Mars. See COMMUNICATIONS SATELLITE; 
METEOROLOGICAL SATELLITES; MILITARY SATELLITES; 
SATELLITE NAVIGATION SYSTEMS; SCIENTIFIC AND AP- 
PLICATIONS SATELLITES. 

William Haile; Peter S. Weinberger 
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A nuclear reaction that can take place when two nu- 
clei collide at very high energy (typically 500 MeV 
per nucleon and up), in which the involved nuclei 
are either disintegrated into their constituents (pro- 
tons and neutrons), light nuclei, and elementary par- 
ticles, or a large number of nucleons are expelled 
from the colliding system resulting in a nucleus with 
a smaller atomic number. This mechanism is clearly 
different from fusion reactions induced by heavy 
or light ions with modest kinetic energy (typically 
5 MeV per nucleon) where, after formation of a com- 
pound nucleus, only a few nucleons are evaporated. 
A spallation reaction can be compared to a glass that 
shatters in many pieces when it falls on the ground. 
The way that the kinetic energy is distributed over 
the different particles involved in a spallation reac- 
tion and the process whereby this results in residues 
and fluxes of outgoing particles are not well under- 
stood. See NUCLEAR FUSION. 

Reactions in space. Spallation reactions take place 
in interstellar space when energetic cosmic rays 
(such as high-energy protons) collide with interstel- 
lar gas, which contains atoms such as carbon, ni- 
trogen, and oxygen. This leads to the synthesis of 
light isotopes, such as °Li, °Be, !°Be, and '!B, that 
cannot be produced abundantly in nucleosynthesis 
scenarios in the big bang or stellar interiors. See NU- 
CLEOSYNTHESIS. 

Laboratory reactions. In terrestrial laboratories 
spallation reactions are initiated by bombarding tar- 
gets with accelerated light- or heavy-ion beams, and 
they are used extensively in basic and applied re- 
search, such as the study of the equation of state 
of nuclear matter, production of energetic neutron 
beams, and radioactive isotope research. 


Heavy-ion reactions. When a spallation reaction oc- 
curs as the result of a heavy nucleus colliding at high 
impact energy with another heavy nucleus (for ex- 
ample, two lead nuclei with 82 protons and 126 neu- 
trons), the kinetic energy of the bombarding nucleus 
is transformed into compression, thus increasing the 
density of the nuclear matter, and into thermal mo- 
tion of the nucleons. New particles are thereby cre- 
ated, some of which escape the nuclear medium. By 
studying these escaping particles, information can be 
extracted concerning the equation of state of nuclear 
matter. As in ordinary gases, liquids, and solids, this 
equation of state gives information about properties 
of nuclear matter, such as the compressibility, as a 
function of density and temperature. The equation 
of state of nuclear matter is relevant to better un- 
derstanding the creation of neutron stars and black 
holes. At extremely high bombarding energy, so high 
a temperature might be reached that the nucleons 
(protons and neutrons) would disintegrate into their 
constituents, known as quarks, and, together with 
the gluons that carry the strong interaction between 
the quarks—like small strings—form the so-called 
quark-gluon plasma. At present it is assumed that 
such a state was present at the very beginning of 
the universe, immediately after the big bang. See BIG 
BANG THEORY; QUARK-GLUON PLASMA; RELATIVISTIC 
HEAVY-ION COLLISIONS. 

Light-ion-induced reactions. When a light nucleus (such 
as a proton) hits a heavy nucleus (such as lead) 
with an energy of a few hundred megaelectron- 
volts or more, spallation can also take place. The 
resulting neutrons that escape the reaction are inten- 
sively used in condensed-matter, chemistry, geology 
and biology research at different spallation neutron 
source facilities. See NEUTRON; NEUTRON DIFFRAC- 
TION; RADIATION DAMAGE TO MATERIALS; SLOW NEU- 
TRON SPECTROSCOPY. 

The light-ion-induced spallation reaction is less vi- 
olent, and there is a nonnegligible probability that 
only a number of nucleons (10-40) will be removed 
from the target nucleus, resulting in a radioactive iso- 
tope. Such spallation reactions are used intensively 
to produce new isotopes. On the chart of nuclides, 
a representation of the different isotopes as a func- 
tion of number of neutrons and protons, these ra- 
dioactive isotopes can lie far away from the line of 
stable isotopes and have very short half-lives, on the 
order of a few milliseconds. Their study yields infor- 
mation about the manifestation of the electromag- 
netic, strong, and weak interactions in the nuclear 
medium, and their properties determine to a certain 
extent the nucleosynthesis processes taking place 
in explosive stellar environments. However, spalla- 
tion of heavy atoms can lead to the production of 
more then 100 different nuclei. Therefore the de- 
sired isotopes must be separated from the debris. 
Electromagnetic mass spectrometers combined, for 
example, with laser ion sources are used, and numer- 
ous new isotopes have been discovered and stud- 
ied. Several projects around the world have been set 
up to accelerate radioactive isotopes that are pro- 
duced in proton-induced spallation reactions (among 


others), in order to carry out studies of nuclear re- 
actions that, in turn, involve these reaction prod- 
ucts. See MASS SPECTROSCOPE; NUCLEAR REACTION; 
RADIOACTIVE BEAMS; RESONANCE IONIZATION SPEC- 
TROSCOPY. Piet Van Duppen 

Bibliography. H. L. Ravn and B. W. Allardyce, On-line 
mass separators, in D. A. Bromley (ed.), Treatise on 
Heavy Ion Science, vol. 8, p. 363, Plenum Press, New 
York, 1989; C. E. Rolfs and W. S. Rodney, Cauldrons 
in the Cosmos, pp. 508-514, University of Chicago 
Press, 1988; R. Silberberg and C. H. Tsao, Spallation 
process and nuclear interaction products of cosmic 
rays, Phys. Rep., 191:351-408, 1990. 


E) 
Sparganosis 

An infection by the plerocercoid, or sparganum, of 
certain species of the genus Spirometra. Members 
of this genus have a life cycle much like that of Di- 
phyllobothrium latum. The adult normally occurs 
in the intestine of dogs and cats, the procercoid in 
copepods, and the plerocercoid in the musculature 
of frogs, snakes, or aquatic mammals, but not in fish. 
The whitish, unsegmented plerocercoid usually is 
an inch or so in length, but may reach a length of 
a foot or more. Humans may become infected acci- 
dentally by drinking water with infected copepods 
or by eating hosts containing plerocercoids. Since 
humans are not suitable hosts for the adult stage, 
the plerocercoids leave the gut and enter the mus- 
culature, body cavities, or other sites and remain as 
plerocercoids. In Eastern countries, freshly opened 
frogs are sometimes used as poultices on sores, par- 
ticularly around the eyes, and if plerocercoids are 
present, they may invade the human tissues. Human 
sparganosis is rare in North America. See PSEUDO- 
PHYLLIDEA. Reino S. Freeman 


E) 
Spark gap 

The region between two electrodes in which a dis- 
ruptive electrical spark may take place. The gap 
should be taken to mean the electrodes as well as 
the intervening space. Such devices may have many 
uses. The ignition system in a gasoline engine fur- 
nishes a very important example. Another important 
case is the use of a spark gap as a protective device in 
electrical equipment. Here, surges in potential may 
be made to break down such a gap so that expensive 
equipment will not be damaged. The dependence 
of spark-gap operation on such factors as pressure, 
length, electrode characteristics, and time depen- 
dence of potential is quite complicated. See BREAK- 
DOWN POTENTIAL; ELECTRIC SPARK. Glenn H. Miller 


[=E] 
Spark knock 

The sound and related effects produced in spark- 
ignited internal combustion engines by instanta- 
neous ignition and combustion (autoignition) of the 


gaseous fuel-air mixture ahead of the advancing 
flame front. Spark knock is a complex physicochem- 
ical phenomenon. 

After spark ignition, a flame travels outward from 
the spark plug and, under normal combustion, will 
progressively burn the entire fuel-air charge. The 
burned gas liberates heat and expands, leading to in- 
creased pressure and temperature in the unburned 
gas ahead of the flame front. In addition, cylin- 
der pressure and temperature are increasing due 
to the upward, compressive motion of the piston. 
The result is that the unburned gas may be raised 
above its autoignition temperature. However, be- 
fore autoignition occurs, time must be allowed for 
the chemical reactions that precede knock. If the 
flame front passes through the unburned gas before 
these reactions are completed, normal combustion 
results. If these reactions occur too quickly or if the 
flame front velocity is too small, the unburned gas 
spontaneously ignites and burns instantaneously (see 
illus.). While this process is often called detonation, 
it is not identical to the detonation of explosives. The 
instantaneous combustion results in a very intense 
pressure wave that is internally reflected within the 
cylinder at a characteristic frequency (5-9 kHz) re- 
lated to the velocity of sound within the cylinder and 
the dimensions of the combustion chamber. The flex- 
ing of the cylinder wall and cylinder head produces 
the audible, high-frequency (pinging) sound known 
as spark knock. See COMBUSTION CHAMBER; EXPLO- 
SIVE. 

Besides sound, spark knock can result in pitting, 
or erosion, of the combustion chamber, damage to 
spark plug electrodes, and possible structural dam- 
age to the engine. Spark knock also leads to loss 
of engine efficiency by inducing spark plug preig- 
nition, resulting in overly advanced spark timing. 
Spark knock also causes intense turbulence within 
the cylinder, aggravating heat loss from the burned 
gas to the colder cylinder and head surfaces and re- 
ducing efficiency. 

There are three general factors affecting the exis- 
tence of spark knock: the effect of engine design, 
the effect of fuel, and the effect of engine operating 
variables. 

Effect of engine design. Engine design variables 
influencing spark knock are compression ratio, com- 
bustion chamber design, spark plug location, and in- 
cylinder gas velocity. Designs that increase the tem- 
perature, pressure, and chemical residence time of 
the unburned gas (end gas) increase spark knock. 
Increased compression ratio, off-center plug loca- 
tion, and slow-burn combustion chambers all lead 
to increased spark knock. A faster-burning chamber, 
due to higher in-cylinder gas velocity, and central 
plug location increase knock resistance. Character- 
istics of faster-burning chambers include rotational 
motion (swirl) of the charge (due to off-cylinder axis 
charge admission); in-cylinder jetting action at the 
top of the compression stroke caused by nonuni- 
form squeezing of the charge (due to proximity of 
portions of the piston and cylinder head); and use 
of two or more spark plugs. A spark plug should be 
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Sequence of spark knock. (a) Initial flame growth around the end of the spark plug. (b) Beginning of combustion and 
progression of flame through unburned gas. (c) Autoignition in the unburned end gas ahead of the flame. (d) Spark knock, or 


detonation. (Champion Spark Plug Co.) 


located closer to the exhaust valve than the intake 
valve, since the former is hotter. Turbocharged and 
supercharged engines have increased spark-knock 
tendency because of higher intake-manifold air pres- 
sure. Stratified charge spark-ignited engines, having 
inhomogeneous air-fuel ratio distribution in each 
cylinder, have decreased knock tendency. See IG- 
NITION SYSTEM; SPARK PLUG; SUPERCHARGER; TUR- 
BOCHARGER. 

Effect of fuel. The fuel octane number represents 
its resistance to spark knock. Higher octane numbers 
are due to higher autoignition temperatures or longer 
end-gas chemical reaction times. Spark knock can be 
eliminated by substituting a high-octane fuel for one 
of lower octane. See OCTANE NUMBER. 

Either fuel structure or fuel additives determine oc- 
tane number. More compact hydrocarbon molecules 
have higher octane numbers than do long-chain 
molecules. For many years, one of the most popular 
and effective antiknock additives was tetraethyllead. 
However, tetraethyllead use for new automobile en- 
gines sold in the United States has been substantially 
eliminated because of lead’s ability to poison plat- 
inum catalysts. Platinum-based catalysts are required 
for aftertreatment of HC, CO, and NO, exhaust emis- 
sions to render them harmless, as required by fed- 
eral regulations. With the introduction of catalysts, 
engine compression ratios were reduced to permit 
extensive use of lower-octane nonleaded gasoline. 
Increasingly, nonleaded octane improvers, such as 
ethanol, methanol, and methyl butyl tertiary ether, 
are added to refinery gasoline. However, consider- 
ably larger amounts of these than of tetraethyllead 
are required to provide equivalent antiknock. See AL- 
COHOL FUEL; GASOLINE. 

Effect of operating variables. Engine operating vari- 
ables, such as spark advance, revolutions per minute, 
throttle angle, coolant temperature, intake air tem- 
perature and humidity, and air/fuel ratio, can have a 
major influence on spark knock. To reduce or elim- 
inate spark knock, spark advance can be reduced 
(retarded), revolutions per minute reduced, throttle 
angle decreased, coolant temperature decreased, in- 
take air temperature reduced, intake air humidity in- 
creased, and air/fuel ratio reduced. 


Many automotive engines, including most super- 
charged and turbocharged spark-ignition engines, 
have a piezoelectric knock sensor. When detonation 
occurs, the sensor sends a small signal voltage to 
the detonation-control system or the electronic en- 
gine control system. This retards the ignition timing 
until the detonation stops. On some supercharged 
or turbocharged engines, the detonation-control 
system reduces the boost pressure in the intake 
manifold. On other supercharged or turbocharged 
engines, detonation is controlled by first retarding 
ignition timing and then reducing boost pressure. See 
AUTOMOTIVE ENGINE; COMBUSTION; INTERNAL COM- 
BUSTION ENGINE. Joseph R. Asik; Donald Anglin 

Bibliography. E. A. Avallone and T. Baumeister III 
(eds.), Marks’ Standard Handbook for Mechanical 
Engineers, 10th ed., 1996; A. By, B. Kempinski, and 
J. M. Rife, Knock in Spark Ignition Engines, Soc. Au- 
tomot. Eng. Pap. 810147, 1981; E. E Obert, Internal 
Combustion Engines and Air Pollution, 1973; C. E 
Taylor, The Internal Combustion Engine in Theory 
and Practice, 2d rev. ed, vol. 2, 1984. 
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Spark plug 

A device that screws into the combustion chamber 
of an internal combustion engine to provide a pair of 
electrodes between which an electrical discharge is 
passed to ignite the combustible mixture. The spark 
plug consists of an outer steel shell that is electri- 
cally grounded to the engine and a ceramic insulator, 
sealed into the shell, through which a center elec- 
trode passes (see illus.). The high-voltage current 
jumps the gap between the center electrode and the 
ground electrode fixed to the outer shell. 

The electrodes are made of nickel and chrome 
alloys that resist electrical and chemical corrosion. 
Some center electrodes have a copper core, while 
others have a platinum tip. Many spark plugs have a 
resistor in the center electrode to help prevent radio- 
frequency interference. The parts exposed to the 
combustion gases are designed to operate at temper- 
atures hot enough to prevent electrically conducting 
deposits but cool enough to avoid ignition of the 
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Cross section of a typical spark plug. (Champion Spark 
Plug Co.) 


mixture before the spark occurs. See IGNITION SYS- 
TEM. Donald L. Anglin 
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Spatangoida 

An order of exocyclic Euechinoidea in which the pos- 
terior ambulacral plates form a shield-shaped area 
behind the mouth. The plates are arranged in two 
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Echinocardium cordatum, showing diagnostic characters 
of spatangoids. (a) Aboral view. (b) Posterior aspect. 
(c) Amphisternous plastron. (d) Apical system. 


similar parallel longitudinal series. The structure is 
termed an amphisternous plastron. Teeth are lack- 
ing; phyllodes develop, but without bourrelets. Thus 
there is no floscelle. The apical system is compact. 
The families are defined mainly by reference to the 
fascioles, which are ribbonlike bands of minute, 
close-set uniform ciliated spinules on various parts 
of the test (see illus.). According to their position, 
they are defined as anal, subanal, inner (surround- 
ing the apical system), peripetalous (surrounding 
the petals), or marginal. The earliest forms were 
the Toxasteridae of the Lower Cretaceous, which 
lacked fascioles and petals. The surviving Palaeop- 
neustidae are deep-sea forms with an oval test, long 
spines, and weakly developed fascioles and petals. 
The Spatangoida are more specialized, heart-shaped 
forms which live in mud or sand. The order reached 
its maximum in the mid-Tertiary, but is still richly rep- 
resented today in most seas. See ECHINODERMATA; 
ECHINOIDEA; EUECHINOIDEA. Howard B. Fell 


—— 
Spearmint 


Either of two vegetatively propagated, clonal culti- 
var species of mints of the family Lamiaceae (Labi- 
atae). They are grown primarily in Idaho, Indiana, 
Michigan, Washington, and Wisconsin as a source of 
essential oil of spearmint. 

Cultivation. In agricultural production the dor- 
mant stolons, or leafless underground stems (often 
misnamed roots), of these perennial species are 
planted in a 6-in-deep (15-cm) trench in rows 
38 in. (97 cm) apart from February to April and 
chemically weeded with a terbacil herbicide; the leaf 
crop is mowed and windrowed in July or August 
at the time of flowering and plant maturity, partly 
dried, and then steam-distilled. Fields are shallow- 
plowed in early November, and the second-year, 
third-year, and subsequent crops are a solid stand, 
or meadow mint, in all areas that do not use rill 
irrigation. 

Cultivars. The original cultivar “Native spearmint” 
with flowers in terminal inflorescences (see illus.), 
introduced from Europe into New England in pio- 
neer days, is a sterile allotriploid hybrid having 36 
somatic chromosomes, an example of a botanical 
mule. Native cultivar is usually called Mentha spicata 
(2n = 48) but is actually a hybrid between 
this species and M. longifolia (2n = 24). Native 
spearmint comprises about one-third of the pro- 
duction in the United States. Scotch spearmint culti- 
var (illus. b) with flowers in the axils of the leaves, 
M. cardiaca (2n = 72, some strains have 60 chro- 
mosomes), was introduced in 1910, apparently from 
Scotland. Mentha cardiaca has hybrid vigor and usu- 
ally yields 50-65 lb (23-30 kg) of oil per acre in the 
Midwest as opposed to 30-40 lb (14-18 kg) per acre 
for Native, with yields up to 80-125 lb (6-57 kg) in 
the Far West. Both varieties have similar but not iden- 
tical oil composition, with 60-69% carvone, 7-20% 
limonene, and 1-2% cineole as main compounds. 
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Spearmint. (a) Native spearmint (Mentha spicata). (b) Scotch spearmint (M. cardiaca). 


(Courtesy of A. M. Todd Co.) 


Research has shown that Scotch is a very specific, 
very select, sterile F,; hybrid between M. arvensis 
(2n = 96 or 72) and M. spicata. Both cultivars are 
subject to Puccinia rust and powdery mildew, and 
Scotch is also susceptible to Verticillium wilt. Muta- 
tion breeding is in progress to obtain resistant strains 
for these major diseases. 

Uses. Principal uses of the oil are in flavoring gum, 
toothpaste, and candy. Chopped fresh leaves of M. 
spicata preserved in vinegar are used as a condiment 
served with lamb, especially in England, and dried 
or freeze-dried leaves of several strains are used in 





flavoring soups, stews, tea, or sauces. Sprigs of the 
decorative curly mint M. crispa (or M. spicata var. 
crispata) are often used in mixed drinks such as mint 
juleps. 

Other mint species, as M. niliaca, M. rotundifolia, 
and M. cordifolia, may have carvone and a spearmint 
odor, but the presence of too much dihydrocarvone 
or of the alcohols and esters of carvone and dihydro- 
carvone (as carveol and carvyl acetate) or other hy- 
drocarbons such as myrcene makes the commercial 
utilization of their oils unlikely. Although the cultivar 
species have been introduced throughout the world, 
in the United States they grow best between 40 to 
48° north latitude, as in northern Europe, or farther 
north to 55° in the milder climate of Great Britain. 
See LAMIALES; SPICE AND FLAVORING. 

Merritt J. Murray 


——— ey 
Special functions 


Functions which occur often enough to acquire a 
name. Some of these, such as the exponential, log- 
arithmic, and the various trigonometric functions, 
are extensively taught in school and occur so fre- 
quently that routines for calculating them are built 
into many pocket calculators. See DIFFERENTIATION; 
LOGARITHM; TRIGONOMETRY. 

The more complicated special functions, or higher 
transcendental functions as they are often called, 
have been extensively studied by many mathemati- 
cians because they arose in the problems which 
were being studied. Among the more useful func- 
tions are the following: the gamma function defined 
by Eq. (1), which generalizes the factorial; the related 
beta function defined by Eq. (2), which generalizes 
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the binomial coefficient; and elliptic integrals, which 
arose when mathematicians tried to determine the 
arc length of an ellipse, and their inverses, the ellip- 
tic functions. The hypergeometric function and its 
generalizations include many of the special functions 
which occur in mathematical physics, such as Bessel 
functions, Legendre functions, error functions, and 
the classical orthogonal polynomials of Jacobi, La- 
guerre, and Hermite. The zeta function defined by 
Eq. G) has many applications in number theory, 
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and it also arises in M. Planck’s work on radiation. 
See BESSEL FUNCTIONS; ELLIPTIC FUNCTION AND IN- 
TEGRAL; GAMMA FUNCTION; HEAT RADIATION; HYPER- 
GEOMETRIC FUNCTIONS; LEGENDRE FUNCTIONS; NUM- 
BER THEORY; ORTHOGONAL POLYNOMIALS. 

Bernoulli polynomials and numbers. Bernoulli poly- 
nomials, defined by the generating function in 


Eq. (4), and Bernoulli numbers, B,, = B,(0), arise 
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in many applications. (Different notations are used, 
especially for Bernoulli numbers.) They occur in the 
Euler-Maclaurin summation formula (5), where a, m, 
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and n are arbitrary integers, a < n, m > 0, and nota- 
tion (6) applies. The symbol [x] stands for the largest 
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integer less than or equal to x. This formula is very 
useful in finding numerical values of some slowly 
convergent infinite series or partial sums of diver- 
gent series. An important sum which can be eval- 
uated using Bernoulli numbers is given by Eq. (7). 
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Bernoulli polynomials and numbers also play an 
important role in number theory, combinatorial anal- 
ysis, and the study of spline functions. Splines are 
piecewise polynomials which are spliced together 
in a smooth manner, up to a certain degree of differ- 
entiability. They are very useful in fitting curves to 
numerical data and are the solutions to a number of 
important extremal problems. See COMBINATORIAL 
THEORY. 

More complicated functions. All of the special func- 
tions mentioned above can be given by explicit rep- 
resentations, either as a series or an integral. There 
are other functions, among them Lamé functions, 
spheroidal and ellipsoidal wave functions, and Math- 
ieu functions, which arise as solutions to important 
differential equations, and for which explicit formu- 
las are lacking. At least the formulas are not nearly as 
explicit as for the above functions. The ordinary dif- 
ferential equations satisfied by these functions arise 
from Laplace’s equation or the wave equation when 
it is solved by separation of variables in certain sys- 
tems of curvilinear coordinates. Instead of having ex- 
plicit integral representations, these functions satisfy 
special integral equations. See DIFFERENTIAL EQUA- 
TION; INTEGRAL EQUATION; LAPLACE’S DIFFERENTIAL 
EQUATION; WAVE EQUATION. 

For example, the prolate spheroidal wave func- 
tions of order zero, W,@), n = 0, 1, ..., are 
the bounded continuous solutions of differential 
equation (8) and integral equation Y), where x , and 
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Àn are eigenvalues. Differential equation (8) is 
the usual starting place for the study of these 
functions. It arises when solving the wave equation 
in prolate spheroidal coordinates, and the integral 
equation has classically been derived as a property 
of these functions. Integral equation (9) is of interest 
because of its applications to stochastic processes, 
the study of lasers, the uncertainty principle, 
antenna theory, and the statistical theory of energy 
levels of complex systems. See STOCHASTIC PROCESS; 
UNCERTAINTY PRINCIPLE. 

Historical development. Typically, special functions 
are discovered in the course of working on one prob- 
lem. Many properties are then discovered, not only 
to aid in the solution of the problem which gave rise 
to them, but as a mathematical exercise, and then 
some of the new properties are used to solve com- 
pletely different problems. 

Each generation of mathematicians has a way of 
looking at mathematics which is different from pre- 
vious generations. In the nineteenth century, com- 
plex analysis was being developed, and many impor- 
tant properties of special functions were discovered 
in the course of this development. In the second 
half of the twentieth century, Lie group and Lie al- 
gebra methods have been applied to obtain other 
important properties of special functions. See COM- 
PLEX NUMBERS AND COMPLEX VARIABLES; LIE GROUP. 

Richard Askey 
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Speciation 

The process by which new species of organisms 
evolve from preexisting species. It is part of the 
whole process of organic evolution. The modern pe- 
riod ofits study began with the publication of Charles 
Darwin’s and Alfred Russell Wallace’s theory of evo- 
lution by natural selection in 1858, and Darwin’s On 
the Origin of Species in 1859. 

There was no problem of speciation during the pe- 
riod when it was believed that species were divinely 
created and immutable. Belief in the fixity of species 
was almost universal before the middle of the nine- 
teenth century. Then it was gradually realized that 
all species continuously change, or evolve; however, 
the causative mechanism remained to be discovered. 

Darwin proposed a mechanism. He argued that (1) 
within any species population there is always some 
heritable variation; the individuals differ among 
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themselves in structure, physiology, and behavior; 
and (2) natural selection acts upon this variation by 
eliminating the less fit. Thus if two members of an an- 
imal population differ from each other in their ability 
to find a mate, obtain food, escape from predators, re- 
sist the ravages of parasites and pathogens, or survive 
the rigors of the climate, the more successful will be 
more likely than the less successful to leave descen- 
dants. The more successful is said to have greater 
fitness, to be better adapted, or to be selectively fa- 
vored. Likewise among plants: one plant individual 
is fitter than another if its heritable characteristics 
make it more successful than the other in obtaining 
light, water, and nutrients, in protecting itself from 
herbivores and disease organisms, or in surviving ad- 
verse climatic conditions. Over the course of time, 
as the fitter members of a population leave more de- 
scendants than the less fit, their characteristics be- 
come more common. 

This is the process of natural selection, which 
tends to preserve the well adapted at the expense 
of the ill adapted in a variable population. The ge- 
netic variability that must exist if natural selection is 
to act is generated by genetic mutations in the broad 
sense, including chromosomal rearrangements to- 
gether with point mutations. See GENETICS; MUTA- 
TION. 

If two separate populations of a species live in 
separate regions, exposed to different environments, 
natural selection will cause each population to ac- 
cumulate characters adapting it to its own environ- 
ment. The two populations will thus diverge from 
each other and, given time, will become so different 
that they are no longer interfertile. At this point, spe- 
ciation has occurred: two species have come into 
existence in the place of one. This mode of speci- 
ation, speciation by splitting, is probably the most 
common mode. Two other modes are hybrid specia- 
tion and phyletic speciation; many biologists do not 
regard the latter as true speciation. 

Splitting. There are four ways in which a species 
population can become separated into two (or more) 
parts that can undergo genetic divergence and evolve 
into separate species (Fig. 1). G. L. Bush has named 
them types Ia, Ib, II, and III. 

In type Ia an extensive population becomes split 
into comparatively large parts (Fig. 1a). In terrestrial 
species this may happen because movement of the 
Earth’s crustal plates rafts the parts away from each 
other; or else a barrier such as a mountain range or a 
tract of desert may form and split a once-continuous 
population. A marine population is split if a land 
bridge forms which cuts through it; for example, 
the fall in sea level that accompanied the Pleistocene 
glaciations created a land bridge between Alaska and 
Siberia. 

A type Ib split occurs if a small, outlying segment 
of a population becomes detached from the rest 
(Fig. 1b). Such a peripheral isolate is usually a colony 
founded by a few emigrants from the main body of 
the species population. The colony can evolve into 
a new species even if it is not totally isolated from 
the parent population, provided its necessarily small 


gene pool chances to be well adapted to the habitat 
in which it finds itself. The “preadapted” possessors 
of these genes will be selectively favored within their 
small local habitat, and will quickly proliferate. 

Population splits of both types Ia and Ib cause 
the diverging populations to be geographically sepa- 
rated, or allopatric. The resultant genetic separation 
is known as allopatric speciation. 

In type II a population becomes split into geo- 
graphically contiguous parts between which there 
is a limited amount of gene exchange. The environ- 
ments of the daughter populations are assumed to 
differ sufficiently for natural selection to override 
the effects of the restricted gene flow, and specia- 
tion can therefore occur. This is parapatric speciation 
(Fig. 1c). 

A type NI split occurs if, within a single geographic 
region, parts of a population in some way become 
so isolated from each other that gene exchange be- 
tween them falls below the level needed to maintain 
the integrity of the original population. The result is 
known as sympatric speciation (Fig. 1d). It is proba- 
bly common among parasites and parasitoids, but is 
thought to be rare or nonexistent in free-living organ- 
isms. Even though closely related species adapted to 
different habitats, or with different modes of life, or 
with different resource requirements, often co-occur 
in the same geographic area, their co-occurrence 
should not be ascribed to sympatric speciation un- 
less a mechanism can be found that could have ini- 
tiated their divergence in the first place. A more 
likely explanation for such co-occurrences is that the 
species evolved allopatrically or parapatrically, that 
ecological divergence accompanied speciation, and 
that the new species then became secondarily sym- 
patric by migrating into each other’s areas after the 
geographic barrier that formerly separated them dis- 
appeared. See ISLAND BIOGEOGRAPHY. 

Descriptions of allopatric, parapatric, and sym- 
patric speciation often tacitly assume that in the 
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Fig. 1. Different modes of speciation; a-e are described in 
the text. The two tints represent the geographic ranges of 
diverging and speciation species populations. 


absence of clear, unmistakable barriers, gene ex- 
change occurs freely throughout very extensive pop- 
ulations; often an entire species is treated as a sin- 
gle breeding population. However, P. R. Ehrlich and 
P. H. Raven have argued that usually the total pop- 
ulation of a whole species is broken into a large 
number of small breeding populations (demes) that 
are fairly isolated from each other and among which 
gene exchange is comparatively rare. Thus a deme 
rather than a species population is the unit of evolu- 
tion. As a result, sympatric speciation will sometimes 
seem to have occurred if a deme near the center of a 
species’ range chances to diverge from the surround- 
ing demes; the process may be called quasisympatric 
speciation (Fig. le). The fact that all demes do not 
diverge from one another, giving as many species 
as there are demes, presumably results from stabiliz- 
ing selection. Stabilizing selection causes groups of 
demes to persist in their resemblance to each other, 
and such groups are the “populations” constituting 
the visible units of evolution. See POPULATION GE- 
NETICS. 

Isolating mechanisms. Two genetically diverging 
populations are not separate species so long as it re- 
mains possible for them to lose their distinctness, by 
interbreeding, if they chance to become sympatric. 
For speciation to be complete, the populations must 
evolve isolating mechanisms that prevent gene ex- 
change between them; this allows them to maintain 
their genetic integrity. 

There are many kinds of isolating mechanisms. 
The diverging populations may become genetically 
incompatible, so that hybrids either are never pro- 
duced or are sterile. Even when the diverging popula- 
tions can be shown to be infertile, they may be effec- 
tively isolated by a variety of mechanisms; they may 
breed at different times or in different habitats; they 
may have evolved different patterns of mating behav- 
ior, different mating calls, or different sex attractants 
(pheromones), so that individuals of the two pop- 
ulations never attempt to copulate; copulation may 
be mechanically impossible because of the structure 
of the genitalia (this occurs in some insects); and in 
insect-pollinated plants, diverging populations some- 
times become adapted to different species of insect 
pollinators. 

Hybrid speciation. In plants, new species are some- 
times formed abruptly by the following mechanism. 
The accidental crossing of two diploid plants of dif- 
ferent, though related, species yields a healthy but 
sterile diploid hybrid. The hybrid is sterile because 
the chromosomes it has received from its parents 
cannot form normal bivalent pairs; meiosis is up- 
set, and functional gametes cannot be produced. But 
now chromosome doubling occurs in the sterile hy- 
brid, making it tetraploid; normal meiosis becomes 
possible; fertile gametes are produced, and a new 
species has appeared. 

In symbols, let the genomic constitution of the 
parents be AA and BB respectively. That of their ster- 
ile hybrid is AB, in which none of the chromosomes 
has a homologous partner to pair with at meiosis. 
After chromosome doubling, however, the hybrid, 


now tetraploid, has the constitution AABB; meiosis 
can proceed normally, and the gametes are fertile. 
The new species is called an amphidiploid (or allo- 
tetraploid). 

A whole range of similar, but more elaborate, vari- 
ants of this process occur as well, yielding hybrids 
with different numbers of chromosome sets. The 
general name for the process, with numbers unspeci- 
fied, is amphiploidy (or allopolyploidy or alloploidy). 

The process happens because chromosome dou- 
bling is a common occurrence in plants; doubling 
results either from somatic doubling at mitosis (the 
paired chromosomes fail to separate) or from nonre- 
duction at meiosis. Amphiploids may be produced 
repeatedly, with new lineages being added in a suc- 
cession of outcrossings, backcrossings, and chromo- 
some doublings. The result is a polyploid complex, 
an array of interrelated diploid and polyploid species. 
See MEIOSIS. 

Speciation in uniparental organisms. Many plants re- 
produce uniparentally for long periods of time. Uni- 
parental reproduction occurs in two ways: by au- 
togamy (self-fertilization) and by apomixis (asexual 
reproduction); the latter term includes both vege- 
tative multiplication and agamospermy (the asexual 
formation of viable seeds). A uniparental plant is not 
a member of a “biological species” as the term is usu- 
ally defined, namely, a reproductively isolated collec- 
tion of interbreeding individuals. This has led to the 
definition, by G. G. Simpson, of the “evolutionary 
species,” a more inclusive category, defined so as to 
apply to both uniparental and biparental organisms. 
An evolutionary species is a collection of genotypi- 
cally similar individuals of common ancestry whose 
similarity is maintained by stabilizing selection in a 
given environment. 

In describing speciation in uniparental organ- 
isms, one must consider how a new uniparental 
lineage arises in the first place, and how its mem- 
bers can subsequently speciate. A uniparental lin- 
eage arises when two sexual species hybridize and 
the hybrids, though sexually sterile, reproduce uni- 
parentally. If many crossings occur between the two 
sexual species, the result is a collection of hybrid indi- 
viduals that are similar, but not identical, genetically. 
The collection constitutes an evolutionary species. 
Suppose now that each individual becomes the an- 
cestor of a succession of uniparentally produced de- 
scendants. If the progenitors of these lineages begin 
growth in a number of slightly different habitats, 
natural selection will lead to differentiation among 
them: the survivors in each habitat will be adapted 
to that habitat. Thus the plants in any one habi- 
tat are likely to differ slightly from those in other 
habitats. The differences are genetic, and they per- 
sist. Each distinguishable group of plants is called a 
microspecies. An array of microspecies, descended 
from the same ancestral pair of sexual species, is an 
agamic complex. 

Microspecies that reproduce asexually (apomic- 
tically) are usually highly heterozygous; but auto- 
gamous microspecies lose the heterozygosity of 
their hybrid origin after several generations of 
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Fig. 2. Evolution of two lineages. Lineage 1 (species A) persists unchanged. Lineage 2 
changes with time and has been divided, at an arbitrary boundary, into two 
chronospecies, B and C. (After P. D. Gingerich, Univ. Mich. Pap. Paleontol., 15:1-140, 
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self-fertilization and become complete, or nearly 
complete, homozygotes. 

Phyletic evolution and chronospecies. Consider a se- 
ries of fossils spanning a long interval of time dong in 
the geological sense) and believed to represent the 
successive forms of a single lineage. If there has been 
appreciable evolution within the lineage, the change 
is described as phyletic evolution or anagenesis. 

Fossils from a single lineage but of different ages 
may differ from each other so conspicuously that 
even though the younger is directly descended 
from the older, paleontologists find it convenient to 
treat them as taxonomically different species. Such 
species are called chronospecies (or successional 
species, paleospecies, or evolutionary species). They 
are not species in the usual sense; questions as to 


Fig. 3. Two contrasted modes of species evolution: (a) punctuated equilibrium model and 
(b) phyletic gradualism model. (After S. M. Stanley, Macroevolution, Pattern and Process, 
W. H. Freeman, 1979) 


whether or not they can interbreed do not arise, 
since they are never alive at the same time. In a lin- 
eage evolving gradually, decisions on where to put 
the dividing line between each chronospecies and its 
successor are arbitrary. A “vanished” chronospecies, 
one that has disappeared and been replaced by a suc- 
cessor chronospecies of the same lineage, is said to 
have undergone pseudoextinction. 

Figure 2 contrasts the behavior of a lineage evolv- 
ing phyletically dineage 2) with one persisting un- 
changed (lineage 1). The “species” labeled B and C 
in lineage 2 are chronospecies. At an arbitrary point 
on the time scale, B becomes pseudoextinct and is 
succeeded by C. 

The transition from a chronospecies to its succes- 
sor is not usually regarded as speciation in the strict 
sense. Speciation as usually defined entails the split- 
ting of lineages. 

Quantum speciation versus phyletic gradualism. 
When a lineage splits, the two descendant lineages 
may diverge slowly or rapidly. There has been much 
debate on whether one or other of these modes has 
dominated the course of evolution and, if so, which. 
The two modes to be contrasted are shown diagram- 
matically in Fig. 3. 

The tree on the right in the figure represents evo- 
lution and speciation according to the model known 
as phyletic gradualism. It supposes that all lineages 
undergo continuous gradual change (that is, phyletic 
evolution as described above) at a fairly constant rate. 
When splits occur, the descendant lineages diverge 
gradually from each other. There is no appreciable 
change in evolutionary rate at the time a split takes 
place. While the separated lineages are still capable 
of interbreeding to a limited extent, though with 
reduced fertility, they are known as semispecies. 
Their morphological divergence may sometimes be 
so slow that they remain practically indistinguishable 
even after they have lost the capacity to interbreed 
and have therefore reached specific rank; if so, they 
are known as cryptic species. 

In Fig. 3a the tree represents the punctuated equi- 
librium model of evolution and speciation, so called 
by N. Eldredge and S. J. Gould. It supposes that the 
splitting of a parent species population is always (or 
nearly always) accompanied by quantum speciation, 
the extremely rapid (in the geological sense) and 
pronounced divergence of one of the descendant 
lineages from the ancestral form. It is assumed that 
the divergent species originates most often by the 
chance splitting off of a small peripheral isolate from 
an extensive parental population; this is allopatric 
speciation of type Ib (Fig. 1). If the isolate is small, 
its small gene pool may not be at all representative 
of the parent gene pool from which it is drawn. It 
may, simply by chance, deviate markedly. Further, 
because of its peripheral location, the environment 
of the isolate is likely to differ from that of the main 
body of the species population, and natural selection 
will then tend to magnify the deviation. 

Evolutionary changes associated with the appear- 
ance of new species are thus, according to this 
model, concentrated in speciation events of very 


short duration; they are represented by the hori- 
zontal segments of the tree’s branches (Fig. 3a). 
During the long time intervals between succes- 
sive speciation events, a species remains almost un- 
changed, as shown by the vertical segments of the 
branches. While a species is in this static condition, 
phyletic evolution may bring about minor adjust- 
ments; this within-species evolution is known as mi- 
croevolution. But the magnitude of these phyletic 
changes is negligible compared with that accompa- 
nying macroevolution, the formation of new species. 
Ernst Mayr introduced the terms maintenance evo- 
lution and switch evolution to describe these two 
evolutionary modes. 

Many students of evolution are of the opinion 
that most groups of organisms evolve in accordance 
with the punctuated equilibrium model rather than 
by phyletic gradualism. There are two chief argu- 
ments for this view. First, it is clear from the fossil 
record that many species persist without perceptible 
change over long stretches of time and then suddenly 
make large quantum jumps to radically new forms. 
Second, phyletic gradualism seems to be too slow a 
process to account for the tremendous proliferation 
of species needed to supply the vast array of living 
forms that have come into existence since life first 
appeared on Earth. See ANIMAL EVOLUTION; SPECIES 
CONCEPT. E. C. Pielou 
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Species concept 


The idea that the diversity of nature is divisible 
into a finite number of definable species. In general, 
species concepts grow out of attempts to understand 
the very nature of biological organization above the 
level of the individual organism. There are two basic 
questions: (1) What does it mean to be a species 
in general? Do all species have certain characteris- 
tics, such as forming genealogical lineages, just as all 
atoms have certain characteristics, such as the abil- 
ity to undergo chemical reactions? (2) What does 
it mean to be a particular species? The first ques- 
tion addresses species concepts. The second ques- 
tion addresses how to apply a species concept to 
living organisms of the world. Does the name Homo 
sapiens apply to a group of organisms existing in 
nature? If so, does it belong, as a member, to a nat- 
ural kind that can be characterized by some set of 
properties? The difference between the questions 
of what it is to be a species in general versus what 
does it mean to be a particular species represents 
the dividing line between what it is to be a natural 
kind and what it is to be a natural individual. To un- 
derstand this distinction, we must first take up the 
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more general question of the nature of kinds. We 
can then return to the question of species concepts. 
See SPECIATION. 

Kinds and particulars. Kinds are attempts to divide 
the world of particulars according to a definition that 
specifies properties that separate individuals into 
two classes: those particulars that have the proper- 
ties are members of the kind, and those particulars 
that do not have the properties belong to another 
kind. What do we mean when we say “particulars”? 
Particulars are individual things in nature. We can 
usually point to particulars, such as pointing to a cat 
or pointing to a truck. A particular belongs to a kind 
if the particular has the properties that define the 
kind. For example, we might define the kind “black 
cat” as those cats that have the property of being 
covered with black fur. My black cat belongs to this 
kind, but my gray cat does not since it lacks the prop- 
erty of the kind. In fact, black cats in general (every 
black cat that has ever lived or will live in the future) 
belong to the kind “black cat.” 

As it turns out, there are at least two types of kinds: 
nominal kinds and natural kinds. The difference be- 
tween nominal and natural kinds lies in how the 
kind functions. Natural kinds function in scientific 
theories of how nature operates, because the theory 
asserts that certain particulars (individuals) have cer- 
tain properties. For example, the kind “atom” func- 
tions in theories of physics and chemistry as the 
smallest kind of matter whose members (particular 
individual atoms) can take part in chemical reactions 
(the property). In turn, hydrogen is a kind of atom 
whose members have one proton (property), while 
helium is a kind of atom whose members have two 
protons (another property). Kinds such as hydrogen 
and helium are natural kinds because members of 
each kind (individual atoms of hydrogen and helium) 
are predicted to exist under the right circumstances 
and have their properties as an assertion of our the- 
ories of chemistry and physics. In contrast to natu- 
ral kinds, most of our language is filled with nom- 
inal kinds. Nominal kinds are not predicted by sci- 
entific theories, but function to allow us to commu- 
nicate ideas. For example, no scientific theory pre- 
dicts the kinds “truck” and “bicycle.” Yet, nominal 
kinds have properties (trucks have four wheels, bi- 
cycles have two) that allow us to distinguish trucks 
from bicycles. Thus, nominal kinds may have use- 
ful functions in communication, but they are not di- 
rectly derived from scientific theories. See SCIENTIFIC 
METHODS. 

Concepts and theories. Since Aristotle, biologists 
have attempted to develop a concept of “species” 
that will function in biology as a natural kind in a 
manner similar to natural kinds in other disciplines 
such as “quark” in physics and “atom” in chem- 
istry. This search was inhibited because philoso- 
phers have traditionally thought of particular species 
(Homo sapiens, Pinus ponderosa, etc.) as natu- 
ral kinds, or sets; or they thought that particular 
species were nominal—arbitrary but useful inven- 
tions of taxonomists to organize the real entities, that 
is, individual organisms. However, work by Michael 
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Ghiselin, David Hull, and others asserted that partic- 
ular species were, in fact, particulars that function 
in evolutionary theory in the same manner as indi- 
vidual atoms function in chemical theory—as inter- 
actors or participants in process. If this is so, then 
each of these individual species would be members 
of a natural kind, “species,” that had properties pre- 
dicted from evolutionary theory. Attempts to capture 
the properties of the kind “species” result in a species 
concept. 

The ideas that organisms could be grouped into 
more or less discrete units, and the idea that some 
of these units were more similar to each other than 
to other units, long predated ideas about evolution. 
That there was a pattern to life’s diversity gave rise 
to the first species concept: members of one species 
are different in their physical characteristics from 
members of other species, and their offspring look 
like them (like begets like). This concept was the 
basis for both folk taxonomies (names of different 
groups of individual organisms in common language) 
and scientific taxonomies (formal names meant to 
have scientific import). As the practice of scientific 
taxonomy developed, species were gathered into 
larger groups based on similarities and differences. 
These groupings were hierarchical and seemed to 
reflect patterns of similarities and differences seen 
in nature. Current evolutionary theories seek to ex- 
plain why life’s diversity seems to be arranged in 
the hierarchical manner that we see in nature. See 
TAXONOMIC CATEGORIES; TAXONOMY; ZOOLOGICAL 
NOMENCLATURE. 

Like all scientific theories, theories of evolution 
predict natural kinds with attendant properties that 
are found in individuals. Theories of evolution can 
be generally divided into two basic themes. Mi- 
croevolutionary theory is concerned with evolu- 
tionary change within and between populations of 
individual organisms. Processes include natural se- 
lection, gene flow, and stochastic processes such as 
genetic drift. These theories do not address the hier- 
archical patterns that predate Darwin and have fasci- 
nated biologists since Linnaeus. Rather, they predict 
that populations of individual organisms will change 
through mutation, selection, and drift in predictable 
ways. Macroevolutionary theory is concerned with 
the origin and fate of species. It directly addresses 
the hierarchical nature of diversity. Process theories 
that purport to explain the hierarchy include spe- 
ciation and extinction. These two themes, micro- 
and macroevolution, intersect at the level of species. 
From the macroevolutionary perspective, species 
are those particulars that originate from various pro- 
cesses of speciation and disappear through extinc- 
tion. Macroevolutionary processes follow the “rules” 
of microevolutionary processes, but add an extra 
level of processes such as response to a physical bar- 
rier that divides a once continuous population into 
two populations. Thus, the natural kind “species” 
has members (particular species) that undergo such 
processes. See MACROEVOLUTION; ORGANIC EVOLU- 
TION. 

The problem with species concepts derives from 


the problem with changing ideas of the world in 
general. Our concepts of natural kinds, and indeed 
whether we think that a certain kind is real (has mem- 
bers in the world) or unreal (properties can be listed, 
but nothing exists that has the properties, like uni- 
corns or fairies), depend on our current theories. The 
recent controversy concerning the definition of the 
kind “planet” is an example. Pluto has been rejected 
(at least for the moment) as being a planet based 
on current theory, and a new kind, dwarf planet, has 
been proposed that includes not only Pluto but other 
smaller bodies in the solar system, such as the aster- 
oid Ceres. The hope is that the closer the theory is 
to nature, the more predictive will be the kinds and 
their properties. So it is with species concepts. 
Types of species concepts. Species concepts, in 
one form or another, have been around for several 
hundred years. Each concept (and there are over 
20 current ones) is an attempt to capture the prop- 
erties of the largest (or smallest) kind and thus per- 
mit a search for particulars (Homo sapiens, etc.) 
that are members of the kind. Of the plethora of 
species concepts, evolutionary biologists (including 
systematists and taxonomists) seem to be converging 
on the concept of species-as-lineages, the Evolution- 
ary Species Concept—species are those things that 
exist between speciation events and that originate 
through speciation events (Henning, 1996; Wiley 
and Mayden, 2000). Other concepts capture part of 
this more general concept. For example, the ven- 
erable Morphological Species Concept claims that 
different species have different or slightly different 
physical characteristics. This is true for many species 
(humans versus chimpanzees), but not true for oth- 
ers where speciation has resulted in behavioral or 
genetic changes with little or no detectable change 
in physical characteristics (as in some closely related 
fruit flies). The popular Biological Species Concept 
asserts that species have the property of reproduc- 
tive isolation. This seems to be true for a great num- 
ber of species, but not for all species and especially 
not for many species of recent origin. This has led 
workers such as Richard Mayden to assert that all 
currently valid species concepts (kinds for which 
examples can be found in nature) are different re- 
flections of the more general natural kind, that is, 
the Evolutionary Species Concept. Of course, even 
ifall evolutionary biologists agreed with Mayden, this 
does not end controversies as to whether a particu- 
lar species is real. For example, is Pinus ponderosa 
(a pine tree) real? We might vigorously disagree as 
to whether or not a speciation event has occurred, 
based on the evidence presented. We might argue 
that Pinus ponderosa is not a species but is actually 
two or more species, or simply a population of an- 
other species. However, if we can agree, our agree- 
ment will be founded both on the empirical data 
and on a species concept that applies in an unam- 
biguous manner, leading to a consilience (a synthesis 
of knowledge) between practice and theory. 
E. 0. Wiley 
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Specific charge 


The ratio of charge to mass, expressed as e/m, of a 
particle. The acceleration of a particle in electromag- 
netic fields is proportional to its specific charge. Spe- 
cific charge can be determined by measuring the ve- 
locity v which the particle acquires in falling through 
an electric potential V(v =./2eV/m); by measuring 
the frequency of revolution in a magnetic field H (the 
so-called cyclotron frequency w = eH/mc, where c is 
the velocity of light); or by observing the orbit of the 
particles in combined electric and magnetic fields. 
In the mass spectrograph, the fields are arranged so 
that particles of differing velocities but of the same 
e/m are focused at a point. See ELECTRON MOTION 
IN VACUUM; ELEMENTARY PARTICLE; MASS SPECTRO- 
SCOPE. Charles J. Goebel 


a 
Specific fuel consumption 


The ratio of the fuel mass flow of an aircraft engine 
to its output power, in specified units. Specific fuel 
consumption (abbreviated sfc or SFC) is a widely 
used measure of atmospheric engine performance. 
For reciprocating engines it is usually given in U.S. 
Customary units of pound-mass per hour per horse- 
power [dbm/h)/hp or lbm/chp-h)], and International 
System (SD units of kilograms per hour per kilowatt 
[(kg/h)/kW]. See RECIPROCATING AIRCRAFT ENGINE. 

For the gas turbine family of atmospheric air- 
craft engines, and for ramjets, performance is usually 
given in terms of thrust specific fuel consumption 
(abbreviated tsfc or TSFC) expressed as fuel mass 
flow per unit thrust output with Customary units of 
pound-mass per hour per pound-force [dbm/b)/1bf] 
or SI units of kilograms per hour per newton 
[(kg/b)/N; 1 N equals approximately 0.225 lbf]. 
For high-supersonic and hypersonic ramjets, spe- 
cific fuel consumption is sometimes given in pound- 
mass per second per pound-force [dbm/s)/Ibf] or 
kilograms per second per newton [(kg/s)/N]. Spe- 
cific thrust (or specific impulse), which is the 
inverse of specific fuel consumption, is often used 
especially for supersonic combustion ramjet (scram- 
jet) performance at Mach numbers above 6 (the 


Mach number Ny, equals 1.0 at the local speed of 
sound). See AIRCRAFT PROPULSION; TURBINE ENGINE 
SUBSYSTEMS; MACH NUMBER; PROPULSION; SPECIFIC 
IMPULSE; TURBINE PROPULSION; TURBOFAN; TURBO- 
JET. 

Since the combustion process in atmospheric air- 
craft engines is supported by oxygen in the ambient 
air, only the hydrocarbon fuel (gasoline for recipro- 
cating engines and kerosine for turbine engines and 
ramjets), which must be carried on board the air- 
craft, needs to be accounted for in determining en- 
gine performance. Methane and hydrogen may be 
used in the future as fuels for atmospheric engines. 
To obtain a low specific fuel consumption, the fuel 
should have both a high heat of combustion (that is, 
energy content per unit mass of fuel) and high cycle 
efficiency. The cycle efficiency is related to the oper- 
ating compression ratio of the engine. See AIRCRAFT 
FUEL; GASOLINE; KEROSINE. 

A typical specific fuel consumption value for 
reciprocating engines and turboprops (in terms 
of equivalent shaft horsepower) is approximately 
0.5 dbm/h)/hp [0.3 (kg/h)/kW]. Gas turbines range 
between 0.6 dbm/h)/Ibf [0.06 (kg/h)/N] for turbo- 
fans and 0.8 (bm/h)/Ibf [0.08 (kg/h)/N] for turbo- 
jets. Ramjet engines with hydrocarbon fuel range 
upward from a minimum of about 2.0 dbm/h)/Ibf 
[0.20 (kg/h)/N] at flight speeds between Mach num- 
bers 2 and 4. Values of specific fuel consumption 
for ramjets and scramjets with hydrogen fuel be- 
ginning at Mach 4 trend from approximately 0.9 to 
2.4 (bm/h)/Ibf [0.09 to 0.24 (kg/h)/N] at Mach 16 
and beyond. J. Preston Layton 

Bibliography. D. G. Shepherd, Aerospace Propul- 
sion, 1972. 
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Specific heat 


A measure of the heat required to raise the tempera- 
ture of a substance. When the heat AQ is added to a 
body of mass m, raising its temperature by AT, the 
ratio C given in Eq. (1) is defined as the heat capacity 


AQ 


c= 
AT 


a) 


of the body. The quantity c defined in Eq. (2) is called 
C 1 AQ 


m mAT 


2 


the specific heat capacity, following the recommen- 
dation by the International Union of Pure and Ap- 
plied Physics (UPAP). However, this quantity is also 
often referred to with the shorter name specific heat. 
A commonly used unit for heat capacity is joule - 
kelvin™! (J - K~}; for specific heat capacity, the unit 
joule - gram~! - K~' (J. g-! - K7) is often used. Joule 
should be preferred over the unit calorie = 4.18 J. As 
a unit of specific heat capacity, Btu - Ib~! - °F! = 
4.21 J - g 1 - K”! is also still in use in English- 
language engineering literature. If the heat capac- 
ity is referred to the amount of substance in the 
body, the molar heat capacity c,, results, with the unit 


Specific heat 
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Specific heat of solids 


J- mol"! - K~!. The IUPAP recommends distinguish- 
ing this quantity from the specific heat capacity c 
through the subscript m, although this recommenda- 
tion is usually disregarded in an attempt to conserve 
subscripts. 

If the volume of the body is kept constant as the 
energy AQ is added, the entire energy will go into 
raising its temperature. If, however, the body is kept 
at a constant pressure, it will change its volume, 
usually expanding as it is heated, thus converting 
some of the heat AQ into mechanical energy. Con- 
sequently, its temperature increase will be less than 
if the volume is kept constant. It is therefore nec- 
essary to distinguish between these two processes, 
which are identified with the subscripts V (con- 
stant volume) and p (constant pressure): Cy, cy, and 
Cp, Cp. For gases at low pressures, which obey the 
ideal gas law, the molar heat capacities differ by R, 
the molar gas constant, as given in Eq. (3) where 


Cp — cy = R 6) 


R= 8.31]J- mol"! - K~}; that is, the expanding gas 
heats up less. For monatomic gases, for example, 
Cp = */2R, considerably smaller than cy = */2R. 

For solids, the difference between c, and cy is 
of the order of 1% of the specific heat capaci- 
ties at room temperature, and varies roughly lin- 
early with temperature. It results from the ther- 
mal expansion. This small difference can often be 
ignored. See CALORIMETRY; CHEMICAL THERMODY- 
NAMICS; HEAT CAPACITY; SPECIFIC HEAT OF SOLIDS; 
THERMODYNAMIC PROCESSES. Robert O. Pohl 

Bibliography. International Union of Pure and Ap- 
plied Physics, Document U.I.P. 20, 1978; G. N. Lewis 
and M. Randall, Thermodynamics, 2d ed., 1961; 
E W. Sears and G. L. Salinger, Thermodynamics, 
Kinetic Theory, and Statistical Mechanics, 3d ed., 
1975; K. Wark and D. E. Richards, Thermodynamics, 
6th ed., 1999; M. W. Zemansky and R. H. Dittman, 
Heat and Thermodynamics, 7th ed., 1996. 
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Specific heat of solids 


The specific heat (short for specific heat capacity) 
of a solid is the amount of heat required to increase 
the temperature of a unit mass of the solid by a 
unit amount. It is determined by the vibrations of 
its atoms or excitations of its electrons, and also by 
a variety of phase transitions. See SPECIFIC HEAT. 

Contribution of atomic vibrations. If a solid object, 
say a piece of rock salt, is said to have a certain tem- 
perature, it means that its constituent atoms per- 
form motions around their equilibrium positions. 
The larger the amplitude of these motions, the larger 
the temperature. 

Classical case. In the classical picture of this thermal 
motion, each atom is considered to be a harmonic, 
three-dimensional oscillator with six degrees of free- 
dom, three of kinetic and three of potential energy. 
Each degree of freedom contains on average the en- 
ergy (1/2)kgT, where T is the absolute temperature 


[measured in kelvins (K)] and kg is Boltzmann’s con- 
stant (kg = 1.38 x 107” J- K~}). Thus, an object con- 
taining 1 mole or N4 = 6.02 x 10” atoms, contains 
the thermal energy Q in the form of atomic vibrations 
given by Eq. (1), and has the molar heat capacity c, 
given by Eq. (2), where R is the molar gas constant, 


O = 6(1/2)RT qd) 
cy =3R Q) 


and R = Nakg = 8.31J- mol”! - K~!. The molar heat 
capacity can be converted to specific heat capacity 
by dividing the molar heat capacity by the mass of 
1 mole of the substance. Equation (2) is called the 
rule of Dulong and Petit, proposed in 1819 on the 
basis of measurements near room temperature. See 
DEGREE OF FREEDOM (MECHANICS); KINETIC THEORY 
OF MATTER; TEMPERATURE. 

Quantized atomic vibrations. When H. E Weber (1875) 
extended the measurements over a wide tempera- 
ture range (220 to 1300 K or —84 to 1880°F), he 
found that Eq. (2) holds only in the limit of high tem- 
peratures. Figure 1 shows measurements on cop- 
per, extended to even lower temperatures: at ~20 K 
(—424°F) the specific heat capacity has dropped to 
~3% of its room-temperature value which agrees 
with the value based on the rule of Dulong-Petit; 
the arrow on the right ordinate of Fig. 1 indicates 
the molar heat capacity of 3R. The explanation for 
the decrease at low temperatures was offered by A. 
Einstein (1907). He suggested that the atomic vibra- 
tions are quantized; the atoms cannot vibrate with 
continuously variable amplitude (or energy). Rather, 
the atoms can only have energies varying in inte- 
ger steps of bv, where hb is Planck’s constant, b = 
6.63 x 10-3J - s, and v is the classical frequency of 
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Fig. 1. Specific heat capacity for copper (data points), and 
predictions according to theories of Einstein and Debye 
(curves), on logarithmic scales. Above ~300 K (80°F), the 
specific heat capacity approaches the value based on the 
rule of Dulong-Petit, which for copper is J-g~'- K~'. The 
molar heat capacity of 3R for copper (molecular weight 
63.546) is indicated. 










1 

107 

107 & glass, @7=500 K 
pe 

10° 

104 


105 crystal, ©p=550 K 


specific heat capacity, J- g-!-K-! 


10-6 


107 


108 = 








| | | 
0.1 1.0 10 100 500 


temperature, K 





Fig. 2. Specific heat capacity of silicon dioxide (SiOz) in its 
crystalline and amorphous (glass) phases. The Debye 
characteristic temperatures Op, determined from elastic 
constants, are so similar that the Debye specific heat 
prediction is shown as a single curve for both solids. 


vibration. As the temperature decreases, an increas- 
ing fraction of the atoms will not be excited at all; 
they will be frozen out. This leads to an exponen- 
tial decrease of the specific heat capacity. Figure 1 
shows Einstein’s fit to the data, which he obtained 
by using the frequency vg = 4.6 x 10! s7! CEin- 
stein frequency for copper) as an adjustable param- 
eter. Through this work, the validity of the quantum 
concept was demonstrated for the motion of mate- 
rial particles. See PLANCK’S CONSTANT; QUANTUM ME- 
CHANICS. 

Quantized elastic waves. In 1911, Einstein realized a 
shortcoming in his theory: not only did it lead 
to a drop-off of the specific heat capacity at low 
temperatures faster than observed experimentally 
(Fig. 1), but it also led to a thermal conductivity 
of the wrong magnitude and temperature depen- 
dence. He also realized the cause for this problem: 
he had assumed that each atom vibrates indepen- 
dently of its neighbors; however, every atom, as it 
vibrates, will push and pull on its neighbors, and this 
will lead to a coupling between neighboring atoms. 


Specific heat of solids 


M. Born and T. von Karmam (1912) showed that 
this coupling will lead to elastic waves, which will 
propagate with the velocity of sound through the 
solid. Using the analogy with electromagnetic waves, 
P Debye (1912) showed that these elastic waves 
lead to a slower temperature dependence of the spe- 
cific heat capacity at low temperatures. According to 
Debye, the molar heat capacity is given at low tem- 
peratures (T < 0.10 p) by Eq. G), where Op is called 


TN? 
c= 24n (2) 6) 





the Debye characteristic temperature and is deter- 
mined by the elastic constants of the solid, which 
are measured. 

Figure 1 shows the specific heat capacity of cop- 
per predicted by Debye’s theory. Another compar- 
ison is shown in Fig. 2, for crystalline quartz. The 
agreement below 10 K (—442°F) for quartz is excel- 
lent and demonstrates the validity of the theoretical 
model. The discrepancy between theory and experi- 
ment above this temperature is also well understood: 
Not all the elastic waves in quartz have the same 
speeds of sound; the high-frequency ones, which 
will be excited predominantly at high temperatures, 
are particularly variable. This variation of the speeds 
of sound leads to variations in the number density 
of the elastic waves that can be thermally excited, 
and hence will lead to variations in the specific heat. 
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Fig. 3. Molar heat capacity at constant pressure (Cp) of 
aluminum. The specific heat capacity is obtained by 
dividing the molar heat capacity by the molecular weight of 
aluminum (26.982). 
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Specific heat of solids 


See LATTICE VIBRATIONS; THERMAL CONDUCTION IN 
SOLIDS. 

Difference of specific heats. While in gases the differ- 
ence between the specific heat capacity at constant 
pressure and that at constant volume is significant, 
in solids the difference is relatively small. Therefore, 
the specific heat is usually measured under constant 
pressure p (usually p = 0, vacuum). In the exam- 
ple shown in Fig. 3, the molar heat capacity under 
constant pressure (cp) of aluminum above room tem- 
perature does not saturate at the value given by the 
Dulong-Petit rule, which is strictly valid only for c,; 
the gradual increase of cp above room temperature 
is the result of the thermal expansion of the solid. 
Above 700 K (800°F), there is also a sudden increase 
of the specific heat as the melting point T,,, (933.2 
K or 1220.1°F) is approached. See THERMAL EXPAN- 
SION. 

Electronic specific heat. A typical metal like cop- 
per or aluminum contains roughly as many free elec- 
trons as atoms. According to classical theory, each 
electron should contain a thermal energy of ('/)kgT 
for each degree of freedom, three in the case of 
translational motion. It is, therefore, remarkable that 
the Debye theory yields such excellent agreement 
with the experiment for copper, a metal (Fig. 1, 
considering that this theory completely ignores an 
electronic contribution. In fact, free electrons are 
noticed only at lower temperatures. In aluminum, 
the lattice specific heat according to Debye’s the- 
ory, shown in Fig. 3, agrees well with the measure- 
ments above 20 K (—424°F). Below 1 K (—458°F), the 
experimental molar heat capacity approaches a lin- 
ear temperature dependence, also shown in Fig. 3, 
that is given by Eq. (4). At 1.163 K, aluminum be- 

Cp = 1.35 x 103 2_r (4) 

mol - K? 

comes superconducting: its electrical resistivity sud- 
denly drops to zero. By applying a magnetic field 
(~100 gauss or 0.01 tesla), this transition can be sup- 
pressed, and the linear temperature dependence of 
the electronic specific heat can be observed to the 
lowest temperatures. The electronic specific heat in 
the normal (that is, nonsuperconducting) state can 
be understood on the basis of the Pauli exclusion 
principle: each quantum state can be occupied by at 
most one electron, and thus only electrons occupy- 
ing the uppermost energy states have unoccupied 
energy states nearby, to which they can move by 
picking up thermal energy. According to a theory 
due to A. Sommerfeld, this leads to a molar specific 
heat given by Eq. (5), where 7; is a characteristic 


m? T 


= = ©) 


Cy = 


temperature called the Fermi temperature, which is 
of the order of 10° K. In deriving this theory, it has 
been assumed that each atom in the metal supplies 
one conduction electron. See EXCLUSION PRINCIPLE; 
FERMI-DIRAC STATISTICS; FREE-ELECTRON THEORY OF 
METALS; SUPERCONDUCTIVITY. 


When the metal sample is cooled in the absence of 
a magnetic field below its superconducting transition 
temperature T, (1.163 K in aluminum), the specific 
heat rises abruptly as the electronic energy states 
are rearranged. Upon further cooling, the specific 
heat decreases exponentially. However, even at 0.2 K 
the specific heat of the electrons exceeds that of 
the atomic vibrations in aluminum (according to the 
Debye theory) by at least tenfold. 

Specific heat of phase transitions. The transition 
from the normal to the superconducting state shown 
in Fig. 3 is an example of a phase transition occurring 
among the metal electrons. Other phase transitions 
in the solid state are, for example, those from mag- 
netic disorder (paramagnetic) to magnetic order 
(ferro-, ferri-, or antiferromagnetic), or from electric 
dipolar disorder (paraelectric) to an ordered phase 
(for example, ferroelectric). The transition from an 
ordered to a disordered phase ordinarily occurs at 
a critical temperature 7, near which fluctuations 
between regions of order and disorder cause an in- 
crease of the specific heat capacity. Thus a graph 
of specific heat capacity versus temperature 7 has 
a sharp peak at T = T,. See CRITICAL PHENOMENA; 
PHASE TRANSITIONS. 

While much progress has been achieved in the un- 
derstanding of the approach to phase transitions in 
the solid state, including the specific heat near the 
critical temperature 7,, the solid-liquid phase transi- 
tion (for example, the rapid rise of the specific heat 
of aluminum near the melting point, shown in Fig. 3) 
is still only poorly understood. 

Amorphous solids. In contrast to crystals, these 
solids lack long-range order. Their structure resem- 
bles that of frozen-in liquids. The specific heat of 
these solids exceeds that of the Debye prediction, 
even if they are electrical insulators, notably at the 
lowest temperatures; below 1 K, it varies approxi- 
mately linearly with the temperature and has almost 
the same magnitude (to within a factor of 10) in all 
amorphous solids. (in amorphous metals, the low- 
temperature specific heat will be increased further 
by the contribution by the electrons.) Figure 2 shows 
a comparison of the specific heat capacity of the 
electric insulator silicon dioxide (SiO2) in the amor- 
phous (silica) and the crystalline (œ-quartz) phases. 
The Debye theory predicts nearly the same heat ca- 
pacity for both phases, which is therefore shown as 
a single curve. Obviously, the Debye model is only 
a poor approximation in the amorphous phase. The 
physical nature of the excitations giving rise to the 
linear specific heat anomaly is not yet understood. A 
particular challenge is the universality of these exci- 
tations. See AMORPHOUS SOLID. Robert O. Pohl 

Bibliography. N. W. Ashcroft and N. D. Mermin, 
Solid State Physics, 1976; A. Cezairliyan (ed.), 
Specific Heat of Solids, 1988; E. S. R. Gopal, Specific 
Heats at Low Temperatures, 1966; C. Kittel, Intro- 
duction to Solid State Physics, 8th ed., 2005; Y. S. 
Touloukian et al. (eds.), Thermophysical Properties 
of Matter, vols. 4-6, 1970; G. K. White and P. J. 
Meeson, Experimental Techniques in Low- 
Temperature Physics, áth ed., 2002. 
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Specific impulse 
The impulse produced by a rocket divided by the 
mass m, of propellant consumed. Specific impulse 
Isp isa widely used measure of performance for chem- 
ical, nuclear, and electric rockets. It is usually given 
in seconds for both U.S. Customary and International 
System (SD units, but this is not strictly correct. 
The impulse produced by a rocket is the thrust 
force F times its duration ¢ in seconds. The spe- 
cific impulse is given by Eq. (1). Its equivalent, 


Ip = — qd) 


specific thrust F,,, that is sometimes used alterna- 
tively, is the rocket thrust divided by the propellant 
mass flow rate F/mp,. See IMPULSE (MECHANICS), 
THRUST. 

The rationale for using seconds as the units of spe- 
cific impulse is the lack of distinction in U.S. Cus- 
tomary usage between pound-force (dbf) and pound- 
mass (bm) that are numerically equivalent, so that 
cancellation leaves units of seconds; however, this 
procedure cannot be justified in SI units, where the 


Specific impulse 


newton is the base unit of force and the kilogram the 
unit of mass, so that the unit of specific impulse is 
N-s/kg. See UNITS OF MEASUREMENT. 

Usage of the quantity effective jet velocity Vje is 
preferred to specific impulse, since it is a more fun- 
damental and better conceived measure of perfor- 
mance. It is related to specific impulse by Eq. (2), 


Vie = cly @) 


where g,, the standard gravitational acceleration, is 
32.18 ft/s? or 9.807 m/s’, to give Vj. in units of ft/s 
and m/s, respectively. 

Effective jet velocity for thermal rockets can be cal- 
culated from Eq. (3), where R is the universal gas con- 
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stant, k is the propellant specific heat ratio, T, is the 
maximum propellant temperature, M is the molec- 
ular mass of the propellants, and p,/p, is the ratio 
of nozzle exit to chamber pressure. The importance 
of high propellant temperature and low molecular 
mass of the propellant is shown by this equation. 
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Nominal rocket performance characteristics of typical space propulsion systems 





Specific impulse Effective jet velocity 








Type Propellants Ibf-s/lbm N-s/kg ft/s m/s Status* Examples 
Thermal rockets 
Cold gas rockets Nitrogen 50 490 1,609 490 O Manned Maneuvering Unit 
(MMU) 
Chemical rockets 
Solid propellant Ammonium 290 2,844 9,332 2,844 O Minuteman 
perchlorate/organic Space shuttle boosters 
polymers and powdered Titan boosters 
aluminum 
Liquid propellant 
Monopropellant Hydrazine 230 2,256 7,401 2,256 O Attitude control of 
communication satellites 
Bipropellant Nitrogen tetroxide/monomethyl 310 3,040 9,976 3,040 O Titan launch vehicle 
hydrazine (hydrazine) (315) (3,089) (10,137) (3,089) 
Oxygen/kerosine (RP-1) 350 3,432 11,260 3,432 O v-2t 
Oxygen/hydrogen 450 4,413 14,480 4,413 O Space shuttle main engine 
Nuclear rockets Hydrogen 900 8,826 28,960 8,826 R&T 
Electric rockets 
Electrothermal rockets 
Resistojet Hydrazine 300 2,942 9,654 2,942 O Station keeping for Satcom 
communications satellite 
Ammonia 500 4,904 16,090 4,904 R&T 
Hydrogen 700 6,865 22,530 6,865 R&T 
Arcjet Hydrazine 500 4,904 16,090 4,904 R&T 
Ammonia 830 8,140 26,710 8,140 R&T 
Hydrogen 1,250 12,260 40,220 12,250 R&T 
Electromagnetic rockets 
Pulsed plasma Poly(tetrafluoroethylene) 1,000 9,807 32,180 9,807 oO Orbit control for Nova 
navigation satellite 
Argon 1,500 14,710 48,270 14,710 R&T 
Magnetoplasmadynamic 
Electrostatic rockets 
lon Mercury, argon, xenon 2,000 to 19,610 to 64,360 to 19,610 to ST&E 
6,000 58,840 193,100 58,840 








t No longer used. 


*O = operational; R&T = research and technology; ST&E = space test and evaluation. 
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Calculation of specific impulse for the various 
forms of electric rockets involves electrothermal, 
resistance or arc heating of the propellant or its 
ionization and acceleration to high jet velocity by 
electrostatic or electromagnetic body forces. Ions 
in the exhaust jets of these devices must be neu- 
tralized so the spacecraft will not suffer from space 
charging or other effects from the plumes of the 
devices’ operation. Nominal values of performance 
of typical space propulsion systems with various 
types of rockets and their propellants are given in 
the table. See ELECTROTHERMAL PROPULSION; ION 
PROPULSION; PLASMA PROPULSION; ROCKET PROPUL- 
SION; SPACECRAFT PROPULSION. J. Preston Layton 

Bibliography. G. P Sutton, Rocket Propulsion Ele- 
ments, 7th ed., Wiley, 2001. 
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Speckle 


The generation of a random intensity distribution, 
called a speckle pattern, when light from a highly 
coherent source, such as a laser, is scattered by a 
rough surface or inhomogeneous medium. Although 
the speckle phenomenon has been known since the 
time of Isaac Newton, the development of the laser 
is responsible for the present-day interest in speckle. 
Speckle has proved to be a universal nuisance as far 
as most laser applications are concerned, and only 
in the mid-1970s did investigators turn from the un- 
wanted aspects of speckle toward the uses of speckle 
patterns, in a wide variety of applications. See LASER. 

Basic phenomenon. Objects viewed in coherent 
light acquire a granular appearance. The detailed irra- 
diance distribution of this granularity appears to have 
no obvious relationship to the microscopic proper- 
ties of the illuminated object, but rather it is an irreg- 
ular pattern that is best described by the methods of 
probability theory and statistics. Although the math- 
ematical description of the observed granularity is 
rather complex, the physical origin of the observed 
speckle pattern is easily described. The surfaces of 
most materials are extremely rough on the scale of 
an optical wavelength (approximately 5 x 1077 m). 
When nearly monochromatic light is reflected from 
such a surface, the optical wave resulting at any 
moderately distant point consists of many coherent 
wavelets, each arising from a different microscopic 
element of the surface. Since the distances traveled 
by these various wavelets may differ by several wave- 
lengths if the surface is truly rough, the interference 
of the wavelets of various phases results in the gran- 
ular pattern of intensity called speckle. If a surface 
is imaged with a perfectly corrected optical system 
(Fig. 1), diffraction causes a spread of the light at an 
image point, so that the intensity at a given image 
point results from the coherent addition of contribu- 
tions from many independent surface areas. As long 
as the diffraction-limited point-spread function of the 
imaging system is broad by comparison with the mi- 
croscopic surface variations, many dephased coher- 
ent contributions add at each image point to give a 
speckle pattern. 


The basic random interference phenomenon un- 
derlying laser speckle exists for sources other than 
lasers. For example, it explains radar “clutter,” re- 
sults for scattering of x-rays by liquids, and electron 
scattering by amorphous carbon films. Speckle the- 
ory also explains why twinkling may be observed for 
stars, but not for planets. See COHERENCE; DIFFRAC- 
TION; INTERFERENCE OF WAVES; TWINKLING STARS. 

Applications. The principal applications for speck- 
le patterns fall into two areas: metrology and astro- 
nomical speckle interferometry. 

Metrology. In the metrology area, the most obvious 
application of speckle is to the measurement of sur- 
face roughness. If a speckle pattern is produced by 
coherent light incident on a rough surface, then 
surely the speckle pattern, or at least the statistics of 
the speckle pattern, must depend upon the detailed 
surface properties. If the surface root-mean-square 
roughness is small compared to one wavelength of 
the fully coherent radiation used to illuminate the 
surface, the roughness can be determined by mea- 
suring the speckle contrast. If the root-mean-square 
roughness is large compared to one wavelength, 
the radiation should be spatially coherent poly- 
chromatic, instead of monochromatic; the rough- 
ness is again determined by measuring the speckle 
contrast. 

An application of growing importance in engineer- 
ing is the use of speckle patterns in the study of 
object displacements, vibration, and distortion that 
arise in nondestructive testing of mechanical compo- 
nents. The key advantage of speckle methods in this 
case is that the speckle size can be adjusted to suit the 
resolution of the most convenient detector, whether 
it be film or a television camera, while still retaining 
information about displacements with an accuracy 
of a small fraction of a micrometer. Although sev- 
eral techniques for using laser speckle in metrology 
are available, the basic idea can be explained by the 
following example for measuring vibration. A vibrat- 
ing surface illuminated with a laser beam is imaged 
(Fig. 1). A portion of the laser beam is also superim- 
posed on the image of the vibrating surface. If the 
object surface is moving in and out with a travel of 
one-quarter of a wavelength or more, the speckle 
pattern will become blurred out. However, for areas 
of the surface that are not vibrating (that is, the nodal 
regions) the eye will be able to distinguish the fully 
developed high-contrast speckle pattern. 

Electronic speckle interferometry (ESPD is find- 
ing much use in industrial applications involving 
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Fig. 1. Physical origin of speckle for an imaging system. 





Fig. 2. Electronic speckle interferogram of vibrating object. 


measurement of the vibrational properties of struc- 
tures. In this technique, a television camera is used 
to view a speckle pattern. The output of the tele- 
vision camera is processed electronically to empha- 
size the high spatial frequency structure. The result 
is viewed on a television monitor (Fig. 2). The bright 
fringes correspond to the region of the object which 
are at the nodes of the vibration pattern. By know- 
ing the angles at which the object is illuminated and 
viewed, it is possible to calculate the amplitude of the 
vibration from the number of dark fringes present. 
Thus, simply by looking at the television monitor it 
is possible to determine the vibrational pattern over 
the entire object. The amplitude and frequency of 
the drive source can be changed and instantly the 
new vibrational pattern can be observed. See NON- 
DESTRUCTIVE EVALUATION. 

Speckle patterns can also be used to examine the 
state of refraction of the eye. If a diffusing surface il- 
luminated by laser light moves perpendicular to the 
line of sight of an observer, the speckles may ap- 
pear to move with respect to the surface. For a nor- 
mal eye, movement in a direction opposite to that 
of the surface indicates underaccommodation, and 
movement with the surface indicates overaccommo- 
dation. If the speckles do not move, but just seem to 
“boil,” the observed surface is imaged on the retina. 

James C. Wyant 

Astronomical speckle interferometry. Astronomical spec- 
kle interferometry is a technique for obtaining 
spatial information on astronomical objects at the 
diffraction-limited resolution of a telescope, despite 
the presence of atmospheric turbulence. 

In the absence of turbulence, a telescope of aper- 
ture D operating at wavelength à would produce 
a diffraction-limited image of a point source of an 
angular size œ = 1.22 4/D (the Rayleigh criterion). 
When observed through an Earth-based telescope, 
the image of a source in space is corrupted by dis- 
tortions introduced during propagation of its light 
through the Earth’s atmosphere. These distorting ef- 
fects result from variations in the density, temper- 
ature, and index of refraction of the atmosphere, 
and give a blurred image, or seeing disk, which is 
independent of telescope aperture and seldom less 
than 1-2 arc-seconds in diameter. A sub-arc-second 
seeing disk is observed only under exceptionally 


good seeing conditions, at the most favorable high- 
altitude telescope locations, when the telescope en- 
vironment is characterized by uniform temperature 
and nonturbulent conditions. Even in these optimal 
conditions, the seeing disk is dozens of times larger 
than the diffraction-limited angular size of the point 
source produced by a telescope of several meters’ 
aperture. For example the diffraction-limited angular 
size of a point source produced at 4 = 500 nanome- 
ters with a 157-in. (4-m) telescope is approximately 
0.03 arc-second, and the seeing disk obtained at the 
very best observing conditions is at least 10 times 
larger. 

At very short exposures (10-20 milliseconds), 
astronomical images consist of a fine-scale struc- 
ture that fills the seeing disk, very similar to the 
speckle structures observed with laser-illuminated 
diffusers. These individual speckles contain spatial 
frequency information up to the diffraction limit of 
the telescope. Speckle interferometry techniques ex- 
tract this information, and therefore reach spatial 
detail down to the diffraction-limited resolution of 
the telescope. In this process, long sequences of 
short-exposure images (each exposure a few millisec- 
onds in duration) are recorded through magnifying 
optics and narrow-band filters by using detectors 
such as charge-coupled devices (CCDs) and photon- 
counting devices. These images are then combined 
by using various computational algorithms to recon- 
struct a single image with diffraction-limited reso- 
lution. The number of images required for conver- 
gence of speckle image reconstruction algorithms 
increases as the fourth power of the atmospheric see- 
ing. See CHARGE-COUPLED DEVICES; IMAGE PROCESS- 
ING. 

Speckle interferometry techniques have proven to 
be an invaluable tool for astronomical research, al- 
lowing studies of a wide range of scientifically inter- 
esting problems. They have been widely used to de- 
termine the separation and position angle of binary 
stars, and for accurate diameter measurements of a 
large number of stars, planets, and asteroids. Speckle 
imaging techniques have successfully uncovered de- 
tails in the morphology of a range of astronomical 
objects, including the Sun, planets, asteroids, cool gi- 
ants and supergiants, young stellar objects, the super- 
nova SN1987A in the Large Magellanic Cloud, Seyfert 
galaxies, and quasars. See BINARY STAR; INTERFEROM- 
ETRY. Margarita Karovska 
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Wykes, Holographic and Speckle Interferometry, 
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[a 
Spectral type 

A label used to indicate the physical and chemical 
characteristics of a star, as indicated by study of 
the star’s spectrum. Stars possess a remarkable va- 
riety of spectra, some simple, others extraordinarily 


Spectral type 


217 


218 


Call 
[ 
K H-He Hel 


BO lae itd 





a E e 
wT tke 


A5 III ne 


F2IV E wll 
i E 
maven" mmr” m n 


GS Ill 
Go Ill 





rie 
10 A 
T || 


Spectral type 


Ii 


Hy Hel oH Hell HE 
al | E e Orionis 
| i 19 Tauri 
l Sirius 
ve lie | B Trianguli 
| IE Geminorum 
Procyon 


| 1 Capella 
| 


Cal Gi Hg 
1 K Geminorum 


i abe |l jj) Arcturus 
| Bi Aldebaran 


Nh Š 
w a 
g oT ee 


"D HB 


Call 


Fig. 1. Spectral sequence from B to M. (University of Michigan) 


complex. To understand the natures of the stars, it 
was first necessary to bring order to the subject and 
to classify the spectra. 

Standard sequence. Based in part on earlier work 
by A. Secchi, who had developed a system that corre- 
lated with stellar color, W. C. Pickering initiated the 
modern system starting about 1890. He ordered the 
spectra by letter, A through O, largely on the basis of 
the strengths of the hydrogen lines. He and Mrs. W. P. 
Fleming then found that several of the letters were 
unneeded or redundant. On the basis of continu- 
ity of lines other than hydrogen, A. Maury and A. J. 
Cannon found that B preceded A and O preceded 
B. The result is the classical spectral sequence, 
OBAFGKM (Fig. 1). Cannon also decimalized the 
classes, setting up the sequence O5, ..., O9, 
BO, ..., B9, AO, and so forth. (Not all the numbers 
are used, however.) The modern Harvard sequence, 
after the observatory where it was formulated, runs 
from O2 to M9. Over 350,000 stars, classified by Can- 
non, were included in the final result, the Henry 
Draper Catalog. Classes L and T were added to the 
end of the sequence in 1999. See ASTRONOMICAL 
CATALOGS. 

Class A has the strongest hydrogen lines, B is 
characterized principally by neutral helium (with 
weaker hydrogen), and O by ionized helium. 


Hydrogen weakens notably through F and G, but 
the metal lines, particularly those of ionized cal- 
cium, strengthen. In K, hydrogen becomes quite 
weak, while the neutral metals grow stronger. The 
M stars effectively exhibit no hydrogen lines at all, 
but are dominated by molecules, particularly tita- 
nium oxide (TiO). L stars are dominated by metal- 
lic hydrides and neutral alkali metals, while T is de- 
fined by methane, ammonia, and water. At G, the 
sequence branches into R and N, whose stars are 
rich in carbon molecules; class S, in which the tita- 
nium oxide molecular bands of class M are replaced 
by zirconium oxide (ZrO), was introduced in 1923 
(Fig. 2). Classes R and N are commonly combined 
into class C. 

Spectral variation with temperature. At first appear- 
ance, the different spectral types seem to reflect dif- 
ferences in stellar composition. However, within the 
sequence OBAFGKMILIT the elemental abundances 
are roughly similar. The dramatic variations seen 
in Fig. 1 are the result of changes in temperature 
(Fig. 3). The strength of an ion’s absorption lines 
depends on the number of electrons capable of ab- 
sorbing at those wavelengths, which in turn depends 
on the product of the atomic absorption probability, 
the number of electrons excited to the appropriate 
level of the ion, and the number of ions. In classes 
M and L (which divide near 2000 K), the tempera- 
ture is too low to allow a significant number of elec- 
trons in the second level of hydrogen, from which 
the optical Balmer series arises. As a result, there are 
no hydrogen lines. As temperature climbs and colli- 
sional activity increases, more electrons are knocked 
into level 2 and the Balmer lines strengthen from K 
through A. Above class A, however, the temperature 
becomes so high that the hydrogen becomes ion- 
ized. That ionization and changes in the opacity of 
the stellar atmosphere cause the hydrogen lines to 
weaken. 

The other ions are subject to the same kind of vari- 
ation. The second level of neutral helium lies at a very 
high energy. As a result, the helium lines do not be- 
come visible until class B. It also takes a great deal of 
energy to ionize helium, so the ionized helium lines 
are not observed until nearly class O. Ionized metal 
lines are produced at intermediate temperatures and 
neutral metal lines at low temperatures. Molecules, 
which are relatively fragile, cannot exist unless the 
temperature is relatively low. 

Spectral variation with composition. The different 
spectra of the R, N, and S stars, however, are caused 
by true and dramatic variations in the chemical com- 
position. The M stars have more oxygen than carbon. 
In the N stars, which have similar temperatures, the 
ratio is reversed and carbon dominates oxygen. The 
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Fig. 2. Spectral classes. The stars in the sequence O 
through T have common chemical compositions. Those in 
the side branches are rich in carbon. 
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Fig. 3. Variation in line strength for different ions as a function of temperature and spectral class. In class L (off the right end 
of the graph), TiO disappears and is replaced by metallic hydrides. (After H. Voigt, ed., Landolt-Bornstein Numerical Data and 
Functional Relationships in Science and Technology, Group VI, vol. 1: Astronomy and Astrophysics, Springer-Verlag, 1965) 


R stars are carbon-rich versions of classes G and K. In 
the cool S stars, carbon and oxygen have about the 
same abundance. That condition and the increased 
abundance of zirconium (which has a greater affin- 
ity for oxygen than titanium) yield strong zirconium 
oxide lines. These composition variations are the re- 
sult of internal thermonuclear processing and con- 
vection. See STELLAR EVOLUTION. 

Morgan-Keenan-Kellman system. In the 1940s, 
W. W. Morgan, P. C. Keenan, and E. Kellman expanded 
the Harvard sequence to include luminosity. A sys- 
tem of roman numerals is appended to the Harvard 
class to indicate position on the Hertzsprung-Russell 
diagram: I for supergiant, II for bright giant, II for 
giant, IV for subgiant, and V for dwarf or main se- 
quence. The Morgan-Keenan-Kellman class, which 
includes the refined Harvard type, is defined by a 
set of standard stars in the MKK atlas, to which a 
program star must be compared. On this system, the 
Sun is a G2 V star. Though some dwarfs are carbon- 
rich, the R, N, and S stars are all giants. L stars are all 
dwarfs and brown dwarfs, and T stars are all brown 
dwarfs. Additional refinements have been added to 
indicate a variety of spectral peculiarities. See ASTRO- 
NOMICAL SPECTROSCOPY; HERTZSPRUNG-RUSSELL DI- 
AGRAM; STAR. James B. Kaler 
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[EE 
Spectrograph 

An optical instrument that consists of an entrance 
slit, collimator, disperser, camera, and detector and 
that produces and records a spectrum. A spectro- 


graph is used to extract a variety of information about 
the conditions that exist where light originates and 
along its paths. It does this by revealing the details 
that are stored in the light’s spectral distribution, 
whether this light is from a source in the laboratory 
or a quasistellar object a billion light-years away. 
Proper spectrograph design takes into account the 
type of light sources to be measured, and the circum- 
stances under which these measurements will be 
made. Since observational astronomy presents un- 
usual problems in these areas, the design of astro- 
nomical spectrographs may also be unique. 
Astronomical versus laboratory design. Astronomi- 
cal spectrographs have the same general features as 
laboratory spectrographs (Fig. 1). The width of the 
entrance slit influences both spectral resolution and 
the amount of light entering the spectrograph, two 
of the most important variables in spectroscopy. The 
collimator makes this light parallel so that the dis- 
perser (a grating or prism) may properly disperse 
it. The camera then focuses the dispersed spectrum 
onto a detector, which records it for further study. 
Laboratory spectrographs usually function prop- 
erly only in a fixed orientation under controlled 
environmental conditions. By contrast, most astro- 
nomical spectrographs are used on a moving tele- 
scope operating at local temperature. Thus, their 
structures must be mechanically and optically insen- 
sitive to orientation and temperature. 
The brightness, spectral characteristics, and ge- 
ometry of laboratory sources may be tailored to 
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Fig. 1. Basic optical components of a spectrograph. 
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Fig. 2. Typical laboratory spectrographs. (a) Classical concave grating spectrograph. (b) Czerny-Turner plane grating 


spectrograph. 


experimental requirements and to the capabilities of 
a spectrograph. Astronomical sources, in the form 
of images at the focus of a telescope, cannot be ma- 
nipulated, and their faintness and spectral diversity 
make unusual and difficult demands on spectrograph 
performance. 

Typical laboratory spectrographs use either con- 
cave gratings, which effectively combine the func- 
tions of collimator, grating, and camera in one 
optical element, or plan reflection gratings with 
spherical reflectors for collimators and cameras. An 
example of each is shown in Fig. 2. These designs 
may severely constrain the optical characteristics of 
the collimator and camera, and sometimes the grat- 
ing, to obtain acceptable imaging at the detector. As 
a result, they are generally too limited in the choice 
of important spectrograph parameters for use in as- 
tronomy. In particular, they tend to preclude the use 
of very short cameras, an essential feature of spec- 
trographs intended for study of faint astronomical 
objects. The Czerny-Turner design (Fig. 2b) is used 
in some solar spectrographs, where it offers some 
advantages and ample light is available. See SPECTRO- 
HELIOGRAPH; SUN. 

To provide maximum flexibility in its application, 
the collimators, gratings, and cameras of an astro- 
nomical spectrograph must be relatively free of aber- 
rations when used in collimated light. Then the 
optical subassemblies can be interchanged without 
disturbing the spectrograph’s optical performance. 
Grating interchange changes spectral resolution and 
wavelength coverage with minimal effect on effi- 


ciency. Camera and detector interchange achieves 
operating characteristics that may be crucial for cer- 
tain applications. Examples of typical astronomical 
spectrographs are shown in Fig. 3. 

Details of astronomical spectrographs. Astronomi- 
cal spectrographs typically use a pair of adjustable 
polished slit jaws, which are tilted to the telescope 
beam so that unused light is reflected to guide op- 
tics for accurate image positioning. The collimator 
may be either an off-axis paraboloid or a two-mirror 
Cassegrain-type optical system. It has a relatively long 
focal length and an f/ratio matching that of the tele- 
scope’s for optimum slit transmission. 

The disperser is usually a plane reflection grating. 
In laboratory applications, larger gratings are gen- 
erally used for their increased spectral resolution. 
However, in astronomy the resolution of the grating 
(number of grooves times interference order) is sel- 
dom realized, and larger gratings are used to obtain 
longer collimator focal lengths in order to preserve 
or increase slit demagnification at the camera’s focus. 

The spectral resolution of many laboratory spec- 
trographs can be increased (grating permitting) only 
by decreasing the slit width. However, this decrease 
reduces available light and taxes the imaging capa- 
bilities of the optical system and the resolution ca- 
pabilities of the detector. A better approach, usually 
used in astronomical spectrographs, is to keep the 
slit width within a relatively small range of values, 
for a given camera focal length, and vary spectral 
resolution by using gratings having different angular 
dispersions. 
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Fig. 3. Typical astronomical spectrographs. (a) Spectrograph for use at the Cassegrain focus (often called a Cassegrain 
spectrograph) using a parabolic collimator and semisolid Schmidt camera. (b) Plan view of a coude spectrograph with four 
Schmidt cameras and four interchangeable gratings. Camera 1 is the shortest camera and camera 4 is the longest. The 


spectrograph is shown using camera 2. 


The cameras used in astronomy are often quite 
short (fast) to allow the entrance slit to be as wide as 
possible for maximum light transmission and to max- 
imize spectral coverage for a given detector size. The 
camera is often a variation of the classical Schmidt 
telescope, using a combination of mirrors and ultra- 
violet transmitting optics. Some spectrographs may 
have a selection of cameras to permit higher reso- 
lutions (with narrower slit openings) when there is 
sufficient light, or to improve imaging in some wave- 
length ranges. In particular, very large spectrographs, 
used at the coude or Nasmyth foci, may have an array 
of camera focal lengths (Fig. 3b). When this arrange- 
ment is combined with interchangeable gratings, it 
permits a large selection of spectral resolutions and 
wavelength coverages. See SCHMIDT CAMERA. 

In the past, the detector is astronomical spec- 
troscopy was almost always the photographic emul- 
sion. However, image intensifiers and, more recently, 
the very efficient and accurate cooled charge- 
coupled devices (CCDs) have become the preferred 
detectors. See CHARGE-COUPLED DEVICES; IMAGE 
TUBE (ASTRONOMY). 

Observing sites and telescopes. Spectrographs are 
particularly appropriate for use at poor observing 
sites because, although the atmosphere might re- 
duce available light, it has little effect on its spec- 
trum. Light pollution is also less of a problem, since 
dispersing the light both dilutes the intensity of the 
unwanted background and identifies the spectrum 
of this background so that it can be subtracted later. 

Astronomical spectrographs can be used effec- 
tively on smaller telescopes. The reason is that star 
images are typically larger than the entrance-slit 


width, and the amount of light passing through the 
slit is thus proportional to the diameter, not area, of 
a telescope’s aperture. Thus, the magnitude limit of 
a 50-in. (1.25-m) telescope will be only about 1 stel- 
lar magnitude fainter than that of a far less costly 
20-in. (0.5-m) telescope using the same spectro- 
graph. See ASTRONOMICAL SPECTROSCOPY. 
Ron Hilliard 
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Spectroheliograph 

A spectrographic instrument that produces mono- 
chromatic images of the Sun. The principle of oper- 
ation was first suggested by J. Janssen in 1869, anda 
photographic version was constructed by G. E. Hale 
at Mount Wilson Observatory in 1892. Around the 
same time, H. A. Deslandres devised a closely re- 
lated instrument for operation at l’Obervatoire de 
Paris, Meudon. In a simple form of the instrument, 
an image of the Sun from a solar telescope is focused 
on a plane containing the entrance slit of the spec- 
troheliograph (see illustration). The light passing 
through the slit is collimated by a concave mirror 
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Optical configuration of a simple spectroheliograph. 


that is tilted such that the light is incident on a plane 
diffraction grating. Part of the dispersed light from 
the grating is focused by a second concave mirror, 
identical to the first mirror, at an exit slit identical to 
the entrance slit. By symmetry of the optical system, 
the portion of the solar disk imaged on the entrance 
slit is reimaged in the plane of the exit slit with the 
same image scale but in dispersed wavelength. The 
light imaged along the exit slit then corresponds to 
the portion of the solar image falling on the entrance 
slit, but in the light of only a narrow region of the 
spectrum, as determined by the spectrographic dis- 
persion. The particular wavelength sampled is set by 
the grating angle. By uniform transverse motion of 
the instrument such that the entrance slit is scanned 
across the solar image, the light passing through the 
exit slit maps out a corresponding monochromatic 
image of the Sun, which can be recorded photo- 
graphically with a stationary camera. Alternatively, 
the solar image can be scanned across the entrance 
slit of a stationary spectroheliograph, the camera 
then synchronously moved in step with the image. 
See DIFFRACTION GRATING. 

Narrow-bandpass tunable filters are also used to 
obtain monochromatic images of the Sun, with the 
advantage that the full image is recorded in one rela- 
tively short exposure. However, the transmittance of 
such filters is not as spectrally pure as that of a spec- 
troheliograph of the same nominal bandwidth, and 
extremely narrow passband filters have restricted an- 
gular fields. Spectroheliographs also have the advan- 
tage that the slit widths can be adjusted to match 
lines of different spectral widths or other specific 
observing requirements, or can be set for different 
image-quality observing conditions. A disadvantage 
is that the time required to build up an image of the 
full Sun with a spetroheliograph that uses a photo- 
graphic film-type camera can be many minutes, dur- 
ing which time critical solar features observed might 
change. See INTERFERENCE FILTER. 

Digital recording has greatly expanded the perfor- 
mance capabilities, versatility, and range of applica- 
tions of spectoheliographs, reducing the observing 
time and allowing flexibility in observing programs. 
Sequential readout of a linear array that is aligned 
with the exit slit, and that moves with the slit as the 
solar image is scanned at the entrance slit, provides a 
fast and convenient observing system. The maximum 
scan rate is set by the array readout rate. A charge- 
coupled-device (CCD) array, used directly at the focal 
plane without a slit, allows simultaneous recording 


of an image at the wavelength ofa spectral line as well 
as the adjacent spectral region. Computer process- 
ing of the digital data can then provide velocity and 
magnetic field information, and extensive solar fields 
can be recorded in a few seconds. Digital video cam- 
era systems have been used for such applications. 
Without an exit slit, the optical configuration in this 
case is simply that of a spectrograph, except for an 
image scanning capability. See ASTRONOMICAL IMAG- 
ING; CHARGE-COUPLED DEVICES; SPECTROGRAPH. 

Spectroheliographs have been designed that have 
more than one camera to record several spectral 
regions simultaneously, either in the same spectral 
order or in more than one order of the diffrac- 
tion grating. Spectroheliographs have also been used 
at ultraviolet, extreme ultraviolet (EUV), and x-ray 
wavelengths, and a miniature version has been used 
successfully in spacecraft observations of the solar 
corona. Raymond N. Smartt 
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Spectroscopy 


An analytic technique concerned with the measure- 
ment of the interaction (usually the absorption or 
the emission) of radiant energy with matter, with the 
instruments necessary to make such measurements, 
and with the interpretation of the interaction both at 
the fundamental level and for practical analysis. Mass 
spectroscopy is not concerned with the interaction 
of light with matter, but was so named because the 
appearance of the data resembles that of the spec- 
troscopic data as just defined. 

A display of such data is known as a spectrum, 
that is, a plot of the intensity of emitted or transmit- 
ted radiant energy (or some function of the inten- 
sity) versus the energy of that light. Spectra due to 
the emission of radiant energy are produced as en- 
ergy is emitted from matter, after some form of ex- 
citation, then collimated by passage through a slit, 
then separated into components of different energy 
by transmission through a prism (refraction) or by 
reflection from a ruled grating or a crystalline solid 
(diffraction), and finally detected. Spectra due to the 
absorption of radiant energy are produced when ra- 
diant energy from a stable source, collimated and 
separated into its components in a monochromator, 
passes through the sample whose absorption spec- 
trum is to be measured, and is detected. Instruments 
which produce spectra are known as spectroscopes, 
spectrometers, spectrographs, and spectrophotome- 
ters. See SPECTRUM. 

Interpretation of spectra provides fundamental in- 
formation on atomic and molecular energy levels, 
the distribution of species within those levels, the 
nature of processes involving change from one level 
to another, molecular geometries, chemical bonding, 


and interaction of molecules in solution. At the 
practical level, comparisons of spectra provide a 
basis for the determination of qualitative chemical 
composition and chemical structure, and for quanti- 
tative chemical analysis. 

Early history. In the historical development of 
spectroscopy, following the fundamental studies of 
crude spectra of sunlight by Isaac Newton in 1672, 
certain contributions and achievements are espe- 
cially noteworthy. The significance of using a narrow 
slit instead of a pinhole or round aperture so as to pro- 
duce spectra lines, each one an image of the slit and 
representing a different color or wave length, was 
demonstrated independently by W. H. Wollaston in 
1802 and by Joseph Fraunhofer in 1814. Fraunhofer 
made many subsequent contributions to optics and 
spectroscopy, including first observation of stellar 
spectra, discovery and construction of transmission 
diffraction gratings, first accurate measurements of 
wavelengths of the dark lines in the solar spectrum, 
and invention of the achromatic telescope. The ori- 
gin of the dark Fraunhofer lines in the solar spectrum 
was accounted for by G. R. Kirchhoff in 1859 on the 
basis of absorption by the elements in the cooler 
Sun’s atmosphere of the continuous spectrum emit- 
ted by the hotter interior of the Sun. Further studies 
by Kirchhoff with R. Bunsen demonstrated the great 
utility of spectroscopy in chemical analysis. By sys- 
tematically comparing the Sun’s spectrum with flame 
or spark spectra of salts and metals, they made the 
first chemical analysis of the Sun’s atmosphere. In 
1861, while investigating alkali metal spectra, they 
discovered two new alkali metals, cesium and rubid- 
ium. These achievements by Kirchhoff and Bunsen 
provided tremendous stimulus to spectroscopic re- 
searches. The adoption in 1910 of the first interna- 
tional standards of wavelength gave further impetus. 
These and later standards made possible the mea- 
surement of wavelengths of any electromagnetic ra- 
diation with unprecedented accuracy. Since World 
War II, remarkable developments in spectroscopy 
have occurred in instrumentation, achieved largely 
through advances in electronics and in manufactur- 
ing technology. Direct reading, automatic recording, 
improved sensitivity with good stability, simplicity 
of operation, and extended capabilities are features 
provided by many commercial instruments, many 
of which now are microprocessor-controlled. Many 
newer instruments have dedicated data systems. 
Predictably, these developments, by facilitating 
widespread use of spectroscopic techniques, have 
had an enormous influence in promoting develop- 
ments in both applied and theoretical spectroscopy. 

The ultimate standard of wavelength is that of the 
meter, defined since 1983 as the length of the path 
traveled by light in vacuum during a time interval of 
1/299,792,458 of a second. See WAVELENGTH STAN- 
DARDS. 

Spectroscopic units. The change in energy of an 
ion, atom, or molecule associated with absorption 
and emission of radiant energy may be measured by 
the frequency of the radiant energy according to Max 
Planck, who described an equality E = hv, where E 


is energy, v is the frequency of the radiant energy, 
and 4 is Planck’s constant. The frequency is related 
to the wavelength A by the relation vA = c/n, where 
cis the velocity of radiant energy in a vacuum, and n 
is the refractive index of the medium through which 
it passes; n is a measure of the retardation of radiant 
energy passing through matter. The units most com- 
monly employed to describe these characteristics of 
light are the following: 


Wavelength: 1 micrometer (um) = 1076 m 
1 nanometer (nm) = 10°? m 
(= 10 angstroms) 
Frequency: 1 hertz (Hz) =is! 


For convenience the wave number v (read nu bar), 
the reciprocal of the wavelength, may be used; for 
this, the common units are cm~!, read as recipro- 
cal centimeters or occasionally kaysers. This number 
equals the number of oscillations per centimeter. 

Spectral regions. Visible light constitutes only a 
small part of the spectrum of radiant energy, or elec- 
tromagnetic spectrum; the human eye responds to 
electromagnetic waves with wavelengths from about 
380 to 780 nm, though there is individual variation 
in these limits. The eye cannot measure color and 
intensity quantitatively, even for the visible portion 
of the electromagnetic spectrum; therefore, instru- 
ments are used for measurements and recording. 

There are broad regions of the electromagnetic 
spectrum with associated wavelengths greater or less 
than the visible region (see table). 

Origin of spectra. Atoms, ions, and molecules emit 
or absorb characteristically; only certain energies of 
these species are possible; the energy of the pho- 
ton (quantum of radiant energy) emitted or absorbed 
corresponds to the difference between two permit- 
ted values of the energy of the species, or energy lev- 
els. (if the flux of photons incident upon the species 
is great enough, simultaneous absorption of two or 
more photons may occur.) Thus the energy levels 
may be studied by observing the differences between 
them. The absorption of radiant energy is accompa- 
nied by the promotion of the species from a lower to 
a higher energy level; the emission of radiant energy 
is accompanied by falling from a higher to a lower 
state; and if both processes occur together, the con- 
dition is called resonance. 

Transitions. Transitions between energy levels asso- 
ciated with electrons or electronic levels range from 
the near infrared, the visible, and ultraviolet for out- 
ermost, or highest-energy, electrons, that is, those 
which can be involved in bonding, to the x-ray re- 
gion for the electrons nearest the nucleus. At low 
pressures such transitions in gaseous atoms produce 
sharply defined lines because the energy levels are 
sharply defined. Transitions between energy levels of 
the nucleus are observed in the gamma-ray region. In 
the absence of an applied electric or magnetic field, 
these electronic and nuclear transitions are the only 
ones which atoms can undergo. See ATOMIC SPEC- 
TROMETRY; ATOMIC STRUCTURE AND SPECTRA. 

Electronic transitions in molecules are also ob- 
served. In addition, transitions can occur between 
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Principal spectral regions and fields of spectroscopy 
Approximate 
Spectral wavelength Typical Typical Energy transitions 
region range source detector studied in matter 
Gamma 1-100 pm Radioactive nuclei Geiger counter; Nuclear transitions and 
scintillation counter disintegrations 
X-rays 6pm-100nm_ X-ray tube (electron Geiger counter lonization by inner electron 
bombardment of metals) removal 
Vacuum ultraviolet 10-200 nm High-voltage discharge; high- |= Photomultiplier lonization by outer electron 
vacuum spark removal 
Ultraviolet 200-400 nm Hydrogen-discharge lamp Photomultiplier Excitation of valence electrons 
Visible 400-800 nm Tungsten lamp Phototubes Excitation of valence electrons 
Near-infrared 0.8-2.5 um Tungsten lamp Photocells Excitation of valence electrons; 
molecular vibrational 
overtones 
Infrared 2.5-50 wm Nernst glower; Globar lamp Thermocouple; Molecular vibrations: stretching, 
bolometer bending, and rocking 
Far-infrared 50-1000 um Mercury lamp (high-pressure) Thermocouple; Molecular rotations 
bolometer 
Microwave 0.1-30 cm Klystrons; magnetrons Silicon-tungsten Molecular rotations; electron 
crystal; bolometer spin resonance 
Radio-frequency 1071-10 m Radio transmitter Radio receiver Molecular rotations; nuclear 
magnetic resonance 








levels associated with the vibrations and rotations of 
molecules. Spacings between electronic levels are 
greater than between vibrational levels, and those 
between vibrational levels are greater than between 
rotational levels; each vibrational level has a set of ro- 
tational levels associated with it, and each electronic 
level a set of vibrational levels (Fig. 1). Transitions 
between vibrational levels of the same electronic 
level correspond to photons in the infrared region; 
transitions between rotational levels, to photons in 
the far-infrared and microwave region. Rotational 
spectra of gaseous molecules consist of sharp lines; 
vibrational spectra consist of bands, each of which 
arises from a given transition between vibrational lev- 
els altered in energy by increments due to changes 
in rotation occurring when the vibrational level 
changes. Likewise, molecular electronic spectra con- 
sist of bands due to transitions between electronic 
energy levels, altered by increments resulting from 
changes in vibration and rotation of the molecule 
on changing electronic state. See BAND SPECTRUM; 
INFRARED SPECTROSCOPY; LINE SPECTRUM; MICRO- 
WAVE SPECTROSCOPY; MOLECULAR STRUCTURE AND 
SPECTRA; SPECTRUM ANALYZER. 
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Fig. 1. Spacing of singlet electronic states S, vibrational 
states v, and rotational states J. 


External fields. The application of a magnetic or elec- 
tric field to the sample often separates normally in- 
distinguishable, or degenerate, states in energy from 
each other. Thus, the orientation of the spin of an 
unpaired electron in an atom, ion, or molecule with 
respect to an applied magnetic field may have dif- 
ferent values, and in the magnetic field the atom 
or molecule may have different energies. For typi- 
cal field strengths, the difference in energy between 
these newly separated energy levels occurs in the 
microwave region. Similarly, for an atomic nucleus 
in an ion or molecule, differences in the orientation 
of the spin of the nucleus with respect to a magnetic 
field give rise to different energy levels of the nucleus 
in that field, so that energy differences will be found 
in the microwave region. The former phenomenon 
produces electron spin resonance or electron para- 
magnetic resonance spectra, and the latter produces 
nuclear magnetic resonance spectra (Fig. 2). 

Nuclear magnetic resonance has been advanced 
by pulsed irradiation techniques which permit the 
identification of solid samples and complex biolog- 
ical molecules, and which are useful for the pro- 
duction of three-dimensional plots to permit analysis 
across the interior of objects without destruction. In 
medicine the imaging of internal organs is known 
as magnetic resonance imaging. Electron paramag- 
netic spectroscopy is used to establish structures 
of species containing unpaired electrons in solution 
or in a crystalline solid; some qualitative and quan- 
titative analysis is also performed. See ELECTRON 
PARAMAGNETIC RESONANCE (EPR) SPECTROSCOPY; 
MEDICAL IMAGING; NUCLEAR MAGNETIC RESONANCE 
(NMR). 

Electronic spectra are also altered by external 
fields; removal of degeneracy by an externally ap- 
plied electric field is termed the Stark effect, and 
removal of degeneracy by an externally applied mag- 
netic field is termed the Zeeman effect. See STARK 
EFFECT; ZEEMAN EFFECT. 
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Fig. 2. Splitting degenerate energy levels of an appropriate 
atom in a magnetic field. Transitions between the separated 
levels require interaction with a photon of energy AE. 


Spontaneous emission. The spontaneous emission of 
light from a sample by the decay of an electron from 
a higher level to the lowest available level is known 
either as fluorescence or as phosphorescence. The 
fundamental difference between these two terms is 
associated with the orientation of the spin of this 
electron. If the spin of the electron is such that it 
falls to the lowest state without a change of spin (the 
atom or molecule having the same spin before and 
after the transition), fluorescence occurs, and the 
process is characterized as allowed and is relatively 
fast. If the spin is such that the electron can fall to the 
lower state only with a change in spin dictated by the 
Pauli exclusion principle, the process is phosphores- 
cence, characterized as a forbidden process, which is 
relatively slow (Fig. 3). In practice, other factors also 
govern the time the electron spends in the higher 
level, and the time intervals associated with each pro- 
cess overlap; that for fluorescence is 1074 to 1078 s, 
and that for phosphorescence is 10~* to 10 s. 
The spontaneous emission process thus cannot be 
completely distinguished merely on the basis of the 
time interval associated with the emission. Time- 
resolved emission studies may allow separation of 
the processes. Fluorescence and phosphorescence 
are measured at a right angle to the incident beam 
exciting the sample, in order to avoid background 
problems. The emission of light beyond blackbody 
radiation when a sample is heated (thermolumines- 


Sı — 5 


—— So — = So 


(a) (b) 


Fig. 3. Spontaneous emission of light by the decay of an 
electron from a higher to the lowest available level. 
(a) Fluorescence. (b) Phosphorescence. 


cence) or when it is strained or fractured (tribolumi- 
nescence) has limited applicability. Similar emission 
during chemical reactions (chemiluminescence; in 
biochemical systems, bioluminescence) has broader 
analytical use if the reactions in which products emit 
light are, for example, catalyzed by a small amount 
of the substance to be analyzed. See BIOLUMINES- 
CENCE; CHEMILUMINESCENCE; EXCLUSION PRINCIPLE; 
FLUORESCENCE; PHOSPHORESCENCE. 

Light rotation. Compounds whose molecules have a 
structure which cannot be superimposed on its re- 
flection in a mirror (asymmetric molecules) rotate 
the plane of polarized light. If a device can be in- 
cluded in an instrument to plane-polarize the light 
from the source, that is, to produce light oscillating 
in only one plane, the amount of rotation of that 
plane by the sample can be studied as a function of 
the wavelength of light used in the experiment. A 
plot of this rotation versus wavelength is called an 
optical rotatory dispersion curve; such curves are 
useful in studying fundamentals of light rotation and 
in establishing the absolute structure of asymmetric 
molecules empirically. Asymmetric samples whose 
absorption of the components of plane-polarized 
light differs are said to exhibit circular dichroism; 
analysis of a sample in a magnetic field (magnetic 
circular dichroism) has a broader applicability in 
structural analysis. See COTTON EFFECT; OPTICAL 
ACTIVITY. 

Instrumentation. Spectrometers require a source of 
radiation, a dispersion device, and a detector. For 
emission spectra, the source may be taken as the sam- 
ple to be measured itself, although another source of 
radiation may be needed to excite the sample to an 
excited state. Absorption spectra require a source 
to generate appropriate radiation across the range 
of energies needed for the experiment. These radi- 
ant sources produce continuous spectra and can be 
distinguished from those producing discontinuous 
spectra. 

Continuous spectra. Of the sources listed in the table, 
most of those from the vacuum ultraviolet region 
to the far-infrared region produce continuous spec- 
tra. Within this type, sources for the near ultraviolet, 
visible, and infrared regions consist of hot, incandes- 
cent solids; the maximum intensity of their emission 
occurs at a wavelength dependent on their temper- 
ature and emissivity. Electrical discharges in gases 
at higher pressures provide sources of continuous 
radiation for the vacuum ultraviolet and ultraviolet 
regions; hydrogen or deuterium gas discharges are 
especially used for the latter. X-ray continuous emis- 
sion can be produced by collision of electrons or 
other particles accelerated through tens of kilovolts 
with a target. See BEAM-FOIL SPECTROSCOPY. 

Discontinuous spectra. The sources of high-energy 
radiation which yield discontinuous spectra emit be- 
cause of discrete processes in individual atoms, ions, 
or molecules. Such sources may include flames and 
furnaces to vaporize and atomize samples, and elec- 
trical discharges in gases at lower pressures. Other at- 
omizing sources may be produced by high-frequency 
radio discharges and microwave discharges through 
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a flowing gas. For sources of radiation connected 
with nuclear processes, solid samples may be used 
containing the element to be studied, either a ra- 
dioactive isotope or one which can be made radioac- 
tive by particle bombardment. See ARC DISCHARGE. 

Among these so-called line sources are hollow- 
cathode lamps, in which a few layers of a cylindri- 
cal cathode containing the same element as is to be 
analyzed are vaporized by collisions with ions gener- 
ated from rare gases and accelerated electrons. The 
gaseous atoms, also excited to higher energy states 
by this process, emit characteristic line spectra use- 
ful of analysis of that element. Lasers have become an 
especially useful source, and have opened different 
areas of spectroscopy, not only because of the inten- 
sity of the radiation supplied to the sample, which 
permits absorption of several photons in a single pro- 
cess, but also because of the coherent properties 
of the radiation, which permit the study of events 
occurring on a picosecond time scale. See LASER; 
ULTRAFAST MOLECULAR PROCESSES. 

The sources for radio-frequency studies are tun- 
able radio-frequency oscillators, designed with spe- 
cial regard for stability. The microwave sources may 
be spark discharges, magnetrons, or most commonly 
klystrons. An electron beam traveling through a 
closed cavity of defined geometry sets up electro- 
magnetic oscillations, of which a certain value will 
be reinforced as a function of the dimension of the 
containing unit; the value can be varied somewhat 
by adjusting the size of the cavity. See KLYSTRON; 
MAGNETRON. 

Dispersive elements. There are two kinds of dispersive 
elements, prisms and diffraction elements. The ear- 
lier and more commonly available prism has been 
supplanted in many cases by diffraction elements. 

After collimation of the source radiation through 
a slit, diffraction is achieved in x-ray spectroscopy 
by the use of a pure crystal. The distances between 
atomic nuclei in the crystal are on the same order as 
the wavelength of the radiant energy; under these 
conditions an efficient dispersion of the radiation 
over an arc of many degrees is possible. For a distance 
d between repeating planes of atoms, the wave- 
length A will be diffracted through an angle 0 related 
by the Bragg law nd = 2d sin 0, where n is an 
integer called the diffraction order. Materials used 
include gypsum (CaSO, - 2H2O), ammonium dihy- 
drogen phosphate (NH4H2PO,), and alkali halides 
(lighter elements). Each of these gives a wide angular 
dispersion in a portion of the 0.03-1.4-nm range of 
the spectrum commonly used for x-ray studies; the 
appropriate one must be chosen according to the 
experiment to be performed. See X-RAY OPTICS. 

For spectroscopic techniques using visible light 
and the regions adjacent to it, ultraviolet and in- 
frared, ruled diffraction gratings are often employed. 
Because their production has become economi- 
cal, they have largely replaced prisms, which may 
require the use of several materials to get ade- 
quate angular dispersion and wavelength resolution 
over the entire range of interest, for example, 2- 
15 um for an infrared spectrum. The dispersion of a 


prism increases near the limit of its transparency, and 
there is always a compromise in efficiency between 
transmission and dispersion. The resolution R, de- 
fined as the quotient of the wavelength of a line and 
the wavelength difference between it and another 
line just separated from it, is given by T/(dn/dh), 
where Tis the thickness of the prism base and dn/di 
is the variation of the refractive index with respect to 
wavelength. Efficient prisms are made of quartz for 
the ultraviolet, glass for the visible, and various salts 
for the infrared. See DIFFRACTION GRATING; OPTICAL 
MATERIALS; OPTICAL PRISM. 

Very short- or very long-wavelength spectra are 
usually produced without a dispersive element; 
gamma-ray detection uses a wavelength-sensitive de- 
tector (a pulse height discriminator, for example), 
and microwave and radio-frequency detection may 
use a variable tuning radio receiver; in the latter case 
the source is tuned to emit a highly resolved fre- 
quency as well. 

Detectors. Detectors commonly used for the various 
spectral regions are listed in the table. The devices for 
the infrared and microwave regions are basically heat 
sensors. Except for the eye and photographic meth- 
ods used sometimes in the regions for gamma ray 
to visible, each detector converts the signal into an 
electrical response which is amplified before trans- 
mittal to a recording device. Direct imaging of the 
response by vidicon detection has also been used. 
Otherwise an oscilloscope or oscillograph is used 
to record very rapidly obtained signals. See GAMMA- 
RAY DETECTORS; OSCILLOSCOPE; TELEVISION CAMERA 
TUBE. 

Instruments. Spectroscopic methods involve a 
number of instruments designed for specialized ap- 
plications. 

Spectroscope. An optical instrument consisting of a 
slit, collimator lens, prism or grating, and a telescope 
or objective lens which produces a spectrum for vi- 
sual observation is called a spectroscope. The first 
complete spectroscope was assembled by Bunsen in 
1859 (Fig. 4). 

Spectrograph. If a spectroscope is provided with a 
photographic camera or other device for recording 
the spectrum, the instrument is known as a spectro- 
graph. For recording ultraviolet and visible spectra, 
two types of quartz spectrographs are in common 
use. One type utilizes a Cornu quartz prism con- 
structed of two 30° prisms, left- and right-handed, as 
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Fig. 4. Prism spectroscope or spectrograph. S, slit; C, 
collimator lens, P, 60° prism; T, telescope; RV, red to violet 
spectrum. For producing ultraviolet spectra, the glass 
items in C, P, and T must be replaced by equal amounts of 
left- and right-handed crystal quartz. 


well as left- and right-handed lenses, so that the rota- 
tion occurring in one-half the optical path is exactly 
compensated by the reverse rotation in the other. 
The other type of quartz spectrograph employs a Lit- 
trow 30° quartz prism with a rear reflecting surface 
that reverses the path of the light through prism and 
lens, thus compensating for rotation of polarization 
in one direction by equal rotation in the opposite 
direction. Thus, in either type the effect of optical 
activity and birefringence in crystal quartz which 
produces double images of the slit is eliminated. 

Grating spectrographs cover a much broader range 
of wavelengths (vacuum ultraviolet to far infrared) 
than do prism instruments. Various-type mountings 
are employed. The most common mounting for a 
plane-reflection grating is in a Littrow mount, en- 
tirely analogous to that of the Littrow quartz spec- 
trograph. Mountings for concave reflection gratings 
require that the grating, slit, and camera plate all 
lie on the Rowland circle (imaginary circle of radius 
equal to one-half the radius of curvature of the grat- 
ing) in order to achieve proper focus of spectral lines 
(slit images) on the plate. Paschen, Rowland, Eagle, 
and Wadsworth mountings are common in grating 
spectrographs. 

Spectrometers. A spectroscope that is provided with 
a calibrated scale either for measurement of wave- 
length or for measurement of refractive indices of 
transparent prism materials is called a spectrometer. 
Also, the term frequently is used to refer to spectro- 
graphs which incorporate photoelectric photome- 
ters instead of photographic means to measure radi- 
ant intensities. 

Spectrophotometer. A spectrophotometer consists ba- 
sically of a radiant-energy source, monochromator, 
sample holder, and detector (Fig. 5). It is used for 
measurement of radiant flux as a function of wave- 
length and for measurement of absorption spec- 
tra. The wavelength is scanned by the monochro- 
mator at a constant rate (geared to chart speed of 
recorder) while the detector responds to, and dis- 
criminates between, the two alternating beams, one 
from the sample and the other from the attenua- 
tor. The servomechanism drives the attenuator, as 
well as the recorder pen, so that the intensities of 
the alternating signals are kept identical. The dis- 
placement of the pen on the chart can be calibrated 
in terms of relative intensity, percent transmittance, 
or absorbance. Generally a linear attenuator is em- 
ployed, and the pen displacement is directly propor- 
tional to the attenuator displacement and thus the 
transmittance. 

Interferometer. This optical device divides a beam of 
radiant energy into two or more parts which travel 
different paths and then recombine to form interfer- 
ence fringes. Since an optical path is the product of 
the geometric path and the refractive index, an in- 
terferometer measures differences of geometric path 
when two beams travel in the same medium, or the 
difference of refractive index when the geometric 
paths are equal. Interferometers are employed for 
high-resolution measurements and for precise deter- 
mination of relative wavelengths: they are capable of 
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Fig. 5. Schematic diagram of a double beam (in time) spectrophotometer. 


distinguishing between two spectral lines that differ 
by less than 10~° of a wave. See INTERFERENCE OF 
WAVES; INTERFEROMETRY. 

Quantitative relationships. For practical analysis, 
the absorption of radiant energy is related to the 
concentration of the sample by the relationship 
—log P/Py = abc, where P and Po are respectively the 
power of the light attenuated after passing through 
the sample and its unattenuated power before reach- 
ing the sample, c is the concentration of the sample, 
b is the path length of the light through the sam- 
ple, and a is a proportionality constant called the 
absorptivity of the sample. If c is in moles per liter, 
the absorptivity is called the molar absorptivity and 
symbolized e. This relationship is known as the Beer- 
Lambert-Bouguer law, or simply Beer’s law. For x-ray 
spectroscopy, the right-hand variables are grouped 
differently. 

The practical relation between emission of light 
and concentration is given by the equation F = 
kolabc, where F is the fluorescence intensity, k is 
an instrumental parameter, ¢ is the fluorescence ef- 
ficiency (quanta emitted per quantum absorbed), Jo 
is the intensity of the incident light upon the sam- 
ple, and the other units are as defined previously. 
The choice between absorption and fluorescence 
techniques for determining concentration is some- 
times made on the basis of the accuracy of the lin- 
ear method (fluorescence) versus the versatility of 
the logarithmic method (absorption). See LUMINES- 
CENCE ANALYSIS. 

Other methods and applications. Since the early 
methods of spectroscopy there has been a prolifera- 
tion of techniques, often incorporating sophisticated 
technology. 

Acoustic spectroscopy. When modulated radiant en- 
ergy is absorbed by a sample, its internal energy in- 
creases. The loss of that excess produces a temper- 
ature increase that can be monitored as a periodic 
pressure change in the gas around the sample by 
using a microphone transducer. This is the op- 
toacoustic effect. Its application provides a rapid 
method for study of some difficult samples. See PHO- 
TOACOUSTIC SPECTROSCOPY. 

Astronomical spectroscopy. The radiant energy emitted 
by celestial objects can be studied by combined 
spectroscopic and telescopic techniques to ob- 
tain information about their chemical composition, 
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temperature, pressure, density, magnetic fields, elec- 
tric forces, and radial velocity. Radiation of wave- 
lengths much shorter than 300 nm is absorbed by 
the Earth’s atmosphere, and can only be studied 
by spectrographs transported into space by rockets. 
See ASTRONOMICAL SPECTROSCOPY; SPECTROHELIO- 
GRAPH. 

Atomic absorption and fluorescence spectroscopy. This 
branch of electronic spectroscopy uses line spectra 
from atomized samples to give quantitative analysis 
for selected elements at levels down to parts per 
million, on the average. However, detection limits 
vary greatly from element to element. Plasmas, espe- 
cially inductively coupled plasmas, provide a stable, 
efficient, novel method for atomization and ioniz- 
ation. 

Attenuated total reflectance spectroscopy. Spectra of sub- 
stances in thin films or on surfaces can be obtained 
by the technique of attenuated total reflectance or 
by a closely related technique called frustrated mul- 
tiple internal reflection. In either method the sample 
is brought into contact with a total-reflecting trape- 
zoid prism and the radiant-energy beam is directed 
in such a manner that it penetrates only a few mi- 
crometers of the sample one or more times. 

The technique is employed primarily in infrared 
spectroscopy for qualitative analysis of coatings and 
of opaque liquids. 

Electron spectroscopy. This area includes a number of 
subdivisions, all of which are associated with elec- 
tronic energy levels. The outermost or valence lev- 
els are studied in photoelectron spectroscopy, which 
uses photons of the far-ultraviolet region to remove 
electrons from molecules and to infer the energy lev- 
els of the remaining ion from the kinetic energy of 
the expelled electron. This technique is used mostly 
for fundamental studies of bonding. Electron impact 
spectroscopy uses low-energy electrons (0-100 eV) 
to yield information similar to that observed by vis- 
ible and ultraviolet spectroscopy, but governed by 
different selection rules. 

X-ray photoelectron spectroscopy CXPS), also 
called electron spectroscopy for chemical analysis 
(ESCA), uses x-ray photons to remove inner-shell 
electrons in a similar way, and the kinetic energy 
of the removed electron is measured. In Auger spec- 
troscopy, an x-ray photon removes an inner electron. 
Another electron falls from a higher electron to take 
its place, and a third is ejected to conserve energy, 
since the second electron gains energy when it falls 
to the lower level. The kinetic energy of the third 
electron is measured. The chemical elements con- 
tained within a sample can then be identified, except 
for hydrogen. The energies of inner-shell electrons 
are dependent (in the third or fourth significant fig- 
ure) upon the oxidation state of the atom, and so 
the kinetic energy of an XPS or Auger electron often 
provides information on the oxidation states of the 
elements in a compound too. Both techniques are 
used to study surfaces, since x-rays penetrate many 
objects for only a few atomic or molecular layers. 
The x-rays may be directed at the sample at various 


angles to the normal, and because they penetrate 
the sample for a more or less fixed distance, x-rays 
that are directed at the sample at a large angle from 
the normal (for example, 80°) encounter far fewer 
atomic or molecular layers than those that enter at 
smaller angles from the normal. This angle-resolved 
x-ray photoelectron spectroscopy permits more re- 
finement in the analysis of the first few surface layers 
of a material. Ion neutralization spectroscopy uses 
protons or other charged particles instead of pho- 
tons. See AUGER EFFECT; ELECTRON SPECTROSCOPY; 
SURFACE AND INTERFACIAL CHEMISTRY; SURFACE 
PHYSICS. 

Fourier transform spectroscopy. This family of techniques 
consists of a fundamentally different method of irra- 
diation of the sample, in which all pertinent wave- 
lengths simultaneously irradiate it for a short period 
of time and the absorption spectrum is obtained 
by mathematical manipulation of the cyclical power 
pattern so obtained. It has been applied particularly 
to infrared spectrometry and nuclear magnetic res- 
onance spectrometry, and allows the acquisition of 
spectra from smaller samples in less time, with high 
resolution and wavelength accuracy. The infrared 
technique is carried out with an interferometer of 
conventional instrumentation. See FOURIER SERIES 
AND TRANSFORMS. 

Gamma-ray spectroscopy. Of special note in this cate- 
gory are the techniques of activation analysis, which 
are performed on a sample without destroying it by 
subjecting it to a beam of particles, often neutrons, 
which react with stable nuclei present to form new 
unstable nuclei. The emission of gamma rays for each 
element at different wavelengths with different half- 
lives as these radioactive nuclei decay is character- 
istic. See ACTIVATION ANALYSIS; NEUTRON SPECTRO- 
METRY. 

MOssbauer spectroscopy results from resonant ab- 
sorption of photons of extremely high resolution 
emitted by radioactive nuclei in a crystalline lattice. 
The linewidth is extremely narrow because there is 
no energy loss for the nucleus recoiling in the lat- 
tice, and differences in the chemical environment of 
the emitting and absorbing nuclei of the same iso- 
tope may be detected by the slight differences in 
wavelength of the energy levels in the two samples. 
The difference is made up by use of the Doppler ef- 
fect as the sample to be studied is moved at different 
velocities relative to the source, and the spectrum 
consists of a plot of gamma-ray intensity as a func- 
tion of the relative velocity of the carriage of the 
sample. It yields information on the chemical en- 
vironment of the sample nuclei and has found ap- 
plication in inorganic and physical chemistry, solid- 
state physics, and metallurgy. See DOPPLER EFFECT; 
MÖSSBAUER EFFECT. 

Laser spectroscopy. Laser radiation is nearly mono- 
chromatic, of high intensity, and coherent. Some or 
all of these properties may be used to acquire new 
information about molecular structure. In multipho- 
ton absorption the absorption of more than one pho- 
ton whose energy is an exact fraction of the distance 


between two energy levels allows study of transi- 
tions ordinarily forbidden when only a single photon 
is involved. Multiphoton ionization refers to the re- 
moval of an election after the absorption of several 
photons. Raman spectroscopy has also become more 
common because of the use of lasers as sources, es- 
pecially with the development of resonance Raman 
spectroscopy. 

Information on processes which occur on a pico- 
second time scale can be obtained by making use of 
the coherent properties of laser radiation, as in co- 
herent anti-Stokes-Raman spectroscopy. Lasers may 
also be used to evaporate even refractory samples, 
permitting analysis of the atoms so obtained, and 
since the radiation may be focused, such a technique 
may be applied to one small area or a sample surface 
at a time, permitting profiling of the analysis across 
the sample surface. Laser fluorescence spectroscopy 
provides the lowest detection limits for many mate- 
rials of interest in biochemistry and biotechnology, 
and intense work on development of rapid, sensi- 
tive methods proceeds. This will have application 
in the research that involves mapping of the human 
genome. See GENETIC MAPPING. 

Ultrafast laser spectroscopy may be used to study 
some aspects of chemical reactions, such as tran- 
sition states of elementary reactions and orienta- 
tions in bimolecular reactions. See LASER SPECTROS- 
COPY. 

Mass spectrometry. The source of a mass spectrome- 
ter produces ions, often from a gas, but also in some 
instruments from a liquid, a solid, or a material ad- 
sorbed on a surface. Ionization of a gas is most com- 
monly performed by a current of electrons acceler- 
ated through about 70-80 V, but many other methods 
are used. The dispersive unit may consist of an elec- 
tric field, or sequential electric and magnetic fields, 
or a number of parallel rods upon which a combina- 
tion of dc and ac voltages are applied; these provide 
either temporal or spatial dispersion of ions accord- 
ing to their mass-to-charge ratio. See MASS SPECTROM- 
ETRY; SECONDARY ION MASS SPECTROMETRY (SIMS); 
TIME-OF-FLIGHT SPECTROMETERS. 

Multiplex or frequency-modulated spectroscopy. ‘The basis of 
this family of techniques is to encode or modulate 
each optical wavelength exiting the spectrometer 
output with an audio frequency that contains the 
optical wavelength information. Use of a wavelength 
analyzer then allows recovery of the original optical 
spectrum. The primary advantage of such a system 
is that it enables the detector to sense all opti- 
cal wavelengths simultaneously, resulting in greatly 
decreased scan times and the possibility for signal 
averaging. Frequency modulation can be achieved, 
for example, by means of a special encoding mask 
as in Hadamard transform spectroscopy or by use of 
a Michelson interferometer as in Fourier transform 
spectroscopy. 

Raman spectroscopy. Consider the passage of a beam 
of light, of a wavelength not corresponding to an 
absorption, through a sample. A small fraction of 
the light is scattered by the molecules, and so exits 
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Fig. 6. Interactions of a molecule with a photon of energy 
AE which is not absorbed. (a) Rayleigh scattering. 

(b) Raman scattering, producing a Stokes line. (c) Raman 
scattering, producing an anti-Stokes line. 


the sample at a different angle. This is known as 
Rayleigh scattering if the wavelength of the scattered 
light is the same as the original wavelength; but if the 
wavelength is different, it is called Raman scattering 
(Fig. 6). The fraction of photons thus scattered by 
the Raman effect is even smaller than by Rayleigh 
scattering. Differences in wavelength correspond to 
the wavelengths of certain vibrational and rotational 
processes. If the scattered light has a longer wave- 
length, the new line in the spectrum is known as 
a Stokes line; if shorter, an anti-Stokes line. The in- 
formation obtained is of use in structural chemistry, 
because the rules allowing and forbidding transitions 
are different from those in absorption spectroscopy. 
Use of lasers for sources has revived this technique. 
A related process, resonance Raman spectroscopy, 
makes use of the fact that Raman probabilities are 
greatly increased when the exciting radiation has an 
energy which approaches the energy of an allowed 
electronic absorption. Raman scattering is also 
enhanced by surfaces, providing a valuable tool for 
surface analysis. See RAMAN EFFECT. 

X-ray spectroscopy. The excitation of inner electrons 
in atoms is manifested as x-ray absorption; emis- 
sion of a photon as an electron falls from a higher 
level into the vacancy thus created is x-ray fluores- 
cence. The techniques are used for chemical analysis. 
Extended x-ray absorption fine structure, obtained 
under conditions of high resolution, shows changes 
in the energy of inner electrons due to the pres- 
ence of neighboring atoms, and can be used in struc- 
tural chemistry for the determination of the nature 
and location of neighbor atoms nearest to that 
being studied. Generation of x-rays by bombarding a 
sample with a proton beam gives the microanaly- 
tical technique of proton-induced x-ray emission. 
See X-RAY FLUORESCENCE ANALYSIS; X-RAY SPECTROM- 
ETRY. Maurice M. Bursey 

Bibliography. J. D. Ingle and S. R. Crouch, Spectro- 
chemical Analysis, 1988; S. P. Parker (ed.), Spec- 
troscopy Source Book, 1988; F A. Settle et al., 
Instrumental Methods of Analysis, 8th ed., 
2007; Spectroscopy (trade journal), monthly; D. H. 
Williams and I. Flemming, Spectroscopic Methods in 
Organic Chemistry, 5th ed., 1995. 
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Spectrum 


The term spectrum is applied to any class of similar 
entities or properties strictly arrayed in order of in- 
creasing or decreasing magnitude. In general, a spec- 
trum is a display or plot of intensity of radiation (par- 
ticles, photons, or acoustic radiation) as a function of 
mass, momentum, wavelength, frequency, or some 
other related quantity. For example, a beta-particle 
spectrum represents the distribution in energy or 
momentum of negative electrons emitted sponta- 
neously by certain radioactive nuclides, and when 
radionuclides emit alpha particles, they produce an 
alpha-particle spectrum of one or more character- 
istic energies. A mass spectrum is produced when 
charged particles Gonized atoms or molecules) are 
passed through a mass spectrograph in which elec- 
tric and magnetic fields deflect the particles accord- 
ing to their charge-to-mass ratios. The distribution of 
sound-wave energy over a given range of frequencies 
is also called a spectrum. See MASS SPECTROSCOPE; 
SOUND. 

In the domain of electromagnetic radiation, a 
spectrum is a series of radiant energies arranged 
in order of wavelength or of frequency. The entire 
range of frequencies is subdivided into wide inter- 
vals in which the waves have some common char- 
acteristic of generation or detection, such as the 
radio-frequency spectrum, infrared spectrum, visible 
spectrum, ultraviolet spectrum, and x-ray spectrum. 
Spectra are also classified according to their origin 
or mechanism of excitation, as emission, absorption, 
continuous, line, and band spectra. See ELECTROMAG- 
NETIC RADIATION. 

An emission spectrum is produced whenever the 
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radiations from an excited light source are dispersed. 
Excitation of emission spectra may be by thermal en- 
ergy, by impacting electrons and ions. or by absorp- 
tion of photons. Depending upon the nature of the 
light source, an emission spectrum may be a contin- 
uous or a discontinuous spectrum, and in the latter 
case, it may show a line spectrum, a band spectrum, 
or both. 

An absorption spectrum is produced against a 
background of continuous radiation by interposing 
matter that reduces the intensity of radiation at cer- 
tain wavelengths or spectral regions. The energies 
removed from the continuous spectrum by the in- 
terposed absorbing medium are precisely those that 
would be emitted by the medium if properly excited. 
This reciprocity of absorption and emission is known 
as Kirchhoff’s principle; it explains, for example, the 
absorption spectrum of the Sun, in which thousands 
of lines of gaseous elements appear dark against the 
continuous-spectrum background. 

A continuous spectrum contains an unbroken se- 
quence of waves or frequencies over a long range 
(see illus.). 

In illus. a the spectra for 1000 and 2000°C (1832 
and 3632°F) were obtained from tungsten filament, 
that for 4000°C (7232°F) from a positive pole of car- 
bon arc. The upper spectrum in illus. b is that of 
the source alone, extending roughly from 400 to 650 
nanometers. The others show the effect on this spec- 
trum of interposing three kinds of colored glass. 

All incandescent solids, liquids, and compressed 
gases emit continuous spectra, for example, an 
incandescent lamp filament or a hot furnace. In 
general, continuous spectra are produced by high 
temperatures, and under specified conditions the 
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Spectra of a blackbody. (a) Continuous emission spectra of a solid, taken with a quartz spectrograph. (b) Continuous 
absorption spectra of the source alone and three kinds of colored glass interposed on it. (From F. A. Jenkins and H. E. White, 


Fundamentals of Optics, 4th ed., McGraw-Hill, 1976) 


distribution of energy as a function of temperature 
and wavelength is expressed by Planck’s law. See 
HEAT RADIATION; PLANCK’S RADIATION LAW. 

Line spectra are discontinuous spectra character- 
istic of excited atoms and ions, whereas band spec- 
tra are characteristic of molecular gases or chemical 
compounds. See ATOMIC STRUCTURE AND SPECTRA; 
BAND SPECTRUM; LINE SPECTRUM; MOLECULAR STRUC- 
TURE AND SPECTRA; SPECTROSCOPY. 

W. F. Meggers; W. W. Watson 
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Spectrum analyzer 


An instrument for the analysis and measurement of 
signals throughout the electromagnetic spectrum. 
Spectrum analyzers are available for subaudio, audio, 
and radio-frequency measurements, as well as for mi- 
crowave and optical signal measurements. 

Generally, a spectrum analyzer separates the sig- 
nal into two components: amplitude (displayed ver- 
tically) and frequency (displayed horizontally). On 
some low-frequency analyzers, phase information 
can also be displayed. Low-frequency analyzers are 
sometimes grouped under the heading “harmonic 
analyzers,” although this term is becoming less com- 
mon. 

Operation. On a conventional spectrum analyzer, 
a screen with a calibrated graticule displays the 
components of the input signal. The vertical scale 
displays the amplitude of each component, and the 
chosen frequency band is displayed horizontally. 
Components of the signal being analyzed are dis- 
played as vertical lines whose height is proportional 
to amplitude and whose horizontal displacement 
equates to frequency. Originally, cathode-ray tubes 
were used for the display; solid-state displays such as 
liquid-crystal displays now are used. See CATHODE- 
RAY TUBE; ELECTRONIC DISPLAY. 

Swept heterodyne technique. The traditional audio- 
frequency, radio-frequency, and microwave spec- 
trum analyzer is essentially a swept heterodyne 
analyzer (Fig. 1). A ramp generator simultaneously 
increases the frequency of a local oscillator and 
moves the spot across the display. When the ramp 
voltage is zero, the spot will be on the left-hand 
side of the display and the local oscillator will be 
at a low frequency. Any signal at the input, at 
a frequency such that the difference between its 
frequency and the local oscillator is within the band- 
width of an intermediate-frequency filter, will be de- 
tected and will vertically deflect the spot on the dis- 
play by an amount proportional to the amplitude of 
the input signal being analyzed. As the ramp volt- 
age increases, the spot moves across the display and 
the analyzer tunes to increasingly higher-frequency 
signals. Each component of a complex waveform 
is thus isolated and displayed. See HETERODYNE 
PRINCIPLE. 

Digital techniques. Traditionally, audio-frequency 
spectrum analyzers used a lower-frequency ver- 
sion of the swept heterodyne technique. Now, 
lower-frequency instruments generally use a digital 
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Fig. 1. Simplified block diagram of a spectrum analyzer with a tracking generator. 


techniques known as the fast Fourier transform 
(FFT). The signal to be analyzed is converted to a 
digital signal by using an analog-to-digital converter, 
and the digital signal is processed by using the FFT 
algorithm. The input to this algorithm consists of 
blocks of digitized data. The algorithm analyzes the 
time-domain waveform, computes the frequency 
components present, and displays the results. All 
frequency components are analyzed simultaneously, 
in contrast to a swept instrument, which analyzes 
only a narrow portion of the spectrum at any one 
time. See ANALOG-TO-DIGITAL CONVERTER. 

The FFT technique is much faster and the mea- 
surement is virtually real time, but it is restricted to 
lower frequencies. Real-time display is vital to ana- 
lyze complex nonrepetitive signals such as speech or 
random vibrations. Digital processing techniques are 
used in radio-frequency and microwave instruments, 
but an analog input is required to down-convert the 
high-frequency signals to a signal low enough in fre- 
quency to be processed by currently available analog- 
to-digital converters. See FOURIER SERIES AND TRANS- 
FORMS. 

Controls. Modern spectrum analyzers incorporate 
sophisticated microprocessor control and can be re- 
motely programmed (operated), so that a computer 
can control all functions and read the measured val- 
ues. There are normally four basic groups of controls: 
frequency, horizontal scale, vertical scale, and mark- 
ers. Both digital keypads and rotary controls are gen- 
erally provided. The frequency controls determine 
the center or reference frequency of the display. The 
horizontal-scale controls determine the portion of 
the spectrum that can be viewed; this can be a scan 
from, say, 100 Hz up to 20 GHz or beyond. Separate 
controls determine the vertical scale and the sensi- 
tivity. Markers can be placed on spectral lines to mea- 
sure accurately absolute and relative frequencies and 
levels. See MICROPROCESSOR. 
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Further controls allow the operator to optimize 
operation for different conditions; for example, the 
intermediate-frequency bandwidth can be changed 
to alter the resolution in order to resolve closely 
spaced signals. The input attenuation and gain of 
the intermediate-frequency section can be separately 
changed to optimize for either low-noise or low- 
intermodulation operation. 

Digital storage. In order to obtain high resolution 
over a relatively wide band of frequencies, it is nec- 
essary to sweep slowly, especially in radio-frequency 
and microwave instruments. A single sweep may take 
as long as 100 seconds. Digital storage gives the ef- 
fect of a constant display, even though a very slow 
sweep may have been used to acquire the displayed 
data. 

Tracking generator. The tracking generator enhances 
the applications of spectrum analyzers (Fig. 1). Its 
output delivers a swept signal whose instantaneous 
frequency is always equal to the input tuned fre- 
quency of the analyzer. The swept signal is used 
to measure the frequency response of circuits such 
as filters and amplifiers over a wide dynamic range, 
which would not be possible with a broadband non- 
selective measurement method. 

Harmonic analyzers. Spectrum analyzers covering up 
to typically 100 kHz can also be called harmonic 
analyzers. This term refers to instruments such as 
wave analyzers and distortion factor meters. Such in- 
struments are still available, but the FFT spectrum 
analyzer has largely supplanted them, assisted with 
digital techniques using microprocessors. Faster and 
more accurate measurements are generally obtained 
using modern digital instrumentation. 

Other harmonic analyzers that may be encoun- 
tered include the tunable filter analyzer, stepped fil- 
ter analyzer, parallel filter analyzer, and time com- 
pression analyzer. 

Applications. Early radio-frequency and micro- 
wave analyzers were developed to measure the per- 
formance of microwave radar transmitters and to an- 
alyze signals from single-sideband transmitters. See 
RADAR; SINGLE SIDEBAND. 
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Typical radio-frequency spectrum analyzer. (IFR Ltd.) 


A typical use for radio-frequency and microwave 
spectrum analyzers is the measurement of spurious 
radiation (noise) from electrical machinery and cir- 
cuits, known as radio-frequency interference (RFD. 
Other uses include monitoring and surveillance to 
detect unauthorized or unintended transmissions, 
such as the civil monitoring of broadcast and com- 
munication channels and the detection of electronic 
warfare signals. Another application is the analysis of 
radio communication transmitters and receivers, in- 
cluding those used in radio and television broadcast- 
ing, satellite systems, and mobile radio and cellular 
telephone communications (Fig. 2). See ELECTRICAL 
INTERFERENCE; ELECTRONIC WARFARE. 

Spectrum analyzers also fulfill a variety of general- 
purpose uses in research, development, manufac- 
ture, and field maintenance of electrical and elec- 
tronic equipment. For certain applications they are 
used instead of oscilloscopes, since the spectrum 
analysis display of a complex waveform can provide 
more information than an oscilloscope display. Os- 
cilloscopes are typically limited to frequencies up to 
1 GHz, while spectrum analyzers can measure mi- 
crowave signals over 300 GHz. See OSCILLOSCOPE. 

Low-frequency spectrum analyzers are used in a 
variety of applications. The most obvious use is the 
measurement of distortion and unwanted signals in 
all types of audio equipment, from recording and 
broadcast studios to amplifiers used in the home. 
See SOUND RECORDING; SOUND-REPRODUCING 
SYSTEMS. 

Further uses include the analysis of speech wave- 
forms, measurement of vibration and resonances in 
mechanical equipment and structures, determina- 
tion of echo delays in seismic signals, investigation 
of noise such as from aircraft engines or from ma- 
chinery in factories, analysis of sonar signals used to 
detect objects underwater, and study of ultrasonic 
waves to determine the internal structure of objects 
such as human tissue and metal castings. See BIOMED- 
ICAL ULTRASONICS; MECHANICAL VIBRATION; NOISE 
MEASUREMENT; NONDESTRUCTIVE EVALUATION; SEIS- 
MOLOGY; SONAR; ULTRASONICS. 

Optical spectrum analyzers use techniques such 
as a collimating mirror and a diffraction grating or a 
Michelson interferometer to separate out the light- 
wave components. They are used for a variety of 
applications, including measurements on lasers and 
light-emitting diodes, and for the analysis of optical- 
fiber equipment used to carry multichannel, digital 
telephony. See DIFFRACTION GRATING; INTERFEROM- 
ETRY; LASER; LIGHT-EMITTING DIODE; OPTICAL COM- 
MUNICATIONS; OPTICAL FIBERS. S. J. Gledhill 
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niques, 1988; R. A. Witte, Electronic Test Instru- 
ments: Analog and Digital Measurements, 2d ed., 
2002; R. A. Witte, Spectrum and Network Measure- 
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Speech 


A set of audible sounds produced by disturbing 
the air through the integrated movements of cer- 
tain groups of anatomical structures. Humans at- 
tach symbolic values to these sounds for commu- 
nication. There are many approaches to the study of 
speech. 


Speech Production 


The physiology of speech production may be de- 
scribed in terms of respiration, phonation, and ar- 
ticulation. These interacting processes are activated, 
coordinated, and monitored by acoustical and kines- 
thetic feedback through the nervous system. 

Respiration. Most of the speech sounds of the 
major languages of the world are formed during exha- 
lation. Consequently, during speech the period of ex- 
halation is generally much longer than that of inhala- 
tion. In providing a sublaryngeal air supply, the res- 
piratory muscles coordinate with the laryngeal and 
supralaryngeal muscles to produce suitable driving 
pressures for the formation of many speech sounds 
(Fig. 1). The aerodynamics of the breath stream in- 
fluence the rate and mode of the vibration of the 
vocal folds. This involves interactions between 
the pressures initiated by thoracic movements and 
the position and tension of the vocal folds. See RES- 
PIRATION. 

The pressure pattern of the sublaryngeal air is 
closely related to the loudness of the voice and ap- 
pears to be correlated with the perception of stress. 
For example, the word “permit” may be either a noun 
or a verb, depending on the placement of stress. Vari- 
ous attempts have been made to correlate units such 
as the syllable and the phrase to the contractions of 
specific respiratory muscles. However, the relation 
between the thoracic movements and the grouping 
of speech sounds is poorly understood. 

Experimental studies on respiration for speech 
production employ techniques such as pressure and 
electromyographic recording at various points along 
the respiratory tract as well as x-ray photography to 
investigate anatomical movements. 

Phonation. The phonatory and articulatory mech- 
anisms of speech may be regarded as an acoustical 
system whose properties are comparable to those of 
a tube of varying cross-sectional dimensions. At the 
lower end of the tube, or the vocal tract, is the lar- 
ynx. It is situated directly above the trachea and is 
composed of a group of cartilages, tissues, and mus- 
cles. The upper end of the vocal tract may terminate 
at the lips, at the nose, or both. The length of the 
vocal tract averages 6.5 in. (16 cm) in men and may 
be increased by either pursing the lips or lowering 
the larynx. 
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Fig. 1. Human head and thorax. (Courtesy of G. E. Peterson) 


Mechanism of phonation. The larynx is the primary 
mechanism for phonation, that is, the generation of 
the glottal tone. The vocal folds consist of connective 
tissue and muscular fibers which attach anteriorly 
to the thyroid cartilage and posteriorly to the vocal 
processes of the arytenoid cartilages. The vibrating 
edge of the vocal folds measures about 0.92-1.08 in. 
(23-27 mm) in men and considerably less in women. 
The aperture between the vocal folds is known as the 
glottis (Fig. 2). The tension and position of the vocal 
folds are adjusted by the intrinsic laryngeal muscles, 
primarily through movement of the two arytenoid 
cartilages. By contraction of groups of muscles, the 
arytenoid cartilages may be pulled either anterior- 
posteriorly or laterally in opposite directions. They 
may also be rotated about the vertical axis by the 
cricoarytenoid muscles. During whispering and un- 
voiced sounds, the glottis assumes the shape of a 
triangle, with the apex directly behind the thyroid 
cartilage. See LARYNX. 
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Fig. 2. Action of the glottis during speech. (a) Six successive positions of the vocal folds in one vibration cycle at 120 Hz. 
(b) Relation between pitch and vocal fold tension. (Courtesy of D. W. Farnsworth) 


Air pressure. When the vocal folds are brought to- 
gether and there is a balanced air pressure to drive 
them, they vibrate laterally in opposite directions. 
During phonation, the vocal folds do not transmit the 
major portion of the energy to the air. They control 
the energy by regulating the frequency and amount 
ofair passing through the glottis. Their rate and mode 
of opening and closing are dependent upon the po- 
sition and tension of the folds and the pressure and 
velocity of airflow. The tones are produced by the 
recurrent puffs of air passing through the glottis and 
striking into the supralaryngeal cavities. 

As the air passes through the glottis with increas- 
ing velocity, the pressure perpendicular to the direc- 
tion of airflow is reduced. This allows the tension of 
the folds to draw them together, either partly or com- 
pletely, until sufficient pressure is built up to drive 
them apart again. It is generally assumed that the vol- 
ume flow is linearly related to the area of the glottis 
and that the glottal source has a waveform that is 
approximately triangular. 

Voiced sound. Speech sounds produced during 
phonation are called voiced. Almost all of the vowel 
sounds of the major languages and some of the conso- 
nants are voiced. In English, voiced consonants may 
be illustrated by the initial and final sounds in the 
following words: “bathe,” “dog,” “man,” “jail.” The 
speech sounds produced when the vocal folds are 
apart and are not vibrating are called unvoiced; ex- 
amples are the consonants in the words “hat,” “cap,” 
“sash,” “faith.” During whispering all the sounds are 
unvoiced. 

Frequency and pitch. The rate of vibration of the vocal 
folds is the fundamental frequency of the voice (FO). 
It correlates well with the perception of pitch. The 
frequency increases when the vocal folds are made 
taut (Fig. 2b). Relative differences in the fundamental 
frequency of the voice are utilized in all languages 
to signal some aspects of linguistic information. In 
Fig. 3, the fundamental frequency of the voice for 
the phrase “speech production” is shown as a dotted 
line. Note that frequency is highest on the vowels 
“ee” and “u,” indicating that these are stressed. Also, 
the fundamental frequency drops to the lowest point 


at the end of the phrase, indicating the phrase is a 
statement rather than a question. 

Many languages of the world are known as tone 
languages, because they use the fundamental fre- 
quency of the voice to distinguish between words. 
Chinese is a classic example of a tone language. There 
are four distinct tones in Chinese speech. Said with 
a falling fundamental frequency of the voice, ma 
means “to scold.” Said with a rising fundamental fre- 
quency, it means “hemp.” With a level fundamen- 
tal frequency it means “mother,” and with a dipping 
fundamental frequency it means “horse.” In Chinese, 
changing a tone has the same kind of effect on the 
meaning of a word as changing a vowel or consonant 
in a language such as English. 

The mode of vibration of the vocal folds can be var- 
ied to change the acoustical properties of the voice. 
For example, if the folds remain sufficiently apart dur- 
ing the vibration, the voice is perceived to be breathy 
in quality. 

Study methods. The properties of the movements of 
the vocal folds have been studied by placing micro- 
phones at different points in relation to the larynx. 
Experiments have also been conducted with elec- 
trodes inserted directly into the laryngeal structures 
of hemilaryngectomized subjects and simultaneously 
recording their speech. Much information about the 
glottal vibration has been gathered through direct 
observation of the vocal folds by means of x-ray pho- 
tography, the stroboscope, and high-speed motion- 
picture photography. The relation between the vol- 
ume flow, the mode of glottal vibration, and the spec- 
trum of the glottal tone has been studied by these 
techniques. Several electronic and mechanical mod- 
els have been constructed to simulate phonation, 
based on the collated evidence from the acoustics 
and physiology of the larynx. 

Articulation. The activity of the structures above 
and including the larynx in forming speech sound is 
known as articulation. It involves some muscles of 
the pharynx, palate, tongue, and face and of masti- 
cation (Fig. 1). 

The primary types of speech sounds of the 
major languages may be classified as vowels, nasals, 
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Fig. 3. Sound spectrograms of the utterance “speech production.” (a) Energy patterns: A, pulse; B, periodic; C, aperiodic; 
and D, silence. (b) The dotted lines represent the pitch pattern of the utterance. 


plosives, and fricatives. They may be described in 
terms of degree and place of constriction along the 
vocal tract. See PHONETICS. 

Vowels. The only source of excitation for vowels 
is at the glottis. During vowel production the vocal 
tract is relatively open and the air flows over the cen- 
ter of the tongue, causing a minimum of turbulence. 
The phonetic value of the vowel is determined by 
the resonances of the vocal tract, which are in turn 
determined by the shape and position of the tongue 
and lips. 

The point of constriction of a vowel is where the 
cross-sectional area of the vocal tract is minimized 
by the humping of the tongue. At this point the vo- 
cal tract is divided approximately into an oral cavity 
in front and a pharyngeal cavity behind. Vowels are 
known grossly as back, central, and front as the point 
of constriction moves from the pharyngeal wall ante- 
riorly along the palate, as in the words “boot” (back), 
“but” (central), “beet” (front). Vowels are referred 
to as high, mid, and low as the height of the tongue 
hump is increasingly lowered, as in the words “beet” 
chigh), “bet” (mid), “bat” dow). As the two param- 
eters, point of constriction and tongue height, are 
varied, the relations between the oral and pharyngeal 
cavities change to produce characteristic resonances 
for different vowels. 

Vowels are grossly described as rounded when 
produced with the contraction of the muscles of 
the lips, as in “he.” Since rounding lengthens the 


vocal tract, the resonances of the rounded vowel are 
generally lower than those of the corresponding un- 
rounded vowel. 

Nasals. The nasal cavities can be coupled onto the 
resonance system of the vocal tract by lowering the 
velum and permitting airflow through the nose. Vow- 
els produced with the addition of nasal resonances 
are known as nasalized vowels. Nasalization may be 
used to distinguish meanings of words made up of 
otherwise identical sounds, such as bas and banc in 
French. If the oral passage is completely constricted 
and air flows only through the nose, the resulting 
sounds are nasal consonants. The three nasal conso- 
nants in “meaning” are formed with the constriction 
successively at the lips, the hard palate, and the soft 
palate. 

Plosives. These are characterized by the complete 
interception of airflow at one or more places along 
the vocal tract. The pressure which is built up behind 
the intercepting mechanism may not be immediately 
released or may be released through the oral or nasal 
orifice. The places of constriction and the manner 
of the release are the primary determinants of the 
phonetic properties of the plosives. The words “par,” 
“bar,” “tar,” and “car” begin with plosives. 

When the interception is brief and the constriction 
is not necessarily complete, the sound is classified 
as a flap. By tensing the articulatory mechanism in 
proper relation to the airflow, it is possible to set the 
mechanism into vibrations which quasi-periodically 
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Fig. 4. Computer-generated three-dimensional spectral 
display of the spoken syllable “yah.” 


intercept the airflow. These sounds are called trills 
and can be executed with the velum, the tongue, 
and the lips. They usually are produced around 
25 Hz, whereas the fundamental frequency of the 
male speaking voice ranges from 80 to 160 Hz. 

Fricatives. These are produced by a partial constric- 
tion along the vocal tract which results in turbu- 
lence. Their properties are determined by the place 
or places of constriction and the shape of the modi- 
fying cavities. The fricatives in English may be illus- 
trated by the initial and final consonants in the words 
“vase,” “this,” “faith,” “hash.” 

Acoustical analysis. The muscular activities of 
speech production influence each other both si- 
multaneously and with respect to time. It is fre- 
quently difficult to segment a sequence of sounds 
because the physiological activities which produce 
them form a continuum. Consequently the physical 
aspects of a sound type are determined to some 
extent by neighboring sounds. Each physiological 
parameter—for example, lip rounding and funda- 
mental frequency of the glottal tone—may be varied 
continuously. Therefore, within its physiological lim- 
its the speech mechanism is able to produce a con- 
tinuum of different sounds. The physiological and 
acoustical criteria in a classification of these sounds 
are therefore dependent on external conditions such 
as the threshold of the analyzing instruments. 

There is a complex, many-to-one relation between 
the physiology of production and the resultant acous- 
tical waves. Acoustically, speech sounds may be re- 
garded as the simultaneous and sequential combi- 


nations of pulse, periodic, and aperiodic forms of 
energy interrupted by silence of varying duration. 
These energy patterns are labeled as A, B, C, and D, 
respectively, in the sound spectrogram of Fig. 3a. The 
horizontal bands in the periodic portions indicate 
the frequency positions of the vocal tract resonances. 
On the average, there is one resonance per kilohertz 
for vowels produced by a male vocal tract, though 
these resonances are differently spaced for various 
vowels. The pitch of the utterance is illustrated in 
the sound spectrograms of Fig. 3b. Each horizontal 
line indicates a harmonic of the glottal tone. The 
spectrum of the glottal tone is usually taken to have 
a slope of —6 to —12 decibels per octave when the 
effects of the vocal tract resonances are discounted. 
The sound spectrograph has been a valuable instru- 
ment for research on the acoustical aspect of speech. 
Essentially, it makes a Fourier type of analysis on the 
acoustical wave. The results are then translated into 
graphic form, usually with frequency along the ordi- 
nate and time along the abscissa. 

Advances in computer technology have also been 
applied to the analysis of speech. Figure 4 shows 
a three-dimensional display of the syllable “yah.” In 
this display, the amplitude of each vocal tract reso- 
nance is indicated by the height of the peak. Thus it 
can be clearly seen that as the syllable progresses in 
time, the resonances change not only in frequency 
but in amplitude as well. 

Novel methods have been introduced from signal- 
processing technology, such as the so-called speech 
flakes shown in Fig. 5. The flakes are of the vowels 
as in “far,” “food,” and “fee.” These three vowels are 
by far those most often used in the languages of the 
world. The redundant symmetry in such flakes may 
turn out to be quite helpful for visualizing speech 
sounds. 

Neurology. The ability to produce meaningful 
speech is dependent in part upon the association 
areas of the brain. It is through them that the stimuli 
which enter the brain are interrelated. These areas 
are connected to motor areas of the brain which send 
fibers to the motor nuclei of the cranial nerves and 
hence to the muscles. Three neural pathways are di- 
rectly concerned with speech production, the pyra- 
midal tract, the extrapyramidal, and the cerebellar 
motor paths. It is the combined control of these path- 
ways upon nerves arising in the medulla and ending 





Fig. 5. Speech flakes of vowels in the words (a) “far,” (b) “food,” and (c) “fee.” (Courtesy of Charles Wooters) 
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Fig. 6. Regions of the brain involved in speech. (a) Left hemisphere, usually more involved in speech functions than the right 
hemisphere. (b) A model for the neural processes involved in speaking a heard word. Arrows indicate the path of the relayed 


signal from the primary auditory area to the motor cortex. 


in the muscles of the tongue, lips, and larynx which 
permits the production of speech. See NERVOUS SYS- 
TEM (VERTEBRATE). 

The part of the pyramidal tract which is most im- 
portant for speech has its origin in the lower por- 
tion of the precentral gyrus of the cerebral cortex. 
In proximity to the precentral gyrus on the left side 
of the brain is Broca’s area. This area is one of several 
which are believed to activate the fibers of the pre- 
central gyrus concerned with movements necessary 
for speech production. 

In most people, the left hemisphere of the brain 
is more involved in speech functions than the right 
hemisphere. Figure 6a shows the regions on the 
surface of the left hemisphere that are especially per- 
tinent for speech. Figure 6b shows a possible model 
for speaking a heard word. First the signal is received 
at the primary auditory area. From there the signal 
is relayed to Wernicke’s area, situated posteriorly in 
the temporal lobe, where it is recognized as a word. 
The heard word is then relayed to Broca’s area, sit- 
uated inferiorly in the frontal lobe, via bundles of 
nerve fibers collectively known as arcuate fascicu- 
lus. Finally it is relayed to the motor cortex, which 
ultimately structures the muscular activities that pro- 
duce the sounds, that is, speak the word. 

Areas of the extrapyramidal tract which contribute 
to speech are the caudate nucleus, globus pallidus, 
and the thalamus. Some of the fibers from these cen- 
ters also go to the same motor nerves as those of the 
pyramidal tract. The chief function of these paths 
in regard to speech is their regulatory and refining 
actions. It is through the influence of the extrapyra- 
midal system that the regulation and inhibition of 
opposing sets of muscles are controlled. The cere- 
bellar motor paths also contribute to the coordina- 
tion and tonus of muscle structures requiring move- 
ments of paired muscles. In addition, the cerebellar 
motor paths are important in coordinating breathing 
for speech production. 

Six of the 12 cranial nerves send motor fibers to 
the muscles that are involved in the production of 
speech. 

These nerves are the trigeminal, facial, glossopha- 


ryngeal, vagus, spinal accessory, and the hypoglos- 
sal. They represent the link between the neural ac- 
tivity which begins in the cerebral cortex and the 
coordinated muscular movements which produce 
speech. The relations between the neurology, phys- 
iology, and acoustics of speech are extremely com- 
plex and poorly understood. Within normal anatomi- 
cal variation, the skill in producing particular speech 
sounds is almost entirely determined by the early lin- 
guistic environment of the speaker. Through the in- 
teractions between the linguistic environment and 
the speech of the individual, the sounds and their 
symbolic values change from community to commu- 
nity and through time. See PSYCHOACOUSTICS; PSY- 
CHOLINGUISTICS. 


Development 


In the early stages of speech development the child’s 
vocalizations are quite random. The control and vol- 
untary production of speech are dependent upon 
physical maturation and learning. 

It is possible to describe the development of 
speech in five stages. In the first stage the child makes 
cries in response to stimuli. These responses are not 
voluntary but are part of the total bodily expression. 
The second stage begins between the sixth and sev- 
enth week. The child is now aware of the sounds 
he or she is making and appears to enjoy this activ- 
ity. During the third stage the child begins to repeat 
sounds heard coming from himself or herself. This is 
the first time that the child begins to link speech pro- 
duction to hearing. During the ninth or tenth month 
the child enters the fourth stage and begins to imi- 
tate without comprehension the sounds that others 
make. The last stage begins between the twelfth and 
eighteenth month, with the child intentionally em- 
ploying conventional sound patterns in a meaningful 
way. The exact time at which each stage may occur 
varies greatly from child to child. 

Although most children begin to use speech mean- 
ingfully by the eighteenth month, they are not able to 
articulate all the necessary sounds. The first sounds 
the child uses are mostly front vowels. The back vow- 
els become more frequent as the child develops. The 
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ability to produce vowels correctly seems to be al- 
most completely learned by 30 months. The earli- 
est consonants the child uses are those formed with 
both lips. By 54 months, the child can produce all the 
necessary consonants. The approximate age level at 
which each of 23 consonant sounds of English is 
mastered by American children follows. 


Age, Sounds 
months 

31 b(baby), p(papa), mqnama), w(wet), 
hbe) 

4" d(dada), t(two), n(mose), g(go), 
k(cat), ng(sing), y(yet) 

5\/ ffun) 

6'/ v(very), th(¢that), z(azure), sh(shoe), 
Idie) 

Wh s(see), z(zoo), r(rain), th(thin), 
wh(who) 


The close coupling of age and sound acquisition, as 
exhibited above, together with other observations, 
has led some researchers to post innate, genetic 
mechanisms which serve exclusively speech. This 
innate hypothesis is a focal point for discussion on 
how speech emerged, in the child as well as in the 
human species. 


Speech Technology 


Since about 1990, technologies have arisen that are 
taking over an ever-expanding share of the market- 
place. These technologies are based on the knowl- 
edge that has accumulated from many decades of 
research on how speech is produced and perceived. 
Speech technology has been developing within three 
areas. 

One area has to do with identifying a speaker by 
analyzing a speech sample. Since the idea is analo- 
gous to that of identifying an individual by finger- 
print analysis, the technique has been called voice 
print. However, fingerprints have two important ad- 
vantages over voice prints: (1) they are based on 
extensive data that have accumulated over several 
decades of use internationally, whereas no compara- 
ble reference exists for voice prints; and (2) itis much 
easier to alter the characteristics of speech than of 
finger prints. Consequently, this area has remained 
largely dormant. Most courts in the United States, for 
instance, do not admit voice prints as legal evidence. 
Some progress may be made in voice print identifi- 
cation after a better understanding of the process of 
phonation is achieved, since this is more specific to 
individuals. 

In contrast, the two other areas of speech tech- 
nology, synthesis and recognition, have seen explo- 
sive growth. Humans prefer to interact with speech, 
which they have mastered since early childhood, 
rather than via keyboard and screen. This preference 
becomes a necessity when visibility is poor, or when 
the hands are otherwise occupied. With the rapid 
pace in computer advancement, speech technology 
is expected to extend into more and more sectors of 
human life in the years to come. 


In many applications where a limited repertoire 
of speech is required, computer-synthesized speech 
is used instead of human speakers. These applica- 
tions began with automobile devices which remind 
the driver to fasten his seat belt, and telephone di- 
rectory services which tell customers the numbers 
they seek. The area has advanced greatly. Coupled 
with a machine that can read the printed page, the 
conversion of text to speech is almost a reality. A 
common technology currently used in speech syn- 
thesis involves an inventory of pitch-synchronized, 
prestored human speech. These prestored patterns 
are selected according to the particular requirements 
of the application and recombined with some over- 
lap into the desired sentence by computer, almost in 
real time. 

The quality of synthesized speech for English is 
remarkably good, though it is limited at present to 
neutral, emotionless speech. Many other languages 
are being synthesized with varying degrees of suc- 
cess. Much research will be required to endow syn- 
thesized speech with the rich panorama of emotion 
that human speech can convey. 

The recognition of speech by computer is much 
more difficult than synthesis. Instead of just repro- 
ducing the acoustic wave, the computer must un- 
derstand something of the semantic message that the 
speech wave contains, in order to recognize pieces 
of the wave as words in the language. Humans do 
this easily because they have a great deal of back- 
ground knowledge about the world, because they 
are helped by contextual clues not in the speech 
wave, and because they are extensively trained in 
the use of speech. Although there are various sys- 
tems in the marketplace at present which purport 
to perform dictation, their levels of performance are 
typically quite low. This is especially true when the 
speech takes place in noisy conditions. 

Nonetheless, given various constraints, some of 
the existing systems do remarkably well—even to the 
extent of helping surgeons in the operating room, 
where reliability is of paramount concern. These 
constraints include (1) stable acoustic conditions in 
which speech is produced, (2) a speaker trained by 
the system, (3) limited inventory of utterances, and 
(4) short utterances. The research here is strongly 
driven by the marketplace, since all sorts of applica- 
tions can be imagined where spoken commands are 
required or highly useful. It is expected that speech 
technology will continue to grow at a fast pace in 
the coming decades. See SPEECH DISORDERS. 

William S.-Y. Wang 
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Typically, problems of speech production, articu- 
lation, or voice, or disruptions of the ability to 
use language. These disorders can be categorized 
in a variety of ways, but most belong to one of 
the following types: articulation, voice, fluency, and 
language. 

Another way to categorize these disorders is on 
the basis of the cause, organic or nonorganic. Or- 
ganic speech and language disorders typically re- 
sult from disease or injury to the structures, mus- 
cles, and nerves necessary to move and control the 
mechanisms for speech and voice production. Dam- 
age to the brain, particularly the left hemisphere, 
as the result of stroke, dementia, or traumatic head 
injury can result in disruptions of normal language 
function. In children, failure of the brain or sys- 
tems subserving language and speech to develop ad- 
equately can result in significant developmental de- 
lays in the acquisition of appropriate language and 
speech functions. Disorders of language and speech 
for which there is no demonstrable organic etiol- 
ogy are often the result of poor learning or inad- 
equate speech and language models. In some in- 
stances, there may be a psychogenic basis for the 
disorder. 

Articulation. Articulation for speech production 
involves the appropriate sequential movement of 
structures known as the articulators: mandible, 
tongue, lips, velum, and so forth. There is a wide 
range of acceptable productions of a given sound 
(phoneme) in the language. The variations, some- 
times known as dialect, reflect regional, ethnic, and 
socioeconomic characteristics; they should not be 
considered as deviant productions. In most of these 
instances, production is precise and falls within the 
accepted bounds of the individual’s speech and lan- 
guage community. However, when the production 
of one or more sounds in the language is imprecise 
and cannot be understood, then an articulation dis- 
order is present. Disorders can range from mild prob- 
lems such as the distortions associated with a lisp, 
or can be so severe as to render the speaker unin- 
telligible. Most articulation problems can be cate- 
gorized in terms of how the sounds are produced. 
Speech sounds can be misarticulated by distortion, 
omission, substitution, or addition. These problems 
can arise from organic causes, such as impairment to 
motor control of the speech musculature following 
a stroke or cerebral palsy (dysarthria), cleft palate 
or lip, or hearing loss. Many articulation problems 
have no discernible organic basis; these are labeled 
as functional, since the etiology of the problem is 
unclear. Many of these problems begin in childhood 
and can persist into adult life if not treated appro- 
priately. See CLEFT LIP AND CLEFT PALATE; HEARING 
IMPAIRMENT; PHONETICS. 

There are a number of approaches for treating 
these disorders, both organic and functional. For ex- 
ample, discrimination training is based on the con- 
cept that the basis of the problem is poor sound dis- 
crimination and the inability to match the auditory 


Speech disorders 


feedback from production with the patterns that oth- 
ers are able to produce. Treatment focuses on train- 
ing auditory discrimination to discover the differ- 
ences and then modifying production to match the 
pattern. Other approaches include behavior modifi- 
cation; production training, which requires relearn- 
ing specific motor patterns; linguistic approaches 
based on theories of phonological structure; and psy- 
chological approaches, including the psychoanalytic 
model. The psychological approach has not been 
widely accepted; however, psychological counseling 
can sometimes be of great help to children who have 
not been accepted because of their poor articulatory 
behaviors. Finally, in cases of very severely impaired 
persons, such as those with cerebral palsy and ac- 
companying dysarthria, communication boards and 
alternative communicative methods, including com- 
puters using synthetic speech, have become increas- 
ingly common. In persons with orofacial abnormal- 
ities, such as cleft lip or palate, that interfere with 
articulation, surgical and dental repair or protheses 
may be necessary before speech therapy can be ini- 
tiated. See PSYCHOLINGUISTICS. 

Production of speech sounds is a complex process 
that requires precise programming and direction by 
the brain. When there is damage to those brain re- 
gions controlling articulatory programming because 
of stroke or trauma, a disorder known as apraxia may 
ensue. This is in contrast to dysarthria, where artic- 
ulation is distorted because of paralysis or paresis of 
the musculature necessary for speech production. 
See APRAXIA; BRAIN. 

Voice. Voice disorders occur when the phonatory 
mechanism is not functioning properly. Some im- 
pairments of function can be the result of vocal 
abuse or misuse. These voice disorders are said to 
be functional, although prolonged abuse or misuse 
can result in temporary or permanent damage to the 
phonatory mechanism. There are, however, voice 
disorders that have a true organic etiology. These dis- 
orders result from disease or some other condition 
that affects the laryngeal, oral, or respiratory struc- 
tures, which are involved in the production of voice. 
Phonation problems associated with cleft palate, 
which is a structural defect in the palate resulting 
in hypernasality, are not included in functional or or- 
ganic disorders. Aphonia is a complete absence of 
voice. More commonly, individuals with voice dis- 
order are dysphonic, retaining some voice quality. 
Whether functional or organic in origin, voice dis- 
orders are described in terms of pitch, quality (reso- 
nance), or loudness. Instruments have become avail- 
able that permit sensitive and objective measure- 
ment of vocal characteristics. These instruments are 
of value in both diagnosis and treatment of voice dis- 
orders. 

Pitch. Disorders of voice in which pitch is the pre- 
dominant factor occur when the speaker’s voice 
pitch is inappropriate for age and gender. In these 
instances, the pitch is described as too high, too 
low, monotonous, or demonstrating repeated inap- 
propriate pitch patterns. A voice that is pitched too 
high is often the result of psychological tension. The 
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muscles of the larynx are contracted in such a way as 
to raise pitch significantly beyond its normal range. 
It may also result when the larynx is small for age 
and gender and the vocal folds are thin and short. 
In some instances, the speaker’s inability to perceive 
pitch and pitch patterns may result in a high-pitched 
voice, monotonous voice, or inappropriate use of 
repeated pitch patterns. See PITCH. 

Quality. Disorders of voice quality (resonance) can 
be divided into two major groups. In one group the 
problem is at the point of origin of the voice, the 
vocal folds and larynx; in the other, at the suprala- 
ryngeal resonating cavities. An example of the first 
type is functional dysphonia, related to emotional 
stress, in which breathy voice indicates vocal cord 
vibration without a closed phase. Vocal abuse may 
also cause vocal fold nodules, which do not permit 
adequate closure of the folds, yielding a breathy and 
harsh vocal quality. In contrast, when the vocal cords 
are squeezed too tightly, they cannot vibrate prop- 
erly, and the voice sounds hoarse and low-pitched. 
Hoarseness associated with excessive vocal cord clo- 
sure is known as hyperfunctional dysphonia. Hoarse- 
ness or breathiness can also result from organic dis- 
ease affecting the vocal cords, such as paralysis of 
one vocal cord, trauma, myasthenia, or cancer of the 
larynx. 

Disorders of quality related to the resonating cav- 
ities are typically of two types, hypo- or hypernasal. 
Hyponasal voice results when there is inadequate 
nasal resonance. This may be due to a physiologic or 
anatomic blockage of the vibrating air that ordinarily 
passes through the nasal cavities. In this case, the vi- 
brating airstream does not reach the nasal passages. 
Hypernasality is the opposite condition, where too 
much air comes through the nasal cavities during 
phonation. This is a common problem associated 
with cleft palate. Speakers who are imprecise in their 
articulation and speak rapidly also demonstrate a 
mild form of this problem. Another type of hyper- 
nasality occurs when there is obstruction in the an- 
terior nasal cavities (pinching the nostrils), which 
produces a quality similar to that of a nasal twang. 
In this case, the vibrating airstream reaches the nasal 
cavities but is unable to pass through them unim- 
peded. 

Loudness. Disorders of loudness relate to the inten- 
sity of voice production. Here the voice is either 
not loud enough to be audible or too loud. The 
latter problem may occur in persons who work in 
very noisy environments and must shout in order to 
be heard. These persons carry this loud production 
over to the nonwork environment. Persons whose 
voice intensity is significantly below that which 
can be heard are typically those individuals whose 
feel inadequate and insecure. Loudness changes can 
often by symptomatic of hearing loss. When there 
is a conductive hearing impairment, the voice is 
often not loud enough, while in the case of sensory- 
neural impairment the voice is too loud. See LOUD- 
NESS. 

Alaryngeal speech. One other disorder of vocal pro- 
duction that should be considered is alaryngeal or 


esophageal speech. This results following surgical re- 
moval of the larynx, usually because of cancer. The 
individual who has undergone such surgical removal 
can, by means of esophageal speech or an artificial 
larnyx, produce adequate but limited voice. See LAR- 
YNX. 

Treatment. Treatment of the voice disorders begins 
with diagnosis. It is essential that any organic basis or 
the disorder be determined and treated before begin- 
ning voice therapy with a certified speech-language 
pathologist. Beyond medical treatment, therapy may 
include modification of the environment, psycho- 
logical counseling, and direct therapy. The last will 
usually include the development of listening skills, 
relaxation exercises, and specific vocal exercises to 
eliminate vocal abuse and determine the optimal 
pitch and resonance. Listening exercises are partic- 
ularly useful in treating disorders where the voice is 
too loud. 

Fluency. Among the most difficult disorders of 
speech to treat are those of fluency, which are char- 
acterized by a disruption of the normal rate of speech 
production. There are three major types of fluency 
disorders: stuttering, cluttering, and those related 
to motor speech disorders of neurological origin 
(dysarthria and apraxia). Of these, stuttering is the 
most commonly observed. 

Stuttering. No one definition of stuttering encom- 
passes the views of scholars devoted to its study. 
However, the following features appear in most def- 
initions: a disruption or blockage of the normal 
flow of speech because of repetitions, prolonga- 
tions of sounds or syllables, and periods of silence. 
All of these features are involuntary and appear to 
be associated with effortful struggles to avoid the 
behavior. 

There is no universal agreement as to the cause of 
the disorder. Three prominent theories attempt to 
explain the disorder. One assumes that it is a learned 
behavior, another that it represents a psychological 
disorder, and the third that it has an organic basis in 
disordered neurologic function. 

There is no cure for stuttering. However, many 
persons with this disorder can be treated so that the 
degree of dysfluency and the associated distracting 
secondary behaviors (for example, facial twitches or 
body movement) can be significantly reduced. No 
single accepted mode for treatment of stuttering is 
generally accepted. Almost everything from surgery 
of the tongue to psychoanalysis and hypnotherapy 
has been used to try to cure the disorder. Interest- 
ingly, of the children who reportedly stutter prior to 
puberty, only 20% continue to do so after the onset 
of puberty. This is the case even in the absence of 
therapy. 

The approach to treatment is different for chil- 
dren and adults; however, most therapeutic ap- 
proaches fall into one of three types: psychologi- 
cal approaches, modification of the stuttering, and 
modification of speech. In the first, the emphasis is 
on counseling and psychotherapy to alter the stut- 
terer’s self-image and attitude toward the dysfluency 
and to reduce avoidance behaviors. In treatment 


directed at modifying the stuttering itself, the focus is 
on enabling the individual to stutter more efficiently 
and fluently. When treatment is directed at modify- 
ing speech, the goal is to achieve “stutter-free” pro- 
duction by altering the rhythm, rate, and voice. This 
approach draws heavily on the principles of behavior 
modification. Most speech pathologists use a combi- 
nation of these general approaches when working 
with the dysfluent individual. 

Cluttering. This is distinguished from stuttering by 
the fact that it is produced without awareness or con- 
cern on the part of the speaker. Cluttering is charac- 
terized by very rapid production accompanied by 
alterations in articulatory patterns, particularly in- 
volving substitutions and omissions. This combina- 
tion, especially in the case of children, frequently 
renders speech unintelligible. It is often the class- 
room teacher who is the first to detect the prob- 
lem. In adults who have been unrecognized clut- 
terers, it may be an employer, friend, or colleague 
who recognizes the problem. Treatment focuses on 
recognition of the cluttering by the speaker, followed 
by specific techniques to alter rate and articulatory 
patterns. 

Other disorders. Speech rate and rhythm can be sig- 
nificantly impeded by motor speech disorders. Dis- 
eases of the nervous system such as multiple sclero- 
sis or parkinsonism and conditions such as cerebral 
palsy interfere with the normal sensory-motor con- 
trol over the articulatory and rhythmic production 
of speech. Because the etiology of this group of flu- 
ency disorders varies so much, there are no specific 
patterns that can be described. Treatment therefore 
focuses on having the individual attempt to gain as 
much control over production as possible, given the 
limits imposed by the cause. In some cases, such 
as Parkinson’s disease, the use of drugs such as 
t-dopa can have a marked effect on speech pro- 
duction. Muscle relaxants are sometimes used along 
with speech therapy for patients with multiple 
sclerosis. However, in cases of progressive neuro- 
logic disease, treatment can provide only temporary 
respite from the disorder. See MULTIPLE SCLEROSIS; 
PARKINSON’S DISEASE. 

Language. Language disorders involve impairment 
of the ability to voluntarily use or comprehend the 
language of the individual’s linguistic community. 
Such disorders need not involve a concomitant im- 
pairment of speech or voice production. These dis- 
orders can be divided into two groups. In delayed 
language acquisition, normal language development 
fails to occur because of neurologic or intellectual 
problems. Aphasia typically occurs in the adult fol- 
lowing a stroke or traumatic injury to the brain after 
the individual had acquired and was using language 
appropriate to age, gender, and linguistic commu- 
nity. In addition, there has been increasing recog- 
nition of language disorders associated with the 
dementias. The language disorders associated with 
delayed acquisition in children and those seen in in- 
dividuals who had previously acquired adequate lan- 
guage share few similarities, and the approaches to 
treatment are dissimilar. 
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Aphasia. The aphasias can be categorized in a vari- 
ety of ways. Nonfluent aphasics show good compre- 
hension but have difficulty with confrontation nam- 
ing and produce cryptic “telegraphic” speech. Fluent 
aphasics show great language output, often meaning- 
less, with very poor comprehension. However, there 
are some aphasics whose only apparent difficulty is 
in the use of names, and while they have reactively 
good comprehension, their expressive language de- 
pends on circumlocutions and nonspecific words. 
Finally, there are global aphasics who are nonfluent, 
with poor comprehension and repetition, and often 
able only to utter a meaningless syllable. 

Because the event that produces the aphasia usu- 
ally involves more than one region of the brain, the 
aphasic often exhibits other problems such as hemi- 
plegia (paralysis of one side of the body); motor 
speech problems (dysarthria); visual problems such 
as blindness of one visual field Chemianopsia); or 
apraxia (difficulty in programming, planning, initi- 
ating, and sequencing motor behaviors, especially 
those required for articulation). In addition, because 
the onset of the neurologic event producing the 
aphasia is frequently sudden, the aphasic is often 
frightened and devastated. The impact of the disor- 
der is felt not only by the patient but also by family 
members who will have to play a significant role in 
rehabilitation. 

Some spontaneous reduction of the aphasia often 
occurs in the first few days and weeks following the 
onset of the disorder. The amount of improvement 
is governed by the extent of the brain damage, its 
locus, and the patient’s overall condition and moti- 
vation. Most of these changes take place during the 
first 3 months following onset, with some continua- 
tion at a slower pace through the sixth month. How- 
ever, language therapy during this period will en- 
hance the spontaneous recovery process. Continued 
improvement, with the goal of having the aphasic 
reach the best level of communicative function possi- 
ble, requires language therapy by a qualified speech- 
language pathologist. The first step in this process is a 
detailed evaluation of the aphasic’s current language 
status. From this evaluation a treatment plan can be 
developed. Both individual and group treatment ap- 
proaches are helpful. There are no uniform treatment 
prescriptions for aphasia. Techniques such as pro- 
grammed learning, behavior modification, and cog- 
nitive and language stimulation have all been used 
with varying degrees of success. Some have recom- 
mended the use of Amerind (American Indian sign 
language) for severely impaired aphasics. Another ap- 
proach attempts through a series of specific steps 
to achieve improved communication by a program 
known as Promoting Aphasic Communicative Effec- 
tiveness (PACE). There have also been efforts to in- 
corporate personal computers into aphasia rehabil- 
itation. Most of these techniques, as well as others, 
do function to reduce the communicative impair- 
ment associated with aphasia. However, it is unusual 
for aphasics to make a complete recovery, and most 
must learn to function with limited language abili- 
ties. See APHASIA. 
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Childhood disorders. Language disorders of childhood 
are often associated with other disorders, such as 
mental retardation, hearing loss, and autism. How- 
ever, there are some children who show specific 
language impairments. They are considered to be 
normal in other aspects of development, in particu- 
lar, hearing and nonverbal intelligence. Terms uesd 
to categorize the childhood disorders include lan- 
guage delay, developmental aphasia, or language im- 
pairment. In the mentally retarded child, most of 
the linguistic features of the language are slow to 
emerge. However, when they do appear, they are 
similar to those of the normal child. The more ab- 
stract the features, the greater the delay in acquisi- 
tion. Some features may never emerge. This is de- 
pendent in part on the severity of the retardation. 
Autistic children, on the other hand, suffer from 
marked failure to acquire and use language in the or- 
dinary sense. Some use echolalia, unsolicited repet- 
itive or imitative verbal patterns echoing a previ- 
ously heard utterance. Since language is dependent 
upon hearing, children with hearing loss or deaf- 
ness will show mild to profound delays in language 
acquisition, with the degree of impairment relative 
to the degree of hearing loss. See AUTISM; MENTAL 
RETARDATION. 

Therapy for children with language delay often 
focuses on production of specific linguistic struc- 
tures. This may involve modeling the structure for 
the child. Some speech-language pathologists ad- 
vocate a behavioral-learning approach using as- 
pects of behavior modification. Others focus on an 
interpersonal-interactive methodology. Because of 
the variety of problems presented by these children, 
treatment must be tailored to their specific needs 
and environments. 

Dementia. Persons with dementia, such as Alzhei- 
mer’s type or multi-infarct dementia, often show a 
disturbance of language. These disturbances can take 
a variety of forms, such as the inability to recall and 
use names or significantly impaired comprehension 
and language output. Linguistic deficits associated 
with dementia, except in rare instances, are related 
to the deterioration of intellect and memory rather 
than a primary deficit in language function. Treat- 
ment for these deficits is often indirect, working to 
create a language environment that is carefully struc- 
tured to the present level of linguistic and cognitive 
function. Often therapy is directed at helping the 
family and care givers appreciate the problem and 
cope with it. See ALZHEIMER’S DISEASE; SPEECH. 

R. S. Tikofsky 

Bibliography. D. R. Boone, Human Communica- 
tion and Its Disorders, 1987; G. H. Shames, Human 
Communication Disorder, 5th ed., 1998. 
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Speech perception 

A term broadly used to refer to how an individual 
understands what others are saying. More narrowly, 
speech perception is viewed as the way a listener 


can interpret the sound that a speaker produces as 
a sequence of discrete linguistic categories such as 
phonemes, syllables, or words. See PHONETICS; PSY- 
CHOLINGUISTICS. 

Classical work in the 1950s and 1960s concen- 
trated on uncovering the basic acoustic cues that 
listeners use to hear the different consonants and 
vowels of a language. It revealed a surprisingly com- 
plex relationship between sound and percept. The 
same physical sound (such as a noise burst at a par- 
ticular frequency) can be heard as different speech 
categories depending on its context (as “k” before 
“ah,” but as “p” before “ee” or “oo”), and the same 
category can be cued by different sounds in different 
contexts. Spoken language is thus quite unlike typed 
or written language, where there is a relatively invari- 
ant relationship between the physical stimulus and 
the perceived category. 

The reasons for the complex relationship lie in 
the way that speech is produced: the sound pro- 
duced by the mouth is influenced by a number of 
continuously moving and largely independent artic- 
ulators. This complex relationship has caused great 
difficulties in programming computers to recognize 
speech, and it raises a paradox. Computers readily 
recognize the printed word but have great difficulty 
recognizing speech. Human listeners, on the other 
hand, find speech naturally easy to understand but 
have to be taught to read (often with difficulty). 
It is possible that humans are genetically predis- 
posed to acquire the ability to understand speech, 
using special perceptual mechanisms usually located 
in the left cerebral hemisphere. See HEMISPHERIC 
LATERALITY. 

Building on the classical research, the more re- 
cent work has drawn attention to the important con- 
tribution that vision makes to normal speech per- 
ception; has explored the changing ability of in- 
fants to perceive speech and contrasted it with that 
of animals; and has studied the way that speech 
sounds are coded by the auditory system and how 
speech perception breaks down in those with hear- 
ing impairment. There has also been substantial re- 
search on the perception of words in continuous 
speech. 

Hearing and vision. Traditionally, speech per- 
ception has concentrated on the problem of turn- 
ing sound into linguistic categories; but an impor- 
tant influence on the field has been the need to 
integrate visual information about articulation with 
sound. 

Although lip-reading is a commonly recognized 
skill of the hearing-impaired, the contribution that 
is made by visible face movements to normal speech 
perception was not fully realized until H. McGurk 
serendipitously discovered the illusion that has be- 
come known as the McGurk effect. The sound of 
someone saying “bah” was dubbed onto a video 
recording of the same person saying either “dah” 
or “gah.” Listeners who simply heard the sound 
track reported correctly “bah,” but those who also 
saw the face reported that the person had said 
“dah.” 


Vision seems to give the listener clear informa- 
tion that can overrule or combine with the infor- 
mation that is being received by the ear. In this ex- 
ample, vision clearly indicates that the speaker’s lips 
did not close. Since the lips should have closed if a 
“b” had been produced, perception produces a con- 
ciliatory percept that is both compatible with the 
seen information and auditorially similar to the actual 
sound. 

Vision gives very clear information about cer- 
tain speech sounds (primarily those involving the 
lips and jaw); however, it provides very little about 
other sounds and none about the presence or the 
value of voice pitch. The presence of voicing con- 
veys important information about consonants, dis- 
tinguishing “b” from “p” for instance; and the pitch 
contour indicates the position of stressed syllables, 
and varies the semantic and pragmatic value of 
an utterance. Prostheses for the profoundly deaf 
that simply convey information about voice pitch 
can substantially improve their ability to understand 
speech. 

The McGurk effect is much weaker in young 
children than in adults, but very young infants are 
surprisingly sensitive to the relationship between 
speech sounds and the corresponding face move- 
ments. Even at a few months old, they will prefer to 
look at faces that match the sounds they are hear- 
ing than at those that do not. Attention to the visible 
as well as the audible consequences of articulation 
is undoubtedly helpful to the child in learning lan- 
guage. A visually simple gesture, such as closing the 
lips, can influence many different auditory cues; see- 
ing the single gesture may help the child to grasp 
the underlying integrity of the sound. Visually hand- 
icapped children are slower than normal children in 
learning to produce sounds that have a visible artic- 
ulation. 

Perception and development of speech categories. 
Adult listeners are exquisitely sensitive to the differ- 
ences between sounds that are distinctive in their 
language. The voicing distinction in English (be- 
tween “b” and “p”) is cued by the relative timing of 
two different events (stop release and voice onset). 
At a difference of around 30 milliseconds, listeners 
hear an abrupt change from one category to another, 
so that a shift of only 5 ms can change the percept. 
On the other hand, a similar change around a dif- 
ferent absolute value, where both sounds are heard 
as the same category, would be imperceptible. The 
term categorical perception refers to this inability to 
discriminate two sounds that are heard as the same 
speech category. 

Categorical perception can arise for two reasons: it 
can have a cause that is independent of the listener’s 
language—for instance, the auditory system may be 
more sensitive to some changes than to others; or 
it can be acquired as part of the process of learning 
a particular language. The example described above 
appears to be language-independent, since similar 
results have been found in animals such as chin- 
chillas whose auditory systems resemble those of hu- 
mans. But other examples have a language-specific 
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component. The ability to hear a difference be- 
tween “r” and “1” is trivially easy for English listeners, 
but Japanese perform almost at chance unless they 
are given extensive training. How such language- 
specific skills are developed has become clearer fol- 
lowing intensive research on speech perception in 
infants. 

Newborn infants are able to distinguish many of 
the sounds that are contrasted by the world’s lan- 
guages. Their pattern of sucking on a blind nipple 
signals a perceived change in a repeated sound. They 
are also able to hear the similarities between sounds 
such as those that are the same vowel but have dif- 
ferent pitches. The ability to respond to such a wide 
range of distinctions changes dramatically in the first 
year of life. By 12 months, infants no longer respond 
to some of the distinctions that are outside their na- 
tive language, while infants from language communi- 
ties that do make those same distinctions retain the 
ability. Future experience could reinstate the ability, 
so it is unlikely that low-level auditory changes have 
taken place; the distinctions, although still coded by 
the sensory system, do not readily control the infant’s 
behavior. 

Listening in the speech mode. Individual speech cat- 
egories (such as “p” or “b”) are cued by a wide 
variety of different aspects of sound. For example, 
“b” has a longer preceding vowel and a shorter si- 
lence than does “p.” In addition, when the lips open, 
the initial burst of noise energy has a lower ampli- 
tude; and when voicing starts the peak of the low- 
frequency energy is at a lower frequency for “b” 
than for “p.” Experiments with synthetic speech 
show that these different cues can be traded off 
against each other: a longer silence can be compen- 
sated for by a lower first format onset frequency. 
Such so-called trading relations between these differ- 
ent cues may sometimes be explained by basic sen- 
sory mechanisms such as adaptation in the auditory 
nerve, but others require more specialized mecha- 
nisms specific to speech. A particularly useful way 
of demonstrating this is to synthesize a caricature 
of speech known as sine-wave speech; it consists of 
sine waves at the formant frequencies. Short utter- 
ances of sine-wave speech can be heard either as non- 
speech whistles or as (peculiar) speech. Some of the 
trading relations between cues in normal speech are 
also found in sine-wave speech, but can be shown 
to arise only when the listener hears the sound as 
speech. They are therefore a result of special speech- 
processing mechanisms, rather than purely auditory 
ones. 

Words and continuous speech. Continuous speech 
is not simply a concatenation of isolated syllables; 
syllables change their pitch, their duration, and their 
phonetic character in continuous speech. In extreme 
circumstances, they can vanish or be completely 
transformed—for example, “Do you want to?” be- 
coming “Djawona?” Although such changes make it 
more difficult to understand how the lexicon can 
be accessed from the speech waveform, the changes 
in duration and pitch convey essential information 
about words, grammar, and intention. 
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Under good listening conditions and with clearly 
articulated speech, it is often possible to identify 
words before they have been completed verbally. A 
word such as “penguin” becomes identifiable in prin- 
ciple before the final vowel, since there is no other 
word in English that begins the same way (except 
for derived forms such as “penguins”). In practice, 
listeners can identify isolated words at their unique- 
ness points, but may not do so in normal speech 
where the necessary phonetic information can be 
absent or distorted. Subsequent context can then 
be used to limit the possibilities before the word is 
“heard.” Speakers adjust the clarity of their speech 
depending on the context, relying on listeners’ abil- 
ity to use that context to make up for the decreased 
information in the signal. Stressed syllables are most 
resistant to such adulteration; they may serve as start- 
ing points for the perceptual process and, in En- 
glish at least, as candidates for the beginnings of 
words. 

The hearing-impaired. Although conductive hear- 
ing losses can generally be treated adequately by ap- 
propriate amplification of sound, sensorineural hear- 
ing loss involves a failure of the frequency-analyzing 
mechanism in the inner ear that humans cannot yet 
compensate for. Not only do sounds need to be 
louder before they can be heard, but they are not 
so well separated by the ear into different frequen- 
cies. Also, the sensorineurally deaf patient tolerates 
only a limited range of intensities of sound; amplified 


sounds soon become unbearable (loudness recruit- 
ment). 

These three consequences of sensorineural hear- 
ing loss lead to severe problems in perceiving a com- 
plex signal such as speech. Speech consists of many 
rapidly changing frequency components that nor- 
mally can be perceptually resolved. The illustration 
shows a spectrogram of speech that has been put 
through a filter bank similar to that found in the nor- 
mal human ear. In the low-frequency region the indi- 
vidual harmonics of the voice are separated; at higher 
frequencies the ear cannot resolve individual har- 
monics, but it still separates the different formants 
(or resonances of the vocal tract). 

The lack of frequency resolution in the sensorineu- 
ral patient makes it harder for the listener to identify 
the peaks in the spectrum that distinguish the sim- 
plest speech sounds from each other; and the use 
of frequency-selective automatic gain controls to al- 
leviate the recruitment problem reduces the distinc- 
tiveness of different sounds further. These patients 
may also be less sensitive than people with normal 
hearing to sounds that change over time, a disability 
that further impairs speech perception. 

Although normal listeners can still recognize 
speech against a background of other noises (in- 
cluding speech, as in the cocktail-party effect), sen- 
sorineural patients are disproportionately affected 
by competing sounds. Their impaired frequency and 
temporal selectivities reduce their ability to separate 


oe M | | E Hr A int N le 


(a) time, s 


























|w IA o 
| 





N 
| 


frequency, kHz 








(b) time, s 


An auditory spectrogram of the world “telephone.” (a) The vertical striations in the bands represent amplitude modulation at 
the frequency of the voice pitch. Such striations are absent from the initial noise portion corresponding to the voiceless 
initial “t.” The medial “f” is too weak to appear on the spectrogram. (b) The first eight or so harmonics of the voice pitch are 
resolved and appear as dark, roughly horizontal bands. Higher harmonics are not resolved (here above about 1.5 kHz). The 


dark horizontal bands now represent the formant frequencies. 


the required signal from its background. Hearing aids 
that can listen in to sound from a particular direc- 
tion can help to filter out unwanted sounds, but 
they do not perform as well when echoes and re- 
verberation are present. A practical solution to this 
problem has not been developed; however, it is pos- 
sible in artificial situations to separate the speech 
of one speaker from another on the basis of voice 
pitch, so that a deaf listener can hear the selected 
voice more clearly. The same technique is also used 
by normal listeners as one way of solving the prob- 
lem of competing background sounds. Extension of 
this method to normal situations is complicated by 
the fact that not all of speech has a pitch. Other, 
more complex information must be used to separate 
voiceless sounds and to link the voiced segments 
appropriately. 

Some profoundly deaf patients can identify some 
isolated words by using multichannel cochlear im- 
plants. Sound is filtered into different frequency 
channels, or different parameters of the speech are 
automatically extracted, and electrical pulses are 
then conveyed to different locations in the cochlea 
by implanted electrodes. The electrical pulses stim- 
ulate the auditory nerve directly, bypassing the in- 
active hair cells of the damaged ear. Such devices 
cannot reconstruct the rich information that the nor- 
mal cochlear feeds to the auditory nerve, but they 
promise to put profoundly deaf patients back into 
contact with the world of sound and to help them to 
communicate more effectively by speech. See HEAR- 
ING (HUMAN); HEARING AID; HEARING IMPAIRMENT; 
PERCEPTION; PSYCHOACOUSTICS; SPEECH. 

C. J. Darwin 
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Speech recognition, in a strict sense, is the process 
of electronically converting a speech waveform (as 
the acoustic realization of a linguistic expression) 
into words (as a best-decoded sequence of linguistic 
units). At times it can be generalized to the process 
of extracting a linguistic notion from a sequence of 
sounds, that is, an acoustic event, which may en- 
compass linguistically relevant components, such as 
words or phrases, as well as irrelevant components, 
such as ambient noise, extraneous or partial words 
in an utterance, and so on. Applications of speech 
recognition include an automatic typewriter that re- 
sponds to voice, voice-controlled access to informa- 
tion services (such as news and messages), and auto- 
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mated commercial transactions (for example, price 
inquiry or merchandise order by telephone), to name 
a few. Sometimes, the concept of speech recogni- 
tion may include “speech understanding,” because 
the use of a speech recognizer often involves un- 
derstanding the intended message expressed in the 
spoken words. Currently, such an understanding pro- 
cess can be performed only in an extremely limited 
sense, often for the purpose of initiating a particular 
service action among a few choices. For example, 
a caller’s input utterance “Id like to borrow money 
to buy a car” to an automatic call-routing system of 
a bank would connect the caller to the bank’s loan 
department. 

Fundamental concept. Converting a speech wave- 
form into a sequence of words involves several essen- 
tial steps (Fig. 1). First, a microphone picks up the 
acoustic signal of the speech to be recognized and 
converts it into an electrical signal. A modern speech 
recognition system also requires that the electrical 
signal be represented digitally by means of an analog- 
to-digital (A/D) conversion process, so that it can be 
processed with a digital computer or a microproces- 
sor. This speech signal is then analyzed (in the analy- 
sis block) to produce a representation consisting of 
salient features of the speech. The most prevalent 
feature of speech is derived from its short-time spec- 
trum, measured successively over short-time win- 
dows of length 20-30 milliseconds overlapping at 
intervals of 10-20 ms. Each short-time spectrum is 
transformed into a feature vector, and the temporal 
sequence of such feature vectors thus forms a speech 
pattern. Figure 2 illustrates the speech analysis pro- 
cess. 

The speech pattern is then compared to a store of 
phoneme patterns or models through a dynamic pro- 
gramming process in order to generate a hypothesis 
(or a number of hypotheses) of the phonemic unit 
sequence. (A phoneme is a basic unit of speech and 
a phoneme model is a succinet representation of the 
signal that corresponds to a phoneme, usually em- 
bedded in an utterance.) A speech signal inherently 
has substantial variations along many dimensions. 
First is the speaking rate variation—a speaker can- 
not produce a word of identical duration at will. Sec- 
ond, articulation variation is also abundant, in terms 
both of talker-specific characteristics and of the man- 
ner in which a phoneme is produced. Third, pro- 
nunciation variation occurs among different speak- 
ers and in various speaking contexts (for example, 
some phonemes may be dropped in casual conversa- 
tion). Dynamic programming is performed to gener- 
ate the best match while taking these variations into 
consideration by compressing or stretching the tem- 
poral pattern (a process often called dynamic time 
warping) and by probabilistically conjecturing how 
a phoneme may have been produced. The latter in- 
cludes the probability that a phoneme may have been 
omitted or inserted in the utterance. The knowledge 
of probability (often called a probabilistic model of 
speech) is obtained via “training,” which computes 
the statistics of the speech features from a large 
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Fig. 1. Basic block diagram of a speech recognition system. 


collection of spoken utterances (of known identity) 
according to a mathematical formalism. At the pres- 
ent time, the most widely adopted formalism is the 
hidden Markov model. 

The hypothesized phoneme sequence is then 
matched to a stored lexicon to reach a tentative de- 
cision on the word identity. The decoded word se- 
quence is further subject to verification according to 
syntactic constraints and grammatical rules, which in 
turn define the range of word hypotheses for lexical 
matching. This process of forming hypotheses about 
words and matching them to the observed speech 
pattern, and vice versa, in order to reach a decision 
according to a certain criterion is generally referred 
to as the “search.” In limited domain applications in 
which the number of legitimate expressions is man- 
ageably finite, these constraints can be embedded in 
an integrated dynamic programming process to re- 
duce search errors. 

Categorization of tasks. Most current speech recog- 
nition systems are essentially designed to convert 
a speech waveform into one or more words in 
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order to perform a specific task. Linguistic expres- 
sions via speech can take the form of word utter- 
ances spoken individually in isolation, or phrases 
and sentences spoken naturally without intention- 
ally controlled word boundaries. A task involving 
the former case is called isolated (or discrete) word 
recognition, while the latter case is generally re- 
ferred to as continuous speech recognition. How- 
ever, the speaker’s articulation effect can be sub- 
stantially different when reading a prepared text and 
when conversing spontaneously with another per- 
son. Also at times, a task may involve connected 
words without the usual syntactic constraints, such 
as a digit sequence (for example, one’s credit card 
number). Thus, a speech recognition task can be cat- 
egorized, in terms of the way that speech is rendered, 
into four essential types: isolated word, connected 
word, continuously read speech, and spontaneous 
speech. 

The degree of sophistication of a speech recogni- 
tion task is largely a function of the size of the vocab- 
ulary it has to deal with. A task involving, say, less 


Speech 


apl LA AAAI CTA cA Aad, 





shifting 
data 
window 


ty aT RTT FE TTA 


viv LAA 





=> 
time 


Spectral sequence 





frequency 


Fig. 2. Short-time spectral analysis of speech. 





than 100 words is called small vocabulary recogni- 
tion and is mostly for command-and-control applica- 
tions with isolated word utterances as input. There 
are usually very few grammatical constraints asso- 
ciated with these types of limited tasks. When the 
vocabulary size grows to 1000 words, it is possi- 
ble to construct meaningful sentences, although the 
associated grammatical rules are usually fairly rigid. 
For dictation, report writing, or other tasks such as 
newspaper transcription, a speech recognition sys- 
tem with a large vocabulary (on the order of tens of 
thousands of word entries) is needed. 

A speaker-independent recognition system is de- 
signed to serve nonspecific users and is suitable for 
public service applications where the user popula- 
tion is large. A voice dialing system embedded in 
a telephone central office switch, a telephone call- 
routing system, and a customer service unit that 
can respond to a caller’s voice without having to 
know the user’s identity are all in this category. It 
has to be able to handle a broad range of accents 
and voice characteristics from various talkers. An- 
other type of speech recognizer, called a speaker- 
dependent recognition system, may be designed to 
serve only a specific user; for example, a personal 
word-processor that responds to its authorized user’s 
voice. A speaker-dependent system requires speaker- 
specific training and is thus also referred to as a 
speaker-trained system. Such a system can also be 
designed by adapting a speaker-independent system 
to the voice patterns of the designated user via an 
enrollment procedure; it is often called a speaker- 
adaptive system. For a limited number of frequent 
users of a particular speech recognizer, such an en- 
rollment procedure generally leads to good recog- 
nizer performance. 

Understanding of speech. Another type of speech 
recognition system has to do with the fundamental 
idea of task-oriented speech understanding. Using 
the popular call-routing system that sorts operator- 
assisted calls into five types (credit-card, collect, 
third-party billing, person-to-person, and operator) 
as an example, a recognition system design engineer 
can use a large-vocabulary continuous speech recog- 
nition system to convert the request utterance (for 
example, “I'd like to make a collect call”) into a word 
sequence as best it can and then employ a natural lan- 
guage processing system to parse and understand the 
intent of the caller. Alternatively, the system designer 
can make the assumption that in such a limited task 
the intent of the caller will most likely be expressed 
in a “keyword” (for example, the word “collect” in 
the above sentence) embedded in the otherwise nat- 
urally spoken sentence. A system that can detect or 
spot the keyword would thus suffice for the task. 
This alternative approach is referred to as keyword 
spotting. 

The keyword-spotting system has proved to be 
very useful, because it makes the interaction be- 
tween the user and the machine natural and ro- 
bust. Experience shows that many users of a speech 
recognition system in telecommunication applica- 
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tions often do not speak words or phrases according 
to the prescribed vocabulary, thus creating out-of- 
vocabulary (OOV) or out-of-grammar (OOG) errors. 
Rather than attempting to recognize every word in 
the utterance, a keyword-spotting system hypothe- 
sizes the presence of a keyword in appropriate por- 
tions of the speech utterance and verifies the hy- 
pothesis by a likelihood ratio test. By avoiding forced 
recognition decisions on the unrecognizable and in- 
consequential regions of the speech signal, the sys- 
tem can cope with natural sentences, obtaining sat- 
isfactory results in constrained tasks. A more sophis- 
ticated keyword spotting system may involve “gist- 
ing,” that is, trying to compile the detected keywords 
into a possible message. Similarly, it is possible to 
perform a primitive analysis on the decoded words 
based on their correlation with a particular seman- 
tic notion in order to derive an understanding for 
limited domain applications. 

Implementation. Realization of a speech recogni- 
tion system involves two essential steps: training 
and decoding. The objectives of training include 
reference models of the phonetic units, which 
characterize (probabilistically) how linguistic units 
are acoustically realized (acoustic modeling), and 
probabilistic relationships among words that form 
grammati-cally legitimate sentences of a language 
C(anguage modeling). These acoustic unit models and 
language models are the basis of reference for infer- 
ring a decision when given an unknown input utter- 
ance. Training is usually performed off-line in non- 
real time. 

The hidden Markov model is the prevalent choice 
for the reference structure in acoustic modeling. A 
hidden Markov model is a doubly stochastic process 
involving (at least) two levels of uncertainty. It is a fi- 
nite state (say N-state) machine in which each state is 
associated with a random process. The random pro- 
cess in each state accounts for the variation in the 
realization of a speech code (such as a phoneme), 
including both the spectral (pertaining to the vo- 
cal tract) and the temporal (pertaining to the rate 
of articulation) behavior. The finite state machine is 
a Markov chain, which characterizes the probabilis- 
tic relationship among the states in terms of how 
likely one state is to follow another state. Figure 3 
illustrates a three-state hidden Markov model. See 
STOCHASTIC PROCESS. 

Training of a hidden Markov model requires a large 
collection of sample utterances, each with a known 
label. In the case of isolated-word recognition, each 
utterance would consist of a word. The identity or la- 
bel of the utterance is usually manually obtained by a 
human listener. For continuous speech recognition, 
each utterance in the training set would be a sen- 
tence, represented by a manually annotated word 
sequence. Model training involves a procedure in 
which locating the proper unit boundaries in the ut- 
terance (segmentation) and adjusting the model pa- 
rameter values for each encountered models (model 
estimation) interleave and iterate until a certain cri- 
terion is optimized. 
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Fig. 3. Three-state hidden Markov model. 


A similar principle applies in the case of language 
modeling, which is imperative in large-vocabulary 
continuous speech recognition. Language modeling 
is usually performed on a text database consisting 
of tens or hundreds of millions of words, sufficient 
to allow derivation of the probabilistic characteris- 
tics of a language (for example, the likelihood of a 
particular word appearing after another). 

Decoding is the process of evaluating the probabil- 
ities that the unknown utterance manifests the same 
characteristics as each of the stored references and 
inferring the identity of the utterance based on the 
evaluated probabilities. For large-vocabulary, contin- 
uous speech recognition, the reference models (such 
as sentence hypotheses) are constructed from unit 
models and can be prohibitively large in number. 
Figure 4 shows an example of representing a hy- 
pothesized sentence (“Show all alerts”) by a se- 
quence of context-dependent phoneme units. Effi- 
cient search algorithms need to be invoked such that 
the decoding result can be obtained within a reason- 
able computational bound. For a recognition system 
to be useful, it needs to be able to produce the recog- 
nition result without obvious latency, preferably in 
real time. 


Except for some extremely simple tasks, a speech 
recognition system is implemented as a stored pro- 
gram for execution with a computer or a digital 
signal processor. The reference models are stored 
in read-only devices, with easy data access for de- 
coding purposes. For telecommunication applica- 
tions in which the system often handles multichan- 
nel traffic or sessions, implementation efficiency is 
achieved by sharing the reference models and proper 
division of the computational task among multiple 
channels. 

Difficulties and remedies. Speech recognition re- 
search has made great progress since 1970. However, 
the current technology is still far from achieving its 
ultimate goal: accurate recognition of unconstrained 
speech from any speaker in any environment or op- 
erating condition. There are several key reasons why 
this remains a challenging problem. 

One primary difficulty in speech recognition is 
that the variation in articulation is difficult to char- 
acterize. While phonemes are considered the funda- 
mental speech units demonstrating regular acoustic- 
phonetic behavior (as determined by human ears), 
their acoustic realization usually contains ambigu- 
ity and uncertainty. A phoneme recognition sys- 
tem that attempts to convert a speech waveform 
into a phoneme sequence without incorporating 
additional knowledge, such as vocabulary entries 
and syntax, cannot achieve high accuracy. Grouping 
these error-prone phoneme or subword sequences 
into a sequence of words from a standard lexicon is 
a nontrivial task. Units as large as words and phrases 
have been suggested for limited task environments 
with good success. Large speech units, however, are 
unsuitable for large-vocabulary, continuous speech 
recognition. The longer the basic unit, the more 
the number of unit-classes there will be for the sys- 
tem to deal with. (Consider the extreme case of 
using the sentence as the unit in continuous speech 
recognition. The number of possible sentences be- 
comes astronomical even with a moderate vocabu- 
lary size.) Having more unit classes also means higher 
complexity in training and decoding. For practical 
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Fig. 4. Sentence representation of “show all alerts” with a finite state network using context-dependent units; “sil” denotes 


silence. 


reasons, subword units such as phonemes, diphones 
(pairs of phonemes), or syllables, which are lim- 
ited in number almost independent of the vocab- 
ulary size, are preferred. To circumvent the articu- 
lation problem, modern large-vocabulary, continu- 
ous speech recognition systems almost universally 
use context-dependent unit models, an example of 
which is shown in Fig. 4. A context-dependent 
phoneme model is a statistical characterization of 
a phoneme when uttered in a particular context; 
for example, a phoneme unit /a/ that follows a /n/ 
and precedes a /p/, as in “snap,” is a member of 
the unit classes with model denoted by ‘n-a-p.’ This 
class is treated as a class different from, say, ‘m-a-d’ 
in “mad.” Context dependency helps reduce the de- 
gree of acoustic-phonetic variation within the class 
and often leads to higher recognition accuracy. 

Another major difficulty is the variation in the char- 
acteristics of the transmission medium over which 
recognition is performed (for example, over a tele- 
phone line, in an airplane cockpit, or using a hands- 
free speakerphone). This kind of variation includes 
the acoustic background noise, the various distor- 
tions inherently caused by the microphones that are 
used to pick up the acoustic signal, reverberation 
that varies substantially from one room to another, 
and the transmission distortion existing in the tele- 
phone network. To cope with this type of variation, 
assumptions, such as that the noise is additive and 
that the distortion is linear, are usually made. Another 
implicit assumption about the characteristics of the 
interference is that they do not change as rapidly 
as speech. For example, in telephone applications, 
most of the transmission properties are considered 
virtually constant over a reasonable period of time 
once the phone circuit connection is made. Signal 
processing methods to remedy these adverse effects 
include spectral subtraction to reduce the impact of 
additive noise, and cepstral bias or mean removal to 
suppress the influence of spectral distortions. The 
principle behind spectral subtraction and cepstral 
bias removal is quite similar. Spectral subtraction op- 
erates in the power spectrum domain and assumes 
that the power spectrum of the received signal is the 
sum of the speech spectrum and the noise spectrum. 
Cepstral bias removal operates in the log power spec- 
trum domain and hence assumes that the log power 
spectrum of the received signal is the sum of the log 
power spectrum of the speech and the log frequency 
response of the transmission channel. In both cases, 
the system estimates the additive noise/bias com- 
ponent from an average of the signal (in the cor- 
responding domain taken over a relatively long pe- 
riod, perhaps the whole utterance, based on the 
aforementioned assumption on the slow-changing 
nature of these adverse effects) and subtracts the 
bias estimate from each of the short-time speech 
representations that form the speech pattern of the 
utterance. 

Talker variation is yet another source of difficulty. 
Variations due to the talker include regional accents, 
articulation and pronunciation, and mode and mood 
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of speaking. Talker adaptation provides means for the 
system to adjust its operating parameters, such as the 
stored reference models, to suit the particular user. 
However, very limited knowledge exists at present 
on the effect of psycholinguistic factors, such as the 
contextual or acoustic environment (for example, a 
noisy café) in which the speech conversation takes 
place, upon the manifestation of acoustic-phonetic 
features in the speech signal. Some studies exist on 
the Lombard effect, which is used to describe the 
observation of change in the talker’s articulation ef- 
fort when he or she speaks in a noisy environment 
(for example, trying to raise the voice or to make his 
or her voice better understood by the listener). 

Similarly, studies to establish systematic under- 
standing of the difference between read and spon- 
taneous speech are just beginning. The difference 
is intuitively obvious, but the science behind it is 
not. Unconstrained automatic speech recognition by 
a machine that converses with people like a human 
is still a challenging undertaking. 

State of the art. Speech recognition by the end of 
the twentieth century achieved modest success in 
limited applications, particularly with systems de- 
signed to recognize small- to moderate-size vocab- 
ularies for well-defined tasks (such as word process- 
ing and document editing on a personal computer 
using voice, dictating a readout in a radiologist’s of- 
fice, and telephone dialing by speaking the name or 
the number). 

A summary of typical speech recognition perfor- 
mance in terms of word error rate is provided in 
Tables 1 and 2. Table 1 shows the average word 
error rate for recognition of isolated words without 
contextual constraints for both speaker-dependent 
(SD) and speaker-independent (SI) systems. In gen- 
eral, the performance is more sensitive to vocabu- 
lary complexity than to vocabulary size. The 39-word 
alpha-digits vocabulary (letters A-Z, digits 0-9, three 
command words) contains highly confusable word 
subsets, such as the E-set (B, C, D, E, G, P, T, V, D 
and the digit 3, and has an average accuracy of 93- 
96%. In comparison, a task involving an 1109-word 
basic English vocabulary has an average accuracy 
of 96%. 

For connected-word recognizers applied to tasks 
with various types of syntactic or grammatical con- 
straints as in natural language, an additional param- 
eter called perplexity is needed to indicate roughly 
the degree of difficulty of the task. Perplexity is de- 
fined as the average number of words that can follow 
an arbitrary word. It is usually substantially less than 
the size of the vocabulary because of the constraints 
of the task-specific language. Table 2 shows the typi- 
cal performance, again in terms of word accuracy, of 
connected and continuous speech recognition sys- 
tems for various tasks. 

The resource management task involves a highly 
stylized language used in military duties with a 
perplexity of no more than 60. Sentences in the 
air travel information system are usually simple 
query utterances aiming at flight information such as 
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TABLE 1. Performance of isolated word recognition systems 
Recognition task Vocabulary size Mode Word accuracy 
Digits (0-9) 10 SI ~100% 
Voice dialing 37 SD 100% 
Alpha-digits and command words 39 SD 96% 
SI 93% 
Computer commands 54 SI 96% 
Air travel words 129 SD 99% 
SI 97% 
City names (Japanese) 200 SD 97% 
Basic english 1109 SD 96% 








airfare, flight time, and the origin and destination of 
the trip. The vocabulary includes a number of city 
or airport names in the task database. While the ap- 
plication language appears to be natural, the gram- 
matical structure in the query sentence is actually 
rather limited with a perplexity of less than 25. In 
the business newspaper transcription task, the input 
is “read” speech; speakers read the text of news ar- 
ticles in the paper. Read speech has less variability 
in its acoustic realization, and is thus easier to recog- 
nize than spontaneous, conversational speech. The 
broadcast news transcription task involves “found” 
speech, such as the news announcements from radio 
or television. The signal may be degraded by noise 
(such as background music or conversation) and dis- 
tortion (for example, due to radio transmission). Ob- 
viously, this is a more difficult task than transcribing 
a news article in a quiet office. 

Despite progress, the performance of current sys- 
tems is barely acceptable for large vocabulary sys- 
tems. Tasks listed in the tables are all considered sub- 
stantially easier than transcribing and understanding 
truly spontaneous conversational speech. 

Speaker recognition. The technology of speech 
recognition often finds applications in speaker recog- 
nition tasks as well. Speaker recognition applications 
can be classified into two essential modes, speaker 
identification and speaker verification. The goal of 
speaker identification is to use a machine to find 
the identity of a talker, in a known population of 
talkers, using the speech input. Speaker verifica- 
tion aims to authenticate a claimed identity from 
the voice signal. These two modes call for differ- 
ent methodologies. Applications of speaker recogni- 
tion include controlled entry to restricted premises, 
and authorization of access to privileged informa- 
tion or other service transactions. For these applica- 
tions, speaker verification is used in most practical 
systems. 


There are also two types of talker authentication 
with regard to the underlying technology. Whereas 
speaker verification (SV) verifies a speaker’s identity 
based on speaker-specific characteristics reflected in 
spoken words or sentences (sometimes referred to 
as a talker’s voice print), another method, known 
as verbal information verification (VIV), checks the 
content of the spoken password or pass-phrase [such 
as a personal identification number (PIN), a social se- 
curity number, or a mother’s maiden name]. In the 
deployment of a VIV system, the user assumes the 
responsibility in protecting the confidentiality of 
the password as in many electronic transactions ap- 
plications. 

Research aimed at finding acoustic and linguistic 
features in the speech signal that carry the character- 
istics of the speaker has a long history. The advent 
of a statistical pattern matching approach (such as 
the hidden Markov model) and its dominance in the 
1990s has caused a convergence in the technological 
approach to this age-old problem. Statistical model- 
ing methods are data-driven and are able to automat- 
ically “learn” speaker-specific (for SV) or password- 
specific (for VIV) characteristics from the provided 
spoken utterances with a known identity. During ver- 
ification, the talker first makes an identity claim, such 
as a name or an account number, and then, in re- 
sponse to the system prompt or question, speaks a 
test phrase to the system. The system evaluates the 
input speech against the stored model for the person 
with the claimed identity Gn SV) or the stored model 
for the password (in VIV), and decides to accept or 
reject the talker. 

Key factors affecting the performance of a speaker 
verification system are the nature of input (whether 
the utterance is based on a specific text and how 
long the test utterance is) and the operating envi- 
ronment in which the verification system is used. 
The performance of a verification system is usually 





continuous speech recognition tasks 


TABLE 2. Performance benchmark of state-of-the-art automatic speech recognition systems for various connected and 





Recognition task 


Connected digit strings 10 
Resource management 991 
Air travel information system 1,800 
Business newspaper transcription 64,000 


Broadcast news transcription 64,000 





Vocabulary size 


Perplexity Word accuracy 
10 ~99% 
<60 97% 
<25 97% 
<140 94% 
<140 86% 








measured by the equal error rate (EER), which is 
defined as the error rate of the system when the op- 
erating threshold for the accept/reject decision is ad- 
justed such that the probability of false acceptance 
and that of false rejection become equal. A state-of- 
the-art system using a text-dependent sentence-long 
test utterance is capable of achieving an EER of 1- 
2%. With short, unconstrained spoken utterances in 
a noisy environment, current systems would have an 
EER of 4-8%. 

The performance of a VIV system depends on how 
many passwords or pass phrases the user is asked to 
give as input to the system in a verification session. 
Experiments showed that a VIV system could achieve 
perfect (0 error) verification after three rounds of 
question and answer. 

The deployment of speaker verification often 
involves nontechnical issues, such as the trade-off 
between false rejection and false acceptance. In busi- 
ness applications, such as an automated banking ma- 
chine where a false rejection may mean unjustified 
inconvenience causing the loss of a legitimate long- 
term customer, the operating threshold may be set 
to allow a high false-acceptance rate. Determination 
of the operating point is thus a matter of business de- 
cision. See LINGUISTICS; PSYCHOACOUSTICS; SPEECH. 

Biing Hwang Juang 

Bibliography. J. L. Flanagan, Speech Analysis, Syn- 
thesis, and Perception, Springer-Verlag, New York, 
1972; S. Furui, Digital Speech Processing, Synthesis, 
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B. H. Juang and S. Furui (eds.), Spoken Language 
Processing, special issue of IEEE Proc., 2000; L. R. 
Rabiner and B. H. Juang, Fundamentals of Speech 
Recognition, Prentice Hall, Englewood Cliffs, NJ, 
1993; L. R. Rabiner and B. H. Juang, An introduction 
to hidden Markov models, IEEE Signal Proc. Mag., 
3(1):4-16, 1986. 


SS 
Speed 
The time rate of change of position of a body without 
regard to direction. It is the numerical magnitude of 
a velocity and hence is a scalar quantity. Linear speed 
is commonly measured in such units as meters per 
second, miles per hour, or feet per second. It is the 
most frequently mentioned attribute of motion. 
Average linear speed is the ratio of the length of 
the path As traversed by a body to the elapsed time 
At during which the body moved through that path, 
as in Eq. (1), where so and f are the initial position 


Sf —So _ As 
ty—t) At 





Speed (average) = d) 


and time, respectively, s; and ¢; are the final position 
and time, and A stands for “the change in.” 

Instantaneous speed, defined by Eq. (2), is the lim- 
Speed Ginstant J= l As ds (2) 

eed (instantaneous) = lim — = — 

p At>0 At dt 
iting value of the foregoing ratio as the elapsed time 
approaches zero. See VELOCITY. Rogers D. Rusk 
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Speed regulation 


The change in steady-state speed of a machine, ex- 
pressed in percent of rated speed, when the load on 
the machine is reduced from rated value to zero. The 
definition of regulation usually means the net change 
in a steady-state characteristic, and does not include 
any transient deviation or oscillation that may occur 
prior to reaching the new operation point. This same 
definition is used for stating the speed regulation of 
electric motors and for certain prime movers, such 
as steam turbines. 

In specifying the speed regulation of dc motors, for 
example, one would compute the steady-state regu- 
lation R by Eq. (1), where Np is the speed (in any 

No — Nr 
R= — (100) percent d) 

Nr 

convenient units) at rated load and No is the speed 
(in the same units used to specify Np) after the load 
is removed and a new steady-state value is reached. 
Regulation is usually stated as a positive quantity, 
even though the regulation characteristic of speed 
versus load, when plotted as a straight line, would 

have a negative slope. 

For small load changes about an operating point, 
an incremental steady-state regulation is also defined 
by Eq. (2), where both the speed N and power P 


_ AN 


R= RP (100) (2) 


are normalized by using rated values as the base 
quantities. Equation (2) recognizes the fact that the 
speed-versus-power regulation characteristic is not 
necessarily a straight line. In computing the regula- 
tion it should also be specified that all adjustments 
to the speed-changing mechanism should remain un- 
changed throughout the test. See MOTOR. 

The speed regulation characteristic of steam tur- 
bines driving synchronous generators is an impor- 
tant parameter in the control of frequency for large 
power systems. Here the speed regulation, often 
called the droop characteristic, determines not only 
the overall systems frequency response characteris- 
tic to load changes but also how any load change 
is shared by the connected turbine-generator units. 
The droop characteristic is important in estimat- 
ing steady-state response, but this parameter enters 
prominently into the determination of transient be- 
havior as well. See GENERATOR. 

The definition of steady-state regulation (1) is also 
used in the specification of the performance of auto- 
matic control systems, where it is used as the mea- 
sure of goodness of a controlled variable, such as 
speed. See CONTROL SYSTEMS. Paul M. Anderson 

Bibliography. N. Cohn, Control of Generation and 
Power Flow on Interconnected Power Systems, 
1966; IEEE Power Engineering Society, Power Gener- 
ation Committee, Recommended Practice for Func- 
tional and Performance Characteristics of Con- 
trol Systems for Steam Turbine-Generator Units, 
IEEE Std. 122-1991; IEEE Standard Dictionary of 
Electrical and Electronics Terms, 7th ed., IEEE Std. 
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E) 
Speedometer 


A device for indicating the speed of a vehicle. The 
mechanical analog speedometer (Fig. 1) was intro- 
duced around 1906, when automobiles were be- 
ginning to increase in numbers. While this type of 
speedometer continues in wide use, changing tech- 
nology has resulted in increasing numbers of quartz 
electric analog speedometers and digital micropro- 
cessor speedometers. 

Mechanical speedometer. The mechanical speed- 
ometer is driven by a cable housed in a casing and 
connected to a gear at the transmission. This gear is 
designed for the particular vehicle model, consider- 
ing the vehicle’s tire size and rear axle ratio. 

In most cases, the speedometer is designed to con- 
vert 1001 revolutions of the drive cable into register- 
ing 1 mi on the odometer, which records distance 
traveled by the vehicle. Thus, 1001 revolutions of 
the cable per minute will indicate a speed of 60 mi/h 
(96 km/h). Odometers with six wheels record 
99,999 mi (or kilometers), while those with seven 
wheels record 999,999 mi (or kilometers). 

The speed-indicating portion of the speedome- 
ter operates on the magnetic principle. In the 
speedometer head, the drive cable attaches to a re- 
volving permanent magnet that rotates at the same 
speed as the cable. Floating on bearings between 
the upper frame and the revolving permanent mag- 
net is a nonmagnetic movable speed cup. (There is 
almost no mechanical connection between the re- 
volving magnet and the speed cup.) The magnet 
revolves within the speed cup, producing a rotat- 
ing magnetic field. The magnetic field is constant, 
and the amount of speed cup movement is at all 
times in proportion to the speed of the magnet ro- 
tation. The magnetic field exerts drag on the speed 
cup, causing it to revolve in the same direction and 
in direct ratio to the speed of the revolving mag- 
net and the vehicle itself. A pointer, attached to 
the speed cup spindle, indicates the speed on the 





transmission 


drive shaft 


Fig. 1. Components of a mechanical speedometer system. 


speedometer dial. The speed cup movement is op- 
posed by a hair spring, so that the speed cup is held 
steady and the pointer indicates the true speed. The 
hair spring also pulls the pointer back to a point 
less than 5 mi/h (8 km/h) when the magnet stops 
rotating. 

Quartz electric speedometer. The quartz electric 
speedometer and odometer assembly (Fig. 2) 
displays the speed of the vehicle, total vehicle 
mileage, and trip mileage. A conventional (analog) 
dial and pointer assembly, along with odometer 
wheels, are used to display this information. How- 
ever, due to improved drive methods, the need for 
the conventional speedometer (flexible shaft) cable 
is eliminated. 

To provide accurate vehicle information, the 
quartz speedometer utilizes an accurate clock signal 
supplied by a quartz crystal, along with integrated 
electronic circuitry to process an electrical speed 
signal. This signal, used by the circuitry to drive the 
air-core gage and odometer stepper motor, is gen- 
erated by a permanent-magnet generator mounted 
in the transmission. This permanent-magnet gen- 
erator, designed to be used with both quartz and 
digital speedometers, provides a sinusoidal speed 
signal that is proportional to vehicle speed at the 
rate of 4004 pulses per mile (2503 per kilometer). 
This speed signal is transmitted to a buffer am- 
plifier which filters and squares the speed signal. 
After conditioning, the signal is transmitted to the 
speedometer circuitry at a rate of 1.111 Hz/(mi)ch) 
[0.694 Hz/(km)(h)]. 

The odometer assembly for the quartz electric 
speedometer consists of conventional odometer 
wheels driven by an electric stepper motor. The step- 
per motor receives a frequency-modulated square- 
wave signal of 0.556 Hz/(mi)ch) [0.348 Hz/(km)(h)] 
from the speedometer circuitry. This signal controls 
the rate at which the motor turns the odometer 
gears. 

The air-core gage mechanism in the electric 
speedometer consists of a coil, bobbin, and case as- 
sembly that utilizes electrical signals supplied by the 
circuit assembly to display vehicle speed. Current 
flow through the coils of the air-core gage causes a 
flux field to be induced. As the flux field varies, the 
armature magnet inside of the coils rotates to align 
itself with the induced field. A spindle and pointer is 
attached to the magnet and indicates information on 
the dial. By controlling the flow of current through 
the coils (via the electronic circuit), the air-core gage 
can be used to display accurately the desired analog 
information. Once the current potential across the 
coils is removed, the pointer returns to zero. 

Digital microprocessor speedometer. In this type of 
instrument (Fig. 3), the vehicle speed is monitored 
by the permanent speed sensor mounted in the trans- 
mission. Like the quartz speedometer system, the 
speed signal is filtered and squared by the buffer am- 
plifier. After conditioning, the signal is transmitted to 
the microprocessor where the counter converts the 
speed signal to a digital signal and stores it in mem- 
ory. The timing circuit has the capacity to handle 
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Fig. 2. Components of a quartz electric speedometer system. 
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Fig. 3. Schematic diagram of the operation of a digital electronic speedometer. The permanent-magnet speed sensor 
generates four pulses for every revolution; at 60 mi/h (97 km/h), 4004 pulses are generated. 
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the counter and memory storage in less than 0.25 s. 
Memory circuit signals are sent to the display circuit, 
which selects the display numerals representing the 
vehicle’s speed, according to the number of pulses 
received from the speed sensor. The electronic 
displays are light-emitting diodes, fluorescent tubes, 
liquid crystals, plasma gas discharge tubes, or 
electroluminescent panels. Odometers used in con- 
junction with digital speedometers are stepper 
motor-driven drum registers or digital displays 
employing nonvolatile memory. See AUTOMOTIVE 
TRANSMISSION; ELECTRONIC DISPLAY; TACHOMETER. 
Robert A. Grimm 
Bibliography. D. Maschino, D. Taylor, and G. 
Thoman, Practical Electric Speedometer System for 
Passenger Car Application, SAE Tech. Pap. 830326, 
February 1983; Society of Automotive Engineers, 
SAE Handbook Nineteen Eighty-Eight: Parts and 
Components, vol. 2., 1988; W. K. Toboldt, Automo- 
tive Encyclopedia, rev. ed., 2000. 


C] 
Spelaeogriphacea 


A crustacean order within the class Malacostraca, su- 
perorder Peracarida. Few spelaeogriphacean species 
are known, four extant species and two supposed 
fossil species. Spelaeogriphus lepidops is from a 
stream in a cave in Table Mountain, South Africa. Poti- 
icoara brasiliensis inhabits a pool in a cave in the 
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Mato Grosso, Brazil. Mangkurtu mityula (see illus- 
tration) and M. kutjarra have been reported from 
aquifers in the Pilbara region of Western Australia. 
Acadiocaris novascotica is a fossil from Carbonifer- 
ous marine sediments in Canada, and Liaoningog- 
ripbus quadripartitus is a fossil from lacustrine 
(ake-associated) deposits of Jurassic age in China. 
It is probable that Acadiocaris is truly a spelaeog- 
riphacean, but it is less certain that Liaoningogri- 
phus is, on the basis of characters visible in fossils, 
its classification in this order or any other is impossi- 
ble to justify. 

Spelaeogriphus lepidops is the largest of the liv- 
ing species, reaching 9 mm (0.36 in.) in length; 
the others attain half that length at most. All have 
a slender flexible body of similar-sized segments. 
The head, incorporating the first thoracic segment, 
supports a short carapace that extends backward 
over the top and sides of the second thoracic seg- 
ment. The carapace encloses a branchial space in 
which sit a pair of maxillipedal gills, less developed 
in Potiicoara than in the other genera. A pair of 
articulating disc-like eyestalks sit on top near the 
front of the head but without pigmented eyes. The 
limbs on the head are a pair of antennules with a 
prominent scaphocerite (scale), a pair of long bi- 
ramous (having two branches) antennae, complex 
mandibles with grinding molar surfaces, incisor and 
palp (sensory appendage near the mouth), setose 
maxillae and maxillules and maxillipeds each with a 
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Lateral (top) and dorsal views of the Western Australian spelaeogriphacean species Mangkurtu mityula. 


prominent basal endite. The patterns of setae 
(bristles) on the mouthparts distinguish speleaog- 
riphaceans from the most closely related peracari- 
dan order, Mictacea. The seven pairs of walking legs 
on the remaining thoracic segments are thin and bi- 
ramous, with the long exopods on the more ante- 
rior ones generating a respiratory current under the 
carapace and past the reduced posterior exopods. 
Each of the first five abdominal segments bears a 
pair of biramous flattened pleopods (abdominal ap- 
pendages used for swimming) except in Spelaeogri- 
phus where the last pair is rudimentary. The last ab- 
dominal segment carries a tail-fan of paired uropods 
and terminal telson. Females incubate 10-16 eggs 
in four or five pairs of oostegites, ventral plates be- 
tween the thoracic legs (as in most Peracarida). Males 
and females differ only in features of the antennae 
and pleopods, which are not the same in all gen- 
era. The food consists of detritus manipulated by 
the mouthparts. The animals creep over the bottom 
with their walking legs and swim by undulations of 
the whole body. All extant species are inhabitants 
of fresh water in subterranean environments, caves, 
or aquifers. It is hypothesized that the group arose 
in the sea but dispersed to Gondwanan continents 
(the continents that formed from the fragmentation 
of the ancient southern supercontinent Gondwana, 
including South America, Africa, Antarctica, and 
Australia), where they were stranded in localized 
fresh-water refugia (areas that escaped the great 
changes that occurred in the regions as a whole, 
providing conditions for relic colonies to survive). 
Because all species are found in such small areas and 
are dependent on the persistence of ground water, 
it could be advocated that they are critically endan- 
gered. See PERACARIDA. Gary C. B. Poore 

Bibliography. I. Gordon, On Spelaeogriphus, a new 
cavernicolous crustacean from South Africa, Bull. 
Brit. Mus. Nat. Hist. (Zool.), 5:31-47, 1957; J. R. 
Grindley and R. R. Hessler, The respiratory mech- 
anism of Spelaeogriphus and its phylogenetic sig- 
nificance (Spelaeogriphacea), Crustaceana, 20:141- 
144, 1971; A. M. S. Pires, Potiicoara brasiliensis: A 
new genus and species of Spelaeogriphacea (Crus- 
tacea: Peracarida) from Brazil with a phylogenetic 
analysis of the Peracarida, J. Nat. Hist., 21:225-238, 
1987; G. C. B. Poore and W. E Humphreys, First 
record of Spelaeogriphacea from Australasia: A new 
genus and species from an aquifer in the arid Pil- 
bara of Western Australia, Crustaceana, 71:721-742, 
1998; Y.-b. Shen et al., A new spelaeogriphacean 
(Crustacea: Peracarida) from the Upper Jurassic of 
China, Contrib. Zool., 68:19-35, 1998. 


A 
Sperm cell 

The male gamete. The typical sperm of most animals 
has a head containing the nucleus and acrosome, a 
middle piece with the mitochondria, and a tail with 
the 9 + 2 microtubule pattern (Fig. 1). Sperm, as 
well as the acrosome shape, varies with the species 





Fig. 1. Diagram of a human spermatozoon, based on 
electron micrographs. 


(Fig. 2). The nucleus consists of condensed chro- 
matin (deoxyribonucleic acid, DNA) and histone 
proteins. The acrosome, which is derived from the 
Golgi complex, contains hydrolytic enzymes, that is, 
hyaluronidase capable of lysing the egg coats at fertil- 
ization. Actin molecules which aid in the interaction 
between sperm and egg are found in the area be- 
tween the acrosome and nucleus. The mitochondria 
in the middle piece apparently provide the energy 
necessary for the motility created by the tail. The 
tail has a central core, or axial filament, made up 
of nine double tubules and two central tubules. See 
CILIA AND FLAGELLA. 

Many groups, including nematodes, myriapods, 
and crustaceans, have atypical sperm which lack a 
flagellum and are presumably nonmotile. The sperm 
of Ascaris is round and moves by ameboid means. 
The crustacean sperm have a large acrosome of sev- 
eral components. In the anomurans a middle with 
mitochondria and arms filled with microtubules pre- 
cedes the nucleus. In the true crabs the nucleus 
forms arms which possess microtubules and also 
surrounds the many component acrosomes. The nu- 
cleus of crustacean sperm does not condense with 
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Fig. 2. Forms of spermatozoa. (After C. R. Austin, Fertilization, Prentice-Hall, 1965) 


maturation as it does in typical sperm mentioned 
above. See SPERMATOGENESIS. Gertrude W. Hinsch 


Spermatogenesis 


The differentiation of spermatogonial cells into sper- 
matozoa. This process occurs within the testes of the 
male (see illus.). The primordial germ cells are desig- 
nated spermatogonia when the gonad is identifiable 
as a testis. 

Spermatogonial divisions occur continuously 
throughout the life of mammals; these divisions both 
maintain the stem cell population (spermatogonial 
cells) and supply cells which develop into sperm. 
Clusters of spermatogonia maintain communica- 
tion through cytoplasmic bridges, and these groups 


become primary spermatocytes when they syn- 
chronously enter the first meiotic prophase. The 
first meiotic prophase is characterized by a series 
of remarkable changes in chromosome morphology, 
which are identical to those seen in the correspond- 
ing stage of oogenesis. The secondary spermatocyte 
produced by this division then undergoes a division 
in which the chromosomes are not replicated; the re- 
sulting spermatids contain half the somatic number 
of chromosomes. See MFIOSIS. 

The spermatids become embedded in the cyto- 
plasm of Sertoli cells, and there undergo the distinc- 
tive changes which result in formation of sperma- 
tozoa. These morphological transformations include 
the conversion of the Golgi apparatus into the acro- 
some and progressive condensation of the chromatin 
in the nucleus. A centriole migrates to a position 


primary spermatocytes 
during first 
meiotic prophase 
and division 


spermatogonial 


division 
spermatogonial 










Sertoli cells 


spermatozoa 





maturing spermatids 


Cellular events in human spermatogenesis. Cell types are 
labeled. 


distal to the nucleus and begins organizing the axial 
filament which will form the motile tail of the sperm. 
Mitochondria may fuse to form a nebenkern as is the 
case for many vertebrates, or there may be less exten- 
sive fusion as in mammals. In all cases the resulting 
structures become located around the axial filament 
in the midpiece. The cytoplasm of the spermatid is 
reflected distally away from the nucleus during sper- 
matid maturation; eventually, most of the cytoplasm 
is sloughed off and discarded (see illus.). 

The Sertoli cells are thought to provide nutrition 
for the developing sperm, because their cytoplasm 
contains large stores of glycogen which diminish as 
spermatids mature. There is no direct evidence for 
this nutritive function, but some forms of male steril- 
ity are associated with the failure to produce normal 
Sertoli cells. Electron microscopy has revealed dis- 
tinct plasma membranes surrounding the two cell 
types at the points of contact, and thus the Sertoli 
cell-spermatid relationship is not syncytial as once 
thought. 

Spermatogenesis is cyclical to a varying extent de- 
pending on the species, and under endocrine con- 
trol. The periodic function of the human testis is 
not nearly as clearly definable as the menstrual cycle 
in the female, and can only be detected by exam- 
ining deoxyribonucleic acid (DNA) synthesis and 
the changes in proportions of the various stages 


Sphaeractinoidea 


present. In seasonal-breeding animals, the testis re- 
gresses, and spermatogenesis ceases during the qui- 
escent season. Spermatogenesis is maintained and 
regulated by male steroid hormones such as testos- 
terone, which is produced by the interstitial or Ley- 
dig cells found in the connective tissue of the testis. 
Interstitial cells, in turn, are stimulated by luteinizing 
hormone (LH) which is produced by the pituitary 
gland. The male testis-regulating hormone was for- 
merly known as interstital cell-stimulating hormone 
ACSH), but it is now known to be identical to LH. 
See ENDOCRINE MECHANISMS; GAMETOGENESIS; MEN- 
STRUATION; TESTIS. Spencer J. Berry 
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Sphaeractinoidea 


An extinct group of fossil organisms generally re- 
ferred to as the Mesozoic stromatoporoids. As origi- 
nally proposed by Othmar Kühn in 1927, the order 
included the Mesozoic stromatoporoids (Sphaerac- 
tiniidae), the heterastridiids (a group of obscure 
spherical Mesozoic fossils commonly regarded as hy- 
drozoans), and the spongiomorphs. Kühn placed all 
of these in the class Hydrozoa. The discovery of 
spicules (spikelike supporting structures) in some 
genera of the sphaeractinids has allowed some gen- 
era formerly considered Mesozoic stromatoporoids 
to be classified in the Porifera as representatives of 
the class Demospongiae and the subclasses Ceracti- 
nomorpha and Tetractinomorpha. This discovery has 
opened the probability that all the sphaeractinids 
will eventually be referable to living sponge fami- 
lies and orders, if traces of spicules can be found in 
them. Both Mesozoic and Paleozoic stromatoporoids 
are now considered to be poriferans; the Paleozoic 
representatives are best considered a class of the phy- 
lum separated from the Mesozoic forms by time, lack 
of spicules, and microstructure. Those genera of the 
Mesozoic group that, at present, cannot be assigned 
to a living poriferan taxon owing to the lack of (or 
our failure as yet to discover) spicules are a disparate 
orphan group that can best be informally referred to 
as sphaeractinids or Mesozoic stromatoporoids. 

These Mesozoic stromatoporoids appear in the 
late Permian with the genus Circopora, but did not 
reach abundance until the Jurassic and early Creta- 
ceous. They declined rapidly in late Cretaceous time, 
and their presence in younger beds is in doubt. They 
grew large, calcareous, internally laminate skeletons 
in encrusting, dome-shaped, bulbous, fingerlike, and 
branched forms. Internally, the skeleton is composed 
of an irregularly repetitive three-dimensional net- 
work of longitudinal pillars or walls, tangential lam- 
inae with beams and cyst plates (see illus.) much 
like that of their Paleozoic look-alikes. In some gen- 
era, longitudinal canals crossed by tabulae are promi- 
nent. The microstructure of the structural elements 
is fibrous in well-preserved fossils and has suggested 
to some that they secreted aragonite rather than cal- 
cite. However, some living sponges secrete calcite 
in a similar microstructure, and it may be that the 
Mesozoic stromatoporoids did the same. 
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(b) 


Sphaeractinoid Actinostromaria rhodaclada, (a) tangential and (6) longitudinal section, 
x6. (From Steinor, 1932, Cretaceous, Jura Mountains) 


The Mesozoic stromatoporoids are important reef- 
building organisms in the Middle East, particu- 
larly in areas that surrounded the Mesozoic Tethys 
seaway. They are known from the Permian of 
Pakistan (Salt Range); the Triassic of the Caucasus, 
Bulgaria, and Indonesia; the Jurassic of Israel, Arabia, 
the Middle East-Mediterranean region, and North 
America (the eastern continental shelf); and the 
Lower Cretaceous of the Mediterranean region. 
See DEMOSPONGIAE; PORIFERA; REEF; SPONGIOMOR- 
PHIDA; STROMATOPOROIDEA. C. W. Stearn 

Bibliography. C. W. Stock et al., Annotated check- 
list of post-Devonian “Stromatoporoid genera” and 
their type species, Fossil Cnidaria Porifera Newsl., 
21(1.2):1-22, 1992; D. TurnSek and A. Barbulescu, 
Upper Jurassic hydrozoa in central Dubrogea 
(Romania), Geologija, 12:73-84, 1969; R. A. Wood, 
Biology and revised systematics of some late 
Mesozoic stromatoporoids, Spec. Pap. Palaeontol., 
no. 37, pp. 1-89, 1987; R. A. Wood and J. Reitner, 
Poriferan affinities of Mesozoic stromatoporoids, 
Palaeontology, 29:469-478, 1986. 
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Sphaerocarpales 


An order of liverworts in the subclass Marchantiidae, 
consisting of two families, the terrestrial Sphaero- 
carpaceae (Sphaerocarpos and Geothallus) and the 
aquatic Riellaceae Riella). The plants are character- 
ized by envelopes surrounding each antheridium and 
archegonium, absence of elaters, poor development 
of seta, and absence of thickenings in the unilayered 
wall of an indehiscent capsule. 

The gametophytes consist of a simple, lobed thal- 
lus of rosette form, or a stemlike axis with lateral 
lobes or appendages. The rhizoids are smooth, and 
air chambers and pores are lacking. Oil bodies may 
be present. The antheridia are separately contained 
within receptacles on the dorsal surface or at the 
margins, while the archegonia are separately pro- 





duced in receptacles dorsally or behind the growing 
point. The sporophyte consists of a bulbous foot, 
short or obsolescent seta, and globose indehiscent 
capsule with a unilayered wall. There are no elaters, 
but minute green cells are mingled with the spores. 
The haploid chromosome number is 8 or 9. See 
BRYOPHYTA; HEPATICOPSIDA; MARCHANTIIDAE. 
Howard Crum 


| 
Sphagnopsida 

A class of the plant division Bryophyta containing 
plants commonly called peatmosses. The spongelike 
plants (Fig. 1) grow as perennials in soft cushions or 
lawns in wet habitats (rarely they grow submerged). 
The class consists of a single genus, Sphagnum, of 
some 200 species. The thallose protonema, fascicled 
branches, dimorphous leaf cells, and spores devel- 
oped from amphithecial tissue are characters unique 
to the class. The plants are ecologically important, 
owing to the spongelike construction of leaves and 
outer cells of stems and branches, and the ability, 
whether living or dead, to create acid conditions by 
exchanging hydrogen ions for cations in solution. 
The dried plant parts are used as a mulch in horticul- 
tural practice, and as fuel where the plant is abun- 
dant. 

Peatmoss stems are simple or forked, with 
branches usually of two kinds, some spreading and 
others pendant, in fascicles spirally arranged around 
the stem and crowded in a headlike apex. Rhizoids 
(present only in early stages) are multicellular with 
slanted crosswalls. The stems and branches are en- 
veloped in a cortex of large, empty, thin-walled 
hyaline cells, which often have pores to the outer 





Fig. 1. Sphagnobrya. (a) Sohagnum capillaceum. 
(b) Sphagnum palustre. (After E. L. Palmer, Fieldbook of 
Natural History, McGraw-Hill, 1949) 


surface. Internal to the cortex is a wood cylinder of 
long, thick-walled cells surrounding an inner tissue 
of more or less thin-walled and undifferentiated cells. 

The stem and branch leaves are generally differen- 
tiated. The branch leaves consist of a network of lin- 
ear green cells (chlorophyllose cells) enclosing large, 
empty hyaline cells that are nearly always reinforced 
internally by annular fibrils and perforated by pores 
on the free surface (Fig. 2). The globose, stalked an- 
theridia occur among the leaves of somewhat differ- 
entiated spreading branches, and are often catkinlike 
and highly colored; one antheridium is produced at 
the side of each leaf. The archegonia are few and 
flask-shaped on very short branches and enclosed by 
large perichaetial leaves. 


chlorophyllose cell 





hyaline cell 


Fig. 2. Cellular structure of a leaf. (After F. W. Emerson, 
Basic Botany, 2d ed., McGraw-Hill, 1954) 
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Fig. 3. Sporophyte of Sphagnum. (After F. W. Emerson, 
Basic Botany, 2d ed., McGraw-Hill, 1954) 
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Fig. 4. Stages in the development of a protonema. 

(a) Germination of spore; (b) division of terminal cell to form 
an apical cell; (c) formation of multicellular rhizoids (after A. 
Engler and K. Prantl, Die naturlichen Pflanzenfamilien, Bd. 
10, 2 Aufl., 1924). (d) Thalloid protonema with young, 
upright leafy branch forming (after F. W. Emerson, Basic 
Botany, 2d ed., McGraw-Hill, 1954). 


The sporophytes are anchored by a foot, and are 
raised beyond the perichaetial leaves at maturity 
by an extension of the gametophyte called a pseu- 
dopodium (Fig. 3). The globose capsules dehisce 
by a lidlike operculum. The capsule wall consists 
of several layers of cells forming a solid tissue, with 
large numbers of nonfunctional pseudostomata (hav- 
ing two guard cells but no pore) scattered over the 
surface. The spores are derived from the amphithe- 
cium, or outer portion of the capsule. The spores are 
tetrahedral, with triradiate scars marking a tetrad ori- 
gin. Amembranous calyptra invests the capsule until 
its maturity. The protonema is normally a small, lobed 
thallus producing one or rarely two leafy plants but 
sometimes proliferating marginal filaments that pro- 
duce at their tips secondary protonemata (Fig. 4). 
The haploid chromosome number is 19 and 38. See 
BRYOPHYTA. Howard Crum 

Bibliography. H. Crum, Sphagnopsida, Sphag- 
naceae, No. Amer Flora 2, 11:1-180, 1984; B. 
Lange, Key to the northern boreal and arctic species 
of Sphagnum, based on characteristics of the stem 
leaves, Lindbergia, 8:1-29, 1982; D. H. Vitt and 
R. E. Andrus, The genus Sphagnum in Alberta, Can. 
J. Bot., 55:331-357, 1977. 
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Sphalerite 


A mineral, 6-ZnS, also called blende. It is the low- 
temperature form and more common polymorph 
of ZnS. Pure -ZnS on heating inverts to wurtzite, 
a-ZnS, at 1020°C (1868°F), but this temperature can 
be lowered substantially by impurity-atom solid so- 
lution (especially Cd?* and Fe?) and sulfur fugac- 
ity. Sphalerite crystallizes in the hextetrahedral class 
of the isometric system with a structure similar to 
that of diamond. The space group is F43m, and 
the cubic unit cell has an edge a = 0.543 nanome- 
ter, which contains four ZnS molecules. Zinc atoms 
occupy the positions of half the carbon atoms of 
diamond, and sulfur atoms occupy the other half. 
Each zinc atom is bonded to four sulfur atoms, and 
each sulfur atom is bonded to four zinc atoms. The 
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(a) (b) 


Sphalerite. (a) Crystals in limestone from Joplin, Missouri 
(specimen from Department of Geology, Bryn Mawr 
College). (b) Crystal habit (after C. Klein and C. S. Hurlbut, 
Jr, Manual of Mineralogy, 21st ed., John Wiley and Sons, 
1993). 


common crystal forms of sphalerite are the tetrahe- 
dron, dodecahedron, and cube, but crystals are fre- 
quently complex and twinned (see illus.). The min- 
eral is most commonly in coarse to fine, granular, 
cleavable masses. The luster is resinous to submetal- 
lic; the color is white when pure, but is commonly 
yellow, brown, or black, darkening with increased 
percentage of iron. It has been shown that excess 
sulfur can also contribute to the darkening of the 
color. There is perfect dodecahedral cleavage; the 
hardness is 3'/2 on Mohs scale; specific gravity is 4.1 
for pure sphalerite but decreases with increasing iron 
content. 

Pure sphalerite contains 67% zinc and 33% sulfur, 
but iron is usually present, substituting in the struc- 
ture for zinc and may amount to 36%. The amount of 
iron in sphalerite varies directly with temperature at 
the time of crystallization. Cadmium and manganese 
may be present in small amounts. 

Sphalerite is a common and widely distributed 
mineral associated with galena, pyrite, marcasite, 
chalcopyrite, calcite, and dolomite. It occurs both 
in veins and in replacement deposits in limestones. 
As the chief ore mineral of zinc, sphalerite is 
mined on every continent. The United States is the 
largest producer, followed by Canada, Mexico, Rus- 
sia, Australia, Peru, the Congo, and Poland. See ZINC. 

Cornelius S. Hurlbut, Jr.; Paul B. Moore 
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Sphenisciformes 


The penguins, a small monotypic order of flight- 
less marine swimming birds found in the colder 
southern oceans (see illustration). They have most 
likely evolved from the order Procellariiformes, per- 
haps from a diving petrel-like ancestor. Classifica- 
tion schemes that hypothesize a link between pen- 
guins and loons have no factual support. At one time, 
the Sphenisciformes had been placed in a separate 
superorder, Impennes, but such a designation sug- 
gested too great a divergence of the penguins from 
their ancestral group of tube-nosed swimmers. Some 
taxonomists have enlarged the Ciconiiformes to in- 
clude the Sphenisciformes penguins, as in the Sibley- 
Ahlquist taxonomy (a radical new approach to tax- 
onomy, based on DNA hybridization studies). See CI- 
CONIIFORMES; GAVIIFORMES; PROCELLARIIFORMES. 


Fossil record. The fossil record of the penguins 
begins in the late Eocene and continues through- 
out the Tertiary. Fossils have been found in Antarc- 
tica, Australia, and South Africa, which approximates 
the range of present-day penguins. The earliest fossil 
penguins were already well specialized as flightless, 
wing-propelled swimming birds, and their fossil his- 
tory provides little new information about the evolu- 
tionary history and distribution of the group. Some 
fossil penguins were much larger than the emperor 
penguin, the largest living species. Some Eocene 
species were gigantic and had a long spearlike bill. 

Characteristics. Penguins are medium-sized to 
large birds. They are completely flightless, their 
wings having been modified into stiff flattened flip- 
pers. They stand upright on legs that are placed far 
posterior and that terminate in four toes, the ante- 
rior three of which are webbed. Penguins swim and 
dive well, using only their wings for propulsion: their 
feet are used only for steering during swimming. Ter- 
restrial locomotion is by walking, hopping, or slid- 
ing on the belly while pushing with the wings. The 
plumage consists of dense scalelike feathers that are 
black dorsally and white ventrally. A distinctive pat- 
tern or crest, often yellow, occurs on the head. 

Penguins are gregarious, breeding in large colonies 
along the coast. The males and females, which 
are identical, form strong pair bonds and share in 
the incubation and care of the downy nestlings. 
The older young of some species are kept in large 
groups, or creches. The emperor penguin (Apten- 
odytes forsteri) breeds on the ice pack along the 
Antarctic coasts during the fall. Incubation of the one 
egg is the responsibility of the male, which remains 
on the nest for over 2 months in the winter without 
eating. The females leave the breeding colony dur- 
ing this period to feed, and return fat and laden with 





Aptenodytes patagonicus, the king penguin. (Photo by 
Gerald and Buff Corsi, © California Academy of Sciences). 


food for the young. Males then leave for the ocean 
and return with food to relieve the females. This al- 
ternation continues until the young are old enough 
to enter the water. By this time, the sea ice has melted 
back close to site of the nesting colony. 

Habitat. Penguins are found only in the cold south- 
ern oceans, on the Antarctic continent and its sur- 
rounding islands and northward to Australia, New 
Zealand, South America, and Africa. One species, 
the Galapagos penguins (Spheniscus mendiculus), 
is found in the Galapagos Islands, which are on the 
Equator but are surrounded by the cold Humboldt 
Current. See AVES. Walter J. Bock 

Bibliography. L. S. David and M. Renner, Penguins, 
Yale University Press, 2004; L. Davis and J. Darby 
(eds.), Penguin Biology, Academic Press, 1990; T. D. 
Williams, The Penguins, Oxford University Press, 
1995. 
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Sphenophyllales 


An extinct order of articulate land plants, common 
during Late Pennsylvanian and Early Permian times. 
They are typified by Sphenophyllum (see ilkus.), a 
small, branching plant, probably of trailing habit. The 
long, jointed stems rarely exceeded 0.4 in. (1 cm) in 
diameter and had superposed, longitudinal, surficial 
ribs between nodes. The vascular system contained a 
solid xylem core with triangular primary wood. The 
leaves were wedge-shaped, usually shorter than 0.8 
in. (2 cm), and had toothed, notched, or rounded 
distal margins. They were attached at the nodes by 
their narrow ends, in whorls of usually 6 or 9 leaves 
each, rarely 18. Long, terminal cones, usually called 
Bowmanites when found detached, contained spo- 
rangia and spores. The sporangia terminated slen- 
der stalks, forming concentric whorls that alternated 





Reconstruction of Sohenophyllum cuneifolium, showing 
whorled leaves and two cones. (After G. M. Smith, 
Cryptogamic Botany, vol. 2, 2d ed., McGraw-Hill, 1955) 


with whorls of sterile bracts. Most species were ho- 

mosporous (produced spores of a single type). See 

PALEOBOTANY; PLANT KINGDOM; SPHENOPHYTA. 
Sergius H. Mamay 
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Sphenophyta 

One of the major divisions (formerly known as Equi- 
setophyta) of vascular plants that includes both liv- 
ing and fossil representatives. The three principal 
orders are Pseudoborniales Devonian), Sphenophyl- 
lales (Devonian-Triassic), and Equisetales (Devonian- 
Recent); the Hyeniales (Devonian) may also be 
sphenophytes. See EQUISETALES; HYENIALES; PSEU- 
DOBORNIALES; SPHENOPHYLLALES. 

All sphenophytes are characterized by axes with 
distinct nodes that produce whorls of small leaves or 
branches; branches often contain longitudinal ribs 
and furrows. Internally the stems of sphenophytes 
are characterized by longitudinally oriented canals, 
some of which functioned in gaseous exchange. Sec- 
ondary tissues were produced in a few fossil forms. 
The reproductive organs of this group are loosely ar- 
ranged strobili or cones consisting of a central axis 
bearing whorls of modified branches, each termi- 
nating in a recurved, thick-walled sporangium. Most 
sphenophytes produced one type of spore (homo- 
spory), although a few fossil forms were het- 
erosporous. 

Pseudoborniales are known from a single species 
collected from a few Devonian localities south of 
Spitsbergen and in Alaska. Impression fossils suggest 
that the plant was a tree about 66 ft (20 m) tall, con- 
structed of whorls of branches and leaves. The repro- 
ductive parts of Pseudobornia are not well known, 
but appear to contain recurved sporangia with one 
type of spore. 

Sphenophyllales, less than 3 ft (1 m) tall, formed a 
portion of the Carboniferous forest understory. The 
leaves were often wedge shaped, and many were pro- 
duced in a whorl. Some forms produced secondary 
tissues in the stems. All were homosporous, with the 
sporangia produced in cones. 

During the Carboniferous, the sphenophytes were 
represented by numerous species in the order Eq- 
uisetales, ranging from herbaceous forms to trees. 
Both homosporous and heterosporous forms were 
present, but the spores of all species contained extra 
exinous structures termed elaters. During the Meso- 
zoic Period, the group declined, with fewer, smaller 
plants. 

Today all that remains of the sphenophytes is the 
genus Equisetum, commonly called the horsetail or 
scouring rush. Except in Australia and New Zealand, 
the members are worldwide in distribution and typ- 
ically grow in damp habitats along the edges of 
streams, although some species are adapted to mesic 
conditions. A few species attain considerable size, 
but none of these produce secondary tissues. All 
species are characterized by one type of spore (ho- 
mospory), each bearing two pairs of elaters on the 
surface. See EMBRYOBIONTA. Thomas N. Taylor 

Bibliography. T. N. Taylor and E. L. Taylor, The Bi- 
ology and Evolution of Fossil Plants, 1993. 
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= 
Sphere 


Both in euclidean solid geometry and in common 
usage the word sphere denotes a solid of revolution 
obtained by revolving a semicircle of radius r about 
its diameter. Its total volume is V = 4/3zr> 

However, in analytic geometry, and more generally 
in modern mathematics, the word sphere denotes a 
spherical surface that bounds a solid sphere. In this 
sense a sphere is the locus of all points P in three- 
dimensional space whose distance from a fixed point 
O (called the center) is equal to a given number. The 
word radius may refer either to one of the segments 
OP, or to their common length r. A plane that inter- 
sects a sphere in just one point is called a tangent 
plane and is perpendicular to the radius drawn from 
the center of the sphere to that point. A plane that 
intersects a sphere in more than one point intersects 
it in a circle. The circle is called a great circle or a 
small circle of the sphere according to whether the 
plane does or does not pass through the center of the 
sphere. If two parallel planes intersect a sphere (see 
illus.), the spherical surface between them is called 
a zone, and its area is 2xzrhb, where the height p of 
the zone is the distance between the parallel planes. 
The volume bounded by the zone and the two base 
planes is called a spherical segment, and is measured 
by the formula V = (th/6)Br7 + 373 + b>, where rı 
and r2 are the radii of the two base circles. It is also 
measured by the prismoidal formula V = (b/6)(B, + 
4M + B2), where B,, Bz denote the areas of the two 
bases, and M the area of the midsection halfway be- 
tween the bases. 

Any great circle of a sphere divides it into two 
hemispheres. A second great circle cuts a hemi- 
sphere into two lunes, each having vertices at the 
two points where the great circles intersect and each 
having an area proportional to the angles at the ver- 
tices between the tangents to the great circle bound- 
aries. A third great circle cuts each lune into two 
spherical triangles. In general a spherical triangle is 
the figure formed by connecting any three points 
A, B, C on the surface of a sphere (but not on the 
same great circle) by great circle arcs called sides. 
The sides BC, CA, AB are measured by the angles a, 
b, c that they subtend at the center of the sphere. If 
the angles between the pairs of sides at the vertices 
A, B, C are denoted by a, 6, and y, respectively, and 





Segment of a sphere formed by intersection of two parallel 
planes with sphere. 


if 20o =« + B + y — 180° denotes the excess of the 
sum of these angles over two right angles, then the 
area of a spherical triangle of excess 20 on a sphere 
of radius r is (6 /90°)r? (or 2a if o is measured in 
radians). Important relations between the sides and 
angles of a spherical triangle are as follows: 

sina sinf siny 


Law of sines: — =— — 
sina sin b sinc 











Law of cosines: 
cosa = cosb cosc + sin b sin c cos 
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Spherical harmonics 

A spherical harmonic or solid spherical harmonic of 
degree n is a homogeneous function, R,(x,y,z), of 
degree n which satisfies Laplace’s equation (1). 
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Here n is any number and 
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is a spherical harmonic of degree —n — 1. There are 
analogous definitions for spaces of any number of 
dimensions. In the present article, n will be a non- 
negative integer and R, a polynomial in x,y,z (poly- 
nomial spherical harmonic). In terms of spherical 
coordinates r, 0, Q, RŒ y, D = 1S 0,p) where Sn, 
a polynomial in cos 6, sin 6, cos ¢, sin @ is a spheri- 
cal surface harmonic of degree n. There are 27 + 1 
linearly independent spherical surface harmonics of 
degree n; any spherical surface harmonic of degree 
n is a linear combination of these, and conversely 
any linear combination of spherical surface harmon- 
ics of degree n is again a spherical surface harmonic 
of degree n. 

Applications. Spherical harmonics occur in poten- 
tial theory. They occur in connection with Laplace’s 
equation not only in spherical coordinates but also in 
spheroidal coordinates (spheroidal harmonics) and 
confocal coordinates (ellipsoidal surface harmon- 
ics). In the latter case their occurrence is due to 
the circumstance that the natural affine mapping of 
an ellipsoid onto a sphere carries the partial differ- 
ential equation of ellipsoidal surface harmonics into 
the partial differential equation satisfied by spherical 
surface harmonics. In spherical coordinates, spher- 
ical surface harmonics occur in connection with 
Laplace’s and Poisson’s equations, the wave equa- 
tion, the Schrödinger equation, and generally in con- 
nection with partial differential equations of the form 
AU + fU = 0. In the latter case one has special 
solutions of the form FS,(0,ġ), where F satisfies 
the ordinary differential equation (2). In geometry, 
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spherical surface harmonics are used in the theory of 


surfaces. In mathematical physics, spherical harmon- 

ics appear in the theories of gravitation, electricity 

and magnetism, hydrodynamics, and in other fields. 
Spherical harmonics of degree n. Notation (3), with 
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JaD an integrable function, is a polynomial spherical 
harmonic of degree n, and every such spherical har- 
monic can be so represented. The representation is 
not unique. If c, is a constant, h1,b2, .. . , bn are n di- 
rections (not necessarily distinct), and 0/d/ denotes 
directional differentiation in the direction þ, then no- 
tation (4) is a polynomial spherical harmonic of de- 
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gree n, and every such spherical harmonic can be 
so represented. The representation is unique. In a 
zonal spherical harmonic the n directions coincide, 
and in a sectorial spherical harmonic they are in a 
plane at angles of m/n. If n — m directions coin- 
cide in the axis and the remaining directions are in 
the plane perpendicular to the axis at angles of 2/m, 
one has a tesseral spherical harmonic of degree n and 
order m. 

Explicitforms. For spherical harmonics whose axis 
is the z axis (n = 0, 1, 2, ..., n), Eq. (5) represents 
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a linearly independent system of spherical surface 
harmonics of degree n. S”,, + S~”, is a zonal, secto- 
rial, tesseral spherical surface harmonic of degree n 
and order m according as m =0,m=n,1<m< 
n — 1. The P7’(w) are associated Legendre functions 
which satisfy the associated Legendre equation (6). 
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P? = P, is the Legendre polynomial of degree n. 

Properties. A function is said to be harmonic in a 
region if it is a twice continuously differentiable so- 
lution of Laplace’s equation there, and if, in addition, 
it vanishes at infinity in case the point at infinity is an 
interior point of the region. Every function harmonic 
inside a sphere about the origin can be expanded in 
a series 
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convergent inside that sphere. Every function har- 
monic outside a sphere about the origin can be ex- 
panded in a series 
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n=0 


convergent outside that sphere. The reciprocal dis- 
tance of the points (x,y,z) and (0, 0,a) is harmonic 


Spherical harmonics 


in the regions r < a and r > a and possesses in these 
regions the expansions of Eqs. (7). 
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Here 6, ¢ determine a point on the unit sphere. 
The scalar product of two functions, f and g, on the 
unit sphere is suitably defined as Eq. (8) where g 
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is the complex conjugate of g. If Gg) = 0, f and 
g are orthogonal. Spherical surface harmonics are 
functions on the unit sphere. Any two spherical sur- 
face harmonics of different degrees are orthogonal. 
The spherical surface harmonics of Eq. (9) form 


5"(6,¢)m =—n,-n+1,...,n (9) 
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an orthogonal system; that is, (S77, Sm’) = O unless 
m = m and n = wW. This orthogonal system is com- 
plete; that is, a continuous function which is orthog- 
onal to all the $} vanishes identically. With an in- 
tegrable function f is associated the Laplace expan- 
sion (10) 
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Under suitable conditions, the Laplace expansion 
will converge to f. For instance, if fis continuous and 
continuously differentiable on the unit sphere, then 
the Laplace expansion converges to f uniformly. 

Let (0o,ġ0) be a fixed point, and let cos y = 
cos 0 cos 0o + sin 0 sin 8o cos (h — ġo) be the spheri- 
cal distance of (6,@) and (@0,¢0). The Laplace expan- 
sion of Eq. (11) is the addition theorem of Legendre 
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polynomials and expresses the change to a new axis 
through the point (0,¢0). Other spherical surface 
harmonics have corresponding addition theorems. 
Let cos y be the spherical distance of (@,¢) and 
(60,0), and let R(w) be a continuous function for 
—1 < w < 1. Then for any spherical surface harmonic 
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Sphingolipid 
S, of degree n, Eq. (12) holds. 
f f K(cos y) 5,0, ¢)sin0 d0 do 
0 =x 


= AySn@o, bo) (12) 


1 
where Ag = 20 KWP, (wW) dw 
-1 
See DIFFERENTIAL EQUATION. A. Erdélyi 
Bibliography. A. Erdélyi et al., Higher Transcenden- 
tal Functions, 3 vols., 1953-1955, reprint 1981; 
E. W. Hobson, Spherical and Ellipsoidal Harmon- 
ics, 1955; M. N. Jones, Spherical Harmonics and 
Tensors for Classical Field Theory, 1985; E. T. 
Whittaker and G. N. Watson, A Course of Modern 
Analysis, 4th ed., 1927. 


SESS 
Sphingolipid 

Any lipid containing the long-chain amino alco- 
hol sphingosine (structure 1) or a variation of 
it, such as dihydrosphingosine, phytosphingosine 
(structure 2), or dehydrophytosphingosine. Sphingo- 


CH3(CH>); CH=CH = =o — CHOH 


OH NH, 
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CH3(CH3)13CH — CH — CH — CHOH 
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sine itself is synthesized by condensing a long-chain 
fatty acid with the amino acid serine. 


Sphingosine is converted into a variety of deriva- 
tives to form the family of sphingolipids. The sim- 
plest form is a ceramide which contains a sphingo- 
sine and a fatty acid residue joined by an amide link- 
age. The structure of the ceramide N-acylsphingosine 
is shown in structure 3. 
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Ceramide is the basic building block of practically all 
of the naturally occurring sphingolipids. Ceramide 
can be further modified by the addition of a phos- 
phorylcholine at the primary alcohol group to form 
sphingomyelin (structure 4), a ubiquitous phospho- 
lipid in the plasma membranes of virtually all cells. 
Modification of a ceramide by addition of one or 
more sugars at the primary alcohol group converts it 
to a glycosphingolipid, which occurs widely in both 
the plant and animal kingdoms. The simplest form 
of a glycosphingolipid is a cerebroside which usu- 
ally contains a glucose or a galactose residue joined 
to the ceramide moiety through a glycosidic linkage. 
The carbohydrate unit can be further extended to 
form more complex structures. A notable example 
of modification is the addition of one or more sialic 
acid residues to an existing carbohydrate chain to 
form the family of gangliosides which are particu- 
larly abundant in the nervous system. The structure 
of a typical ganglioside, GM1, is shown (structure 5). 
See GLYCOSIDE; LIPID. 
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In addition to being important structural com- 
ponents of cellular membranes, sphingolipids par- 
ticipate in diverse cellular functions. A number of 
inheritable diseases that can cause severe mental re- 
tardation and early death occur as the result of a de- 
ficiency in one or more of the degradative enzymes, 
resulting in the accumulation of a particular sphin- 
golipid in tissues. These diseases are collectively 
called sphingolipidoses and include Niemann-Pick 
disease, Gaucher disease, Krabbe disease, metachro- 
matic leukodystrophy, and several forms of gan- 
gliosidoses, such as Tay-Sachs disease. Functionally, 
glycosphingolipids are known to serve as impor- 
tant cell-surface molecules for mediating cell-to-cell 
recognition, interaction, and adhesion. They also 
serve as receptors for a variety of bacterial and 
viral toxins. Many glycosphingolipids can modu- 
late immune responses as well as the function of 
hormones and growth factors by transmitting sig- 
nals from the exterior to the interior of the cell. 
A number of glycolipids are also found to partic- 
ipate in a variety of immunological disorders by 
serving as autoantigens. Other sphingolipids and 
their metabolites, notably ceramide, sphingosine-1- 
phosphate, sphingosylphosphorylcholine, and NN- 
and NN N-methylsphingosine, may serve as second 
messengers in several signaling pathways that are im- 
portant to cell survival or programmed cell death 
(apoptosis). See AUTOIMMUNITY; METABOLIC DISOR- 
DERS. Robert K. Yu 


LL _ 
Spica 

A 1st-magnitude star in the constellation Virgo. Spica 
(a Virginis) is a hot main-sequence star of spec- 
tral type B1, effective temperature about 25,000 K 
(45,000°F), at a distance of 80 parsecs from the Sun 
(2.59 x 10 km or 1.61 x 10! mi). See SPECTRAL 
TYPE. 

It is a double-line spectroscopic binary with an 
orbital period of almost exactly 4 days. The pair 
has been resolved with an intensity interferometer, 
which allows an accurate determination of the fun- 
damental parameters of the system. The primary has 
a mass that is 10.8 times that of the Sun, and a radius 
of 8.1 solar radii; the mass and radius of the sec- 
ondary are 6.8 and 4.4, respectively, in solar units. 
The system constitutes an example of an ellipsoidal 
variable, where the separation between the two stars 
is so small that powerful tidal forces distort the stars. 
The changing aspect of the primary as seen from 
Earth causes light variations with a period equal to 
the orbital period. The separation between the two 
stars is only about 3.5 times the radius of the primary. 
Both components are rotating around their axes very 
rapidly at projected equatorial velocities of 160 km/s 
(100 mi/s) and 100 km/s (60 mi/s), respectively. See 
BINARY STAR; ECLIPSING VARIABLE STARS; STELLAR RO- 
TATION. 

The primary itself is an intrinsic variable of the £ 
Cephei type and is pulsating with a period of about 4 
h. Asa result, the observed light curve is a superposi- 
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tion of two different effects. There are also changes 
in the profiles of the spectral lines, which are related 
to the same mechanisms. See OCCULTATION; STAR; 
VARIABLE STAR. David W. Latham 


= 
Spice and flavoring 


Ingredients added to food to provide all or a part 
of the flavor. Spices, a unique category of flavorings 
which are given preferred legal status because of the 
long history of their use in foods, are pungent or aro- 
matic substances of vegetable origin used in foods at 
levels that yield no significant nutritive value. Flavor 
is the perception of those characteristics of a sub- 
stance taken orally that affect the senses of taste and 
olfaction. The term flavoring refers to a substance 
which may be a single chemical species or a blend 
of natural or synthetic chemicals whose primary pur- 
pose is to provide all or part of the particular flavor 
effect to any food or other product taken orally. Fla- 
vorings are categorized by source: animal, vegetable, 
mineral, and synthetic. 

Chemical nature of flavor. The sensation of flavor 
has a chemical basis that involves both taste and 
smell. 

Taste. The elements of taste consist of sourness, 
caused by hydrogen ions found in organic and inor- 
ganic acids; sweetness, caused by organic chemicals 
containing hydroxyl, carbonyl and amino groups, 
for example glycerol, sugars, amino acids, saccha- 
rin, hexamic acid salts; saltiness, a characteristic of 
sodium chloride, but also exhibited by other in- 
organic salts; and bitterness, which is caused by 
inorganic salts and some relatively high-molecular- 
weight organic compounds, such as alkaloids, for ex- 
ample, quinine, tannins, and sucrose octacetate. The 
taste-producing substances are usually nonvolatile. 

Sensations of pain, heat, and coolness are also 
caused by chemicals. Several fatty acid amides of 
vanillylamine are responsible for the heat or bite in 
red peppers, while another amide, piperine, and its 
relatives provide the pungency of black pepper. Men- 
thol produces a coolness at low concentrations in the 
mouth. The astringency that is produced by alum is 
also produced by tannins from tea and other botani- 
cals. 

Smell. The smell (odor) of a food is judged by the 
nose as volatile components of the food enter ei- 
ther through the nostrils or through the mouth. The 
chemical constituents which cause the odor compo- 
nent of flavor therefore are volatile. No simple char- 
acterization of basic odors has yet been found such 
as is available for tastes. Attempts to reduce the thou- 
sands of complex identifiable odors to a simple clas- 
sification have been made, but have not been widely 
accepted. 

Types of flavors. The application of the physical 
processes of distillation and extraction developed 
during the Middle Ages led to a wide range of es- 
sential oils, extracts, and oleoresins. Research involv- 
ing the chemistry of natural products in the second 
half of the nineteenth century provided a nucleus 
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TABLE 1. Volatiles identified in flavorful foods* 

Type Hydrocarbons Alcohols Aldehydes and ketones Acids Esters Bases Sulfur compounds Miscellaneous Total 
Apple 8 40 39 39 80 4 — lid 221 
Cheddar cheese 32 1 25 28 25 14 6 12 153 
Cinnamon 24 9 10 15 8 = — 15 81 
Cocoa 45 2 52 37 38 103 18 43 383 
Onions and garlic 1 5 15 2 5 4 59 8 99 
Potato 24 24 31 13 1 9 23 9 134 
Tomato Te 31 90 2 28 4 8 9 217 
Cooked beef 74 32 76 22 25 25 45 30 329 
Vanilla 37 24 21 15 39 2 il 23 162 
Roasted peanuts 37 25 72 29 1 63 16 25 278 

*From S. van Straten (ed.), Volatile Compounds in Food, 4th ed., 1977, and supplements. 








of chemicals for use as flavoring ingredients. The re- 
actions studied at that time (Knoevenagel conden- 
sation, Knorr pyrrole synthesis, Perkins reactions 
and rearrangements, Kolbe-Schmitt reaction) pro- 
duce the characterizing ingredients of many spices 
and fruits. Spices contain the largest concentration 
of flavor volatiles and were the first to be studied. 
This early work confirmed the presence of, and the 
structure of, vanillin in vanilla beans at 2% concen- 
tration, eugenol in clove buds at 12%, and cinnamic 
aldehyde in cinnamon at 1-2% concentration. 

The first half of the twentieth century brought a 
continuous effort to identify the flavorful volatiles 
in food by classical organic chemical methods. The 
many ingredients in citrus peel oils and noncitrus 
fruits began to yield their flavor secrets. Creative 
flavorists utilized this knowledge and filled in the 
gaps with information supplied by their noses and 
their palates. Reasonably representative flavors of the 
fruits became possible to produce. These fruits con- 
tained volatiles which effected their flavor at concen- 
trations of 0.01-0.1% (100-1000 ppm). 

The second half of the twentieth century, espe- 
cially since the application of gas-liquid chromatog- 
raphy, has seen an explosion of information concern- 
ing the identity of the volatile components of all 
flavorful foods. The sensitivity of equipment for de- 
tecting ingredients and the speed with which equip- 
ment can identify these ingredients has confirmed 
the complex nature of flavor. Many of the volatiles 
in the less highly flavored foods have been exam- 
ined, isolated, and identified. As many as several hun- 
dred volatile organic chemicals have been identified 
in common flavorful foods (Table 1). These compo- 


nents range from simple hydrocarbons to complex 
chemicals with sulfur or nitrogen (Table 2). While 
many of these chemicals are not present in sufficient 
quantities to affect the flavor and some key ingredi- 
ents have not yet been identified, flavorists have used 
this information in attempts to prepare artificial fla- 
vors. 

A so-called nature-identical chemical is self- 
defining. If the flavor of foods were due to 
stable chemicals, flavors could be made from 
natural-identical chemicals alone. But the natural fla- 
vors of strawberries, grilled hamburger, or roast beef 
are not stable. Research involving development of 
synthetic shelf-stable flavors continues. Many of the 
types of flavors available are listed in Table 3. 

Classification. There are several ways of dividing 
flavorings into subgroups other than source. They 
can be divided into two groups; one group affects 
primarily the sense of taste, and the other affects 
primarily the sense of olfaction. The members of the 
first group are called seasonings, the members of the 
second group are called flavors. 

The same terms can be used to divide flavorings in 
another way. The term flavors can be applied to those 
products which provide a characterizing flavor to a 
food or beverage. The term seasoning can then be 
applied to those products which modify or enhance 
the flavor of a food or beverage—spices added to 
meats, blends added to potato chips, lemon added 
to apple pie. 

Flavorings can also be classified by their physical 
form, as solids, liquids, or pastes. The form will be 
determined by the nature of the flavoring, and the 
form of food to be flavored. 














TABLE 2. Flavor components of various types of flavorful foods 
Total flavor 
Type Component Species components, ppm 
Cooked and 
processed meats Amino acids, sugars Sulfides, pyrazines, thiazoles, furanones 50 
Fish Amino acids Amines, substituted phenols 50 
Nuts Texture Pyrazines, lactones, substituted phenols 50 
Vegetables Starch, sugars Unsaturated aldehydes, sulfur compounds 10-100 
Dairy Fats, acids Acids, ketones, esters 100-1000 
Fermented Alcohols, acids Alcohols, aliphatic esters 100-1000 
Fruits Sucrose, citric acid Esters of aliphatic acids, aliphatic aldehydes, aromatic 100-1000 
aldehydes, terpenes 
Spices Amides Terpenes, phenols, aldehydes, ketones, ethers 2000-200,000 

















TABLE 3. Examples of artificial flavors and their uses 
Type Example Use 
Cooked and Beef Gravy mix 
processed meat Bacon Bacon bit or 
analog, snack 
Seafood Shrimp Snack cracker 
Nut Walnut Toppings, cake 
mixes 
Peanut Cookies, candies 
Vegetable Tomato Sauces 
Cucumber Dressings 
Dairy Butter Margarine, baked 
goods 
Cheese Snacks, analogs 
Fermented Vinegar Salad dressings 
Bread Chemically raised 
bread 
Beer and wine Sauces 
Rum Baked goods 
Fruits Lemon Candy, baked 
goods 
Apple Baked goods, 
chewing gum 
Strawberry Candy, brandy 
Papaya Noncarbonated 
beverages 
Spices Anise Baked goods, 
candy 
Dill Pickles, sauces 
Nutmeg Sauces, 
condiments, 
processed 
meats 











In the United States most flavorings are processed, 
since undried herbs and fresh fruits for desired flavor 
effects are available only on a small scale. Therefore 
another classification is by method of manufacture. 
There is general agreement on the several commer- 
cial designations of flavorings, indicating the method 
of manufacture of each and the type of flavors 
which are available by using ingredients from each 
category. 

Chemicals. These are produced by chemical synthe- 
sis or physical isolation and include such substances 
as vanillin, salt, monosodium glutamate, citric acid, 
and menthol. Artificial or natural, they are only occa- 
sionally used alone, and usually are compounded or 
blended with other ingredients or diluted to achieve 
a specific flavor character at a workable concentra- 
tion. See SALT FOOD). 

Concentrates. Powders are manufactured by dehy- 
drating vegetables, such as onions and garlic, or con- 
centrated fruit juices. These expensive natural prod- 
ucts are available only in limited quantities. 

Condiments. These are single ingredients or blends 
of flavorful (sometimes exotic) foods, spices, and sea- 
sonings, some of which may have been derived by 
fermentation, enzyme action, roasting, or heating. 
They are usually designed to be added to prepared 
food at the table (for example, chutney, vinegar, soy 
sauce, prepared mustard). Some are completely nat- 
ural, while others are a blend of natural and synthetic 
ingredients. 

Spices. These are natural substances that have been 
dehydrated and consist of whole or ground aromatic 
and pungent parts of plants, for example, anise, cin- 
namon, dill, nutmeg, and pepper. Such spices are 
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also classified as herbs if grown in temperate climates 
(Table 4). 

Extracts. These are natural substances produced by 
extraction from solutions of the sapid constituents of 
spices and other botanicals in food-grade solvents. 
They are also available as synthetics. Extracts are 
used in home food preparation, and industrially by 
beverage manufacturers. Examples include vanilla, 
lemon, kola nut, and coffee. 

Oleoresins. These are the natural volatile and non- 
volatile flavor constituents of spices and other botan- 
icals produced by volatile solvent extraction and sub- 
sequent removal of the solvent by distillation. They 
are used industrially in processed meat, canning, and 
baking industries. 

Essential oils. These are the volatile oils obtained 
from spices and other plant materials which have 
the characteristic odors of their sources. They are 
usually obtained by distillation, although the citrus 
peel oils are obtained by expression. Essential oils 
are used in confectionery and beverage industries, 
and in compounded flavors for many applications. 
See ESSENTIAL OILS. 

Hydrolyzed vegetable proteins. These flavors are pro- 
duced by optimizing the development of a basic 
meaty flavor through hydrolysis of various veg- 
etable proteins. They are used in soup formu- 
lations and in compounding flavors and season- 
ings. 

Process flavors. These flavors are natural or artificial 
products of roasting, heating, fermentation, or enzy- 
molysis of foods or normal food ingredients. 

Compounded or blended flavorings. These are a group of 
substances produced by mixing. 

Seasonings. Available in dry or liquid form, these are 
natural or artificial flavorings produced by the blend- 
ing of various categories of flavoring agents with fla- 
vorful foods and flavoring adjuncts, for example, free- 
flow agents and antioxidants. They are suitable for 
altering the flavor of foods to which they are added, 
with the emphasis on the taste of the food; for ex- 
ample, barbecue seasoning for potato chips or chili 
seasoning for beans. 

Flavors. These are natural or artificial flavorings pro- 
duced by the compounding of various categories 
of flavoring ingredients and flavoring adjuncts (for 
example, solvents, antioxidants, and preservatives) 
suitable for imparting, modifying, or augmenting the 
flavor of foods to which they are added, with the em- 
phasis on smell (odor, aroma), for example, artificial 
strawberry flavor for a gelatin dessert, natural orange 
flavor for a breakfast drink mix, artificial bacon flavor 
for a flavored dairy dip, or artificial meat flavor for a 
hamburger extender. Further processing before or 
after compounding can provide these flavors in dry 
or liquid form. They may be water- or oil-soluble, 
and are usually designed to be suitable for specific 
food products and processes. See FOOD ENGINEER- 
ING; FOOD MANUFACTURING. 

Regulations. Perhaps because spices were at one 
time used to mask deterioration of foods, and per- 
haps because other flavorings were at one time 
used to hide economic adulterations, a complex, not 


268 


Spice and flavoring 





TABLE 4. Economically important spices 








Poppy seed Annual of poppy family from Poland, 
Argentina, Iran, and Turkey 


Rosemary Evergreen of mint family from France, Spain, 
and Portugal 

Saffron* Plant of crocus family from Spain 

Sage* Member of mint family from Greece and 
former Yugoslavia 

Savory Member of mint family from France and Spain 

Sesame From sesame plant in U.S., Nicaragua, 
Salvador, Egypt, and Brazil 

Thyme Perennial of mint family from France and U.S. 

Turmeric* Member of ginger family from India, Haiti, 


Jamaica, and Peru 


Seed 
Leaf 


Stigma of flower 
Leaf 


Leaf 
Seed 


Leaf 
Rhizome 


Spice Plant and source Plant part used Principal use 
Allspice* Evergreen of myrtle family from Jamaica and Dried fruit Pickles, roast meat, ketchup 
(Pimenta) Guatemala 
Anise* Parsley family from Mediterranean Seed Baked goods, anisette (a liqueur) 
Basil Mint family grown in U.S. and other parts of Leaves and tender Sauces and soups, especially 
world stems tomato-based 
Bay Evergreen of laurel family from Mediterranean, Leaf Pickles, stews, soups, Sauces 
Turkey, Greece, and Portugal 
Caraway* Biennial of parsley family from north-central Seed Bread and baked goods, cheese, sauces, 
Europe and southern England kummel (a liqueur) 
Cardamon* Ginger family from Guatemala and India Seed Baked goods, coffee blends, curry 
powders 
Cayenne Capsicum family grown over most of the Pod or fruit Many foods, pickles, sauces, meats, 
world (name used for those high in curry powder 
pungency) 
Celery* Parsley family principally from France and Seed Sauces, salads, pickles, soups 
India 
Cinnamon* Evergreen member of laurel family from Bark Baked goods, pickles, candy 
Ceylon 
Clove* Evergreen tree of the myrtle family from Unopened bud Pork products, pickles, stews, meats, 
Madagascar and Zanzibar flower and gravies 
Coriander* Plant of parsley family from former Yugoslavia Dried fruit Frankfurters, bologna, baked goods 
and Morocco 
Cumin* Annual plant of parsley family from Iran and Dried fruit Chili powder 
Morocco 
Dill* Member of parsley family from India and Seed and leaves Pickles, soups, sauces 
domestic sources 
Fennel* Perennial of parsley family from India and Seed Baked goods, salad dressings, meat 
Rumania products 
Garlic* Member of lily family from U.S. and Bulb Baked meats, sauces, dressings, soups 
Mediterranean 
Ginger* Tuberous perennial from Jamaica, India, and Rhizome Soft drinks, baked goods, pickles, 
Africa puddings 
Mace Tropical tree similar to rhododendron from Aril covering Baked goods, processed meats 
Indonesia and West Indies nutmeg 
Marjoram Perennial of mint family from France, Peru, Leaves Soups, stews, sauces 
and Chile 
Mint Perennial herb of many varieties grown in Leaves Confections, sauces, jellies, gums 
US. 
Mustard* Annual of mustard family from U.S. in Seeds Sauces, baked meats, processed meats 
Montana and California and gravies 
Nutmeg* Tropical tree similar to rhododendron from Seed Baked goods, processed meats, meat, 
Indonesia or West Indies fruit sauces 
Oregano Perennial of mint family from U.S., Mexico, Leaves Sauces and stews, especially 
Italy, and France tomato-based 
Paprika* Member of capsicum family from Spain, U.S., Pod or fruit Red-colored dressings, sauces, meats, 
and Hungary condiments as ketchup 
Pepper* Perennial climbing vine from India and Berry (black); berry Virtually all food products, table use 
black Indonesia with cortex 
white removed (white) 


Toppings for baked goods and 
confections 
Meats, sauces, gravies 


Coloring rice and other specialties 
Sausage, poultry, poultry stuffing 


Meats, stuffings, salads, sauces 
Baked goods and confections 


Sauces for shellfish and with fresh tomato 
Yellow color in pickles, sauces, and fish 





*See individual articles for further information. 











always logical, series of laws and regulations has been 
enacted in the United States and most other devel- 
oped countries to prevent the use of flavorings in a 
deceptive manner. The basis of the laws and regula- 
tions varies from country to country, causing prob- 
lems in international trade. The Codex Alimentarius 
represents an effort to standardize the composition 
and identification of many major foods and food in- 
gredients. It is jointly sponsored by the United Na- 


tions Food and Agriculture Organization (FAO) and 
the World Health Organization (WHO) with support 
from other groups such as the International Organi- 
zation of Flavor Industries TOFD. 

Laws and definitions differ widely from country to 
country. In the United States the addition of spices 
and natural and artificial flavors to food CGincluding 
beverages and chewing gum) is permitted by law 
and governed by regulations. These regulations are 


included in the Code of Federal Regulations. Natu- 
ral flavors are those which contain only natural fla- 
voring ingredients; artificial flavors are those which 
contain at least some artificial flavoring ingredi- 
ents. Natural flavoring ingredients include essential 
oils, oleoresins, essences or extractives, protein hy- 
drolysates, distillates or any products of roasting, 
heating or enzymolysis which contain the flavor- 
ing constituents derived from spices, fruits, or fruit 
juices, vegetables or vegetable juices, edible yeast, 
herbs, barks, buds, roots, leaves or similar plant ma- 
terials, meats, seafood, poultry, eggs, dairy products, 
or fermentation products whose significant function 
is flavoring rather than nutritive. An artificial flavor 
is any substance whose function is to impart flavor 
which is not derived as a defined natural flavor in- 
gredient. In addition, any flavor ingredient—natural 
or artificial—must either be generally recognized as 
safe (GRAS) or covered by a regulation of the United 
States Food and Drug Administration. Earl J. Merwin 
Bibliography. K. Bauer and D. Garbo (eds.), Com- 
mon Fragrance and Flavor Materials, 3d ed., 1997; 
D. B. Birch and M. G. Lindley (eds.), Developments in 
Food Flavours, 1987; H. B. Heath and G. A. Reinec- 
cius, Flavor Chemistry and Technology, 1986; J. K. 
Jank, Spices: Their Botanical Origin, Their Chemi- 
cal Composition, Their Commercial Use, Including 
Seeds, Herbs, and Leaves, 1980; D. B. Min (ed.), Fla- 
vor Chemistry of Lipid Foods, 1989; I. D. Morton 
and A. J. MacLeod (eds.), Food Flavours, pt. A: In- 
troduction, 1982, pt. B: The Flavour of Beverages, 
1986; R. Taranishi, R. G. Buttery, and E Shadidi, Fla- 
vor Chemistry: Trends and Developments, 1989. 


III} 
Spider silk 
Spider silks are fine fibers produced from fibrous- 
protein solutions for use by spiders in the various 
functions required for their survival. Spiders are the 
only animals that use silk for a variety of functions 
in their daily lives, including reproduction, food cap- 
ture, and construction. Silk fibers produced by silk- 
worms to build their cocoons have been used in 
the textile industry for more than 5000 years. But 
in recent years, spider silk, particularly silk from the 
major ampullate glands used to make elaborate “orb” 
webs, has attracted the attention of scientists and en- 
gineers in many fields from evolutionary biology to 
mechanical engineering and material science. The 
exceptional toughness, stiffness, and extensibility of 
spider-silk fibers outperform even the best synthetic 
materials. With the recent and expected advances 
in molecular genetics and proteomics, as well as 
in the understanding of material properties in the 
nanoscale size range, orb-web-weaving spiders are 
ideal organisms for studying the relationship among 
the physical properties of protein-based materials in 
terms of their function and evolution. See ARANEAE; 
FIBROUS PROTEIN; NATURAL FIBER; SILK. 

Mechanical properties. Over the past 400 million 
years, many different kinds of silk have evolved in 
the nearly 37,000 known spider species. For every 


kind of spider silk that has been characterized in any 
detail, there are more than 1000 that have not been 
characterized. Silk from only a few species, such as 
Nephila clavipes (the golden orb-weaving spider) 
and Araneus diadematus (common garden spider), 
have been examined in detail. Orb-weaving spi- 
ders produce silk for different functions from as 
many as seven different glands (see illustration and 
Table 1). Dragline and viscid silk fibers have been 
the focus of recent research because they are among 
the toughest materials known. Toughness is a mea- 
sure of a material’s energy absorption capacity. A 
single silk fiber is only 1-10 micrometers in diam- 
eter, compared to a human hair which typically is 
close to 50 xm. Dragline silk will support the spi- 
der’s weight and is used to make the radial threads in 
webs. Viscid silk is the sticky, spiral threads in webs 
used for capturing prey. Table 2 lists the characteris- 
tic mechanical properties of dragline and viscid silk, 
compared to other materials. Most of the other ma- 
terials are either very strong but not tough and ex- 
tensible (such as steel) or are highly extensible but 
not strong (such as rubber). The extraordinary char- 
acteristic of dragline silk is that it has an exceptional 
combination of high stiffness, strength, extensibil- 
ity, and toughness. The breaking stress or strength 
of dragline silk is comparable to steel, Kevlar® (used 
in bullet-proof clothing), and carbon fibers, but it is 
more extensible and tougher. Viscid silk is even more 
extensible, with stiffness comparable to rubber, but 
it is much stronger than rubber. The high stiffness 
of dragline and the rubberlike extensibility of viscid 
silk with the exceptional ability to absorb energy 
appear to be optimized for their required functions. 
Scientists seek to understand how spiders are able to 
spin silk fibers from an aqueous protein solution with 
the desired properties. Using spider silk as a guide, 
it should be possible to make the next-generation 
high-performance (very strong and yet very extensi- 
ble) materials. See STRENGTH OF MATERIALS; STRESS 
AND STRAIN; YOUNG’S MODULUS. 

Structure. All types of silk have the same basic 
building blocks—protein polymers made of amino 
acids. Although the different silk glands in an evo- 
lutionarily advanced orb-weaving spider, such as N. 
clavipes, possibly evolved from a single type of gland, 
silk glands now differ significantly from one another 
in morphology and composition, and produce dif- 
ferent kinds of silk. The best-characterized silk pro- 
tein and associated fiber is the dragline silk of N. 
clavipes. Dragline-silk fibers are composed primar- 
ily of two protein polymers: spidroin I and spidroin 
II. Both spidroin I and II consist of repeated alter- 
nating protein sequences rich in alanine (Ala) and 
glycine (Gly) amino acids. These proteins are stored 
in the glands as concentrated solutions (30-50 wt %). 
A combination of physicochemical and rheological 
processes (including ion exchange and elongational 
flow) cause the protein molecules to self-assemble 
and transform from an unaggregated and soluble 
state in the gland, through an intermediate liquid- 
crystalline state, into an insoluble fiber, while passing 
through the spinning canal. 


Spider silk 


269 


270 


Spider silk 







" aggregate cylindricale 












flagelliforme 4 


minor ampullate: 
auxiliary spiral 


minor 
ampullate 


piriforme 


major ampullate: 
structural and 
dragline silk 





cylindrical: 
outer eggsac 









cylindrical: 
capture core threads aggregate: 


aqueous coat 


aciniform: 4 
wrapping silk 


and packing silk 





eiD piriform: 
Se = attachment cement 


Silk fibers and silk glands in Araneus diadematus. (Reprinted from F. Vollrath, Strength and structure, Rev. Mol. Biotechnol., 


74(2):67-83, 2000) 


After spinning, the dry silk fibers consist of stiff 
nanometer-scale crystals embedded in a softer pro- 
tein matrix. The nanocrystals are formed by the ag- 
gregation of hydrophobic polyalanine regions into a 
highly ordered crystalline 6-sheet via self-assembly, 
while the glycine-rich protein regions form the softer 
rubbery matrix. This combination of hard crystal- 
lites embedded in a soft matrix is considered the pri- 
mary reason for the exceptional mechanical strength 
and extensibility of dragline fibers. It is still not clear 
how spiders and silkworms prevent and regulate the 
formation of -sheet aggregates during the spinning 
process, but understanding of the mechanism is of 
great relevance for protein folding. An understanding 
of this mechanism could help in developing thera- 
pies for protein-folding diseases, such as Alzheimer’s, 
which result from the uncontrolled aggregation 
of proteins into f-sheet aggregates known as 
amyloid plaques. See AMINO ACIDS; POLYMER; PRO- 
TEIN. 

Mimicking spider silk. Although spidroin I and 
spidroin II have been sequenced, a complete under- 


standing of the various physicochemical processes 
during the spinning of silk fibers is still lacking. Be- 
fore we can completely mimic spider silk, we must 
fully understand the biochemistry, processing strat- 
egy, and molecular basis of the mechanical prop- 
erties of silk. Advances are being made on differ- 
ent fronts to emulate different aspects of spider 
silk. Three main approaches are being pursued to 
make materials with mechanical properties compa- 
rable to dragline silk. The first approach is to use re- 
combinant DNA methods to produce proteins with 
structure similar to spidroin I and II in other organ- 
isms, including potato, tobacco, Escherichia coli, 
and goat’s milk. The second approach is to synthesize 
high-molecular-weight block copolymers that mimic 
the molecular structure of silk proteins. The third 
approach is to develop strong, yet extensible syn- 
thetic composites by reinforcing elastomeric poly- 
mers with inorganic nanoparticles. To date, these 
approaches have had limited success. One resulting 
application is the spinning of fibers from recombi- 
nant silk proteins. 





TABLE 1. Types of common spider silk and their function 











Silk Gland Function 
Dragline Major ampullate Radial threads of web frame safety line 
Viscid Flagelliform Prey capture 
Gluelike Aggregate Prey capture, attachment 
Minor ampullate Minor ampullate Orb-web frame construction 
Cocoon Cylindriform Reproduction 
Wrapping Aciniform Wrapping captured prey 
Attachment Pyriform Attachment to environmental substrates 











TABLE 2. Tensile mechanical properties of spider silk and other materials 





Stiffness or Young’s 


Strength or breaking 


Toughness or maximum 
energy absorption, 
MJ/m? (Btu/in) 


Extensibility or 
maximum strain, % 





Material* modulus, GPa (kilo psi) stress, GPa (kilo psi) 
AD dragline silk 10 (1450) 1.1 (159) 
AD viscid silk 0.003 (0.43) 0.5 (72) 
NC dragline silk 22 (3190) 1.1 (160) 
NE dragline 8 (1160) 1.1 (160) 
Silkworm silk 7 (1015) 0.6 (87) 
Tendon, collagen 1.5 (217) 0.15 (22) 
Bone 20 (2900) 0.16 (23) 
Elastin 0.001 (0.145) 0.002 (0.29) 
Synthetic rubber 0.001 (0.145) 0.05 (7.25) 
Nylon fiber 5 (725) 0.95 (137) 
Kevlar 49 130 (18850) 3.6 (622) 
Carbon fiber 300 (43500) 4 (680) 
High tensile steel 200 (29000) ats (2174) 


27 160 (2.48) 
270 150 (2.32) 
9 150 (2.32) 
28 165 (2.56) 
18 70 (1.08) 
12 7.5 (0.12) 
3 4 (0.06) 
150 2 (0.03) 
850 100 (1.55) 
18 80 (1.24) 
2.7 50 (0.77) 
1.3 25 (0.39) 
0.8 6 (0.09) 





*AD: Araneus diadematus; NC: Nephila clavipes; NE: Nephila edulis. 











In addition to the material aspects, the silk- 
spinning process is far superior and more so- 
phisticated than industrial spinning processes for 
making polymeric fibers. Conventional industrial 
processes, such as wet spinning or dry spinning, 
operate only under very controlled conditions dur- 
ing which the polymer filaments are rapidly solid- 
ified or “quenched” to provide sufficient mechan- 
ical strength to be drawn into fibers. Spiders are 
able to spin silk fibers under widely different am- 
bient conditions, and the silk filaments rarely break 
during spinning. Even in the wet state during spin- 
ning, the silk filaments are strong enough to sup- 
port the weight of the spider. Microscopic analysis 
shows that the fibers have radially graded mechanical 
properties, with aligned, stiffer molecules surround- 
ing a softer elastic core. Spiders also produce dif- 
ferent silk fibers using the same apparatus through 
highly regulated protein self-assembly. A complete 
understanding of spiders’ spinning processes would 
help in developing new technologies that produce 
nanometer- and micrometer-scale fibers from self- 
assembling polymers and nanocomposite materials. 
See ELASTICITY; MANUFACTURED FIBER; NANOSTRUC- 
TURE. Nitin Kumar; Gareth H. McKinley 

Bibliography. C. L. Craig, Spiderwebs and Silk: Trac- 
ing Evolution from Molecules to Gene to Pheno- 
types, Oxford University Press, New York, 2003; 
J. Gosline et al., The mechanical design of spider 
silks: From fibroin sequence to mechanical func- 
tion, J. Exp. Biol., 202:3295-3303, 1999; D. Kaplan 
etal., Silk Polymers: Material Sciences and Biotech- 
nology, American Chemical Society, Washington 
DC, 1994; F Vollrath, Strength and structure of spi- 
ders’ silks, Rev. Mol. Biotechnol., 74:67-83, 2000; F 
Vollrath and D. P. Knight, Liquid crystalline spinning 
of spider silk, Nature, 410:541-548, 2001. 


ESE) 
Spilite 

An aphanitic (microscopically crystalline) to very 
fine-grained igneous rock, with more or less altered 
appearance, resembling basalt but composed of al- 


bite or oligoclase, chlorite, epidote, calcite, and acti- 
nolite. 

In spite of the highly sodic plagioclase, spilites are 
generally classed with basalts because of the low sil- 
ica content (about 50%). Spilites also retain many 
textural and structural features that are characteris- 
tic of basalt. 

Spilites are found most frequently as lava flows 
and more rarely as small dikes and sills. Spilitic 
lavas typically show pillow structure, in which the 
rock appears composed of closely packed, elon- 
gated, pillow-shaped masses up to a few feet across. 
Pillows are typical of subaqueous lava flows. Vesi- 
cles, commonly filled with various minerals, may 
give the rock an amygdaloidal structure. See SYN- 
CLINE. 

Once thought by some petrologists to be of 
uniquely igneous origin, spilites are now recog- 
nized as metamorphosed basalts. The metamor- 
phism which takes place consists of the reaction 
of primary Ca-rich plagioclase to produce the as- 
semblage of albite and one or more calcic miner- 
als such as epidote, calcite, prehnite, and pumpel- 
lyite. Primary pyroxene and olivine in the basalt 
convert typically to chlorite, actinolite, and serpen- 
tine. In many cases, except for the filling of vesi- 
cles, the texture of the original basalt is essentially 
preserved. Some basalt flows have endured spilitiza- 
tion of their permeable, vesicular tops, yet remain 
unaltered in their denser interior. The conversion 
of basalt to spilite is a hydrothermal metamorphism 
that results from the action of hot briny water which 
permeates the rock, taking advantage of the pil- 
low or vesicular structures to gain access to the 
primary minerals. In places where former oceanic 
crust is now exposed on land, spilite and kerato- 
phyre (spilitized andesites) pillow lavas are seen re- 
peatedly. This indicates that low-grade hydrothermal 
metamorphism is common in the oceanic crust. See 
BASALT; IGNEOUS ROCKS; METAMORPHISM; METASO- 
MATISM; PETROLOGY. William Ingersoll Rose, Jr. 

Bibliography. G. C. Amstutz et al. (eds.), Spilites and 
Spilitic Rocks, 1974; M. G. Best, Igneous and Meta- 
morphic Petrology, 2d ed., 2002. 


Spilite 


271 


272 


Spin (quantum mechanics) 


= 
Spin (quantum mechanics) 


The intrinsic angular momentum of a particle. It is 
that part of the angular momentum of a particle 
which exists even when the particle is at rest, as dis- 
tinguished from the orbital angular momentum. The 
total angular momentum of a particle is the sum of 
its spin and its orbital angular momentum resulting 
from its translational motion. The general properties 
of angular momentum in quantum mechanics imply 
that spin is quantized in half integral multiples of A 
(h/2x , where Þ is Planck’s constant); orbital angular 
momentum is restricted to half even integral multi- 
ples of h. A particle is said to have spin 7/., meaning 
that its spin angular momentum is 3/,h. See ANGULAR 
MOMENTUM. 

A nucleus, atom, or molecule in a particular en- 
ergy level, or a particular elementary particle, has a 
definite spin; for instance, a deuteron has spin 1, a 
Lit" nucleus in its ground state has spin 3/., and an 
electron has spin 1/2. The spin is an intrinsic or inter- 
nal characteristic of a particle, along with its mass, 
charge, and isotopic spin. See I-SPIN; SYMMETRY LAWS 
(PHYSICS). 

A particle of spin s has 2s + 1 spin states, since 
according to quantum mechanics the projection of 
an angular momentum of magnitude j along an axis 
can have the 27 + 1 integrally spaced values j, 7 — 1 
+ —j7 + 1, —j. These spin states represent an inter- 
nal degree of freedom of a particle, in addition to 
its external freedom of motion in three-dimensional 
space. 

In field theory, in which particles are regarded 
as quanta of a field, the spin of the particle is de- 
termined by the tensor character of the field. For 
instance, the quanta of a scalar field have spin 0 
and the quanta of a vector field have spin 1. A cel- 
ebrated theorem of quantum field theory, proved 
first by W. Pauli, states a connection between spin 
and statistics: A particle with half even integral spin 
obeys Bose-Einstein statistics and is called a boson; 
a particle with half odd integral spin obeys Fermi- 
Dirac statistics and is called a fermion. See ELECTRON 
SPIN; QUANTUM FIELD THEORY; QUANTUM MECHAN- 
ICS; QUANTUM STATISTICS. Charles J. Goebel 


E 
Spin-density wave 

The ground state of a metal in which the conduction- 
electron-spin density has a sinusoidal variation in 
space, with a wavelength usually incommensurate 
with the crystal structure. This antiferromagnetic 
state normally occurs in metals, alloys, and com- 
pounds with a transition-metal component. It occurs 
also, however, in quasi-one-dimensional organic con- 
ductors. See ANTIFERROMAGNETISM; CRYSTAL STRUC- 
TURE; ELECTRON SPIN. 

In a metal, the conduction-electron charge densi- 
ties for up and down spin are ordinarily equal. Their 
sum, po), exhibits a dependence on r having the 
same spatial periodicity as that of the lattice. A metal 
with a spin-density wave has, instead, charge den- 


sities given by Eq. (1). The spin density o @) is the 


P0 = 2P + pcosQ- r] a) 





difference between p,(r) and p-r), and is therefore 
given by Eq. (2), 


o(t) = Eppo(t) cosQ - r (2) 


where ê is a unit vector defining the axis of spin 
quantization, and p is the amplitude of the spin- 
density wave. The wave vector Q is determined by 
the conduction-electron Fermi surface. The wave- 
length, à = 27/Q, of the spin-density wave is thus 
in general not commensurate with the lattice period 
a. The ratio A/a varies with temperature, pressure, 
alloy composition, and other parameters, because of 
changes in the electronic structure or in the popula- 
tion of electron states as a function of these parame- 
ters. See FERMI SURFACE. 

Origin. The Pauli exclusion principle keeps elec- 
trons with parallel spin apart. The resulting re- 
duction in Coulomb interaction is called exchange 
energy. This reduction is increased when the 
conduction-electron charge densities are modulated 
according to Eq. (1). However, modulation requires 
an increase in conduction-electron kinetic energy. 
According to the spin-density-wave instability theo- 
rem, energy can always be further reduced if the 
quantum theory is treated in an approximation that 
neglects electron-electron scattering. Were it not for 
these latter correlations, every metal would have one 
or more spin-density waves. See EXCHANGE INTERAC- 
TION; EXCLUSION PRINCIPLE. 

The correlation energy, which contributes to 
a metal’s stability, arises primarily from scatter- 
ing of up-spin electrons by down-spin electrons. 
Since a spin-density wave tends to stratify opposite 
spin densities in alternating layers, as indicated in 
Eq. (1), the scattering is reduced, thereby tending to 
suppress the spin-density-wave instability. This con- 
trasts with a charge-density wave, where shifts of 
exchange and correlation energy act in unison. Con- 
sequently, spin-density-wave states are less likely to 
occur than charge-density-wave states. See CHARGE- 
DENSITY WAVE. 

Detection. Since the total electronic charge den- 
sity, the sum of p, and p_ in Eq. (1), is equal to po(r), 
a spin-density wave cannot be detected in an x-ray 
diffraction experiment. It leads, however, to a sinu- 
soidal magnetic field, from which neutrons, having a 
magnetic moment, can experience Bragg diffraction. 
At incommensurate points in wave-vector space, two 
magnetic satellites are observed on opposite sides of 
each crystallographic Bragg reflection. 

The magnetic origin of spin-density-wave satellites 
can be demonstrated in two ways. The first is their 
absence in an x-ray experiment. The second is by 
means of neutron diffraction with a spin-polarized 
beam, since only a magnetic reflection can cause re- 
versal of neutron polarization during diffraction. See 
NEUTRON DIFFRACTION; X-RAY DIFFRACTION. 

Occurrence. There are well over 100 materi- 
als which, over a temperature range, support a 


a 


Cc 
E 


SS FS 


4 AAAA 


VU 











99050A 


l I | T | T | | t i 
\ | \ \ | \ | \ i 
VW \ Vly ALY \ 


Oo >—0 >—_0>— <0 <0 —<—0 
y 
\ 
\ 
\ 











LSW TSW FAN 


Types of spin-density wave observed in metals: SS = spiral (helical); FS = ferromagnetic (cone); SS= skewed; LSW = 
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spin-density wave. These include some of the rare- 
earth elements of the lanthanide series and the 3d 
transition metals, manganese and chromium, the lat- 
ter being the prototypical itinerant electron antifer- 
romagnet. Many types of spin-density wave occur, 
since in general not only may the vector ê in Eq. (2) 
be parallel or perpendicular to the wave vector Q, 
corresponding to a longitudinal or transverse spin- 
density wave, respectively, but in the latter case ê 
may rotate about Q, corresponding to a helical spin- 
density wave. A uniform ferromagnetic magnetiza- 
tion may be superposed on the spin-density wave 
(see illus.). 

The occurrence of inelastic neutron-scattering 
peaks at incommensurate points indicates the exis- 
tence of spin-density-wave fluctuations in some met- 
als thought to be nonmagnetic (for example, copper 
and yttrium) when doped with magnetic impurities 
(manganese and gadolinium, respectively). This be- 
havior suggests that the spin-density-wave instability 
may be common, even in nontransition metals. 

Spin-density-wave fluctuations have been pro- 
posed as an essential feature for some forms of 
high-temperature superconductivity. In one class of 
quasi-one-dimensional organic conductors, the Bech- 
gaard’s salts, the spin-density-wave states give way to 
a superconducting state under pressure. In another 
case, (TMTSF),ClO,4 (TMTSF = tetramethyltetrasele- 
nafulvalene) is superconducting at low temperatures 
and at ambient pressure but develops a spin-density 
wave in a magnetic field. See ORGANIC CONDUCTOR; 
RARE-EARTH ELEMENTS; SUPERCONDUCTIVITY; TRAN- 
SITION ELEMENTS. Eric Fawcett 


Bibliography. E. Fawcett, Spin-density-wave antifer- 
romagnetism in chromium, Rev. Mod. Phys., 60:209- 
283, 1988; E. Fawcett, Spin-density waves and SDW 
fluctuations, Sov. J. Low Temp. Phys., 15:441-444, 
1989; Yu. A. Izyumov, Modulated, or long-periodic, 
magnetic structures of crystals, Sov. Phys. Usp., 
27:845-867, 1984; D. Jérome and H. J. Schulz, Or- 
ganic conductors and superconductors, Adv. Phys., 
31:299-490, 1982; N. I. Kulikov and V. V. Tugushev, 
Spin-density waves and itinerant antiferromagnetism 
in metals, Sov. Phys. Usp., 27:954-976, 1984; A. W. 
Overhauser, Spin-density waves in an electron gas, 
Phys. Rev., 128:1437-1452, 1962. 


C 
Spin glass 

One of a wide variety of materials that contain inter- 
acting atomic magnetic moments and also possess 
some form of disorder, in which the temperature 
variation of the magnetic susceptibility undergoes 
an abrupt change in slope, that is, a cusp, at a tem- 
perature generally referred to as the freezing temper- 
ature, 7; At temperatures below 7; the spins have no 
long-range magnetic order, but instead are found to 
have static or quasistatic orientations which vary ran- 
domly over macroscopic distances. The latter state 
is referred to as spin-glass magnetic order. Below T; 
these systems also exhibit magnetic hysteresis and 
slow relaxation phenomena. Analogous effects are 
observed in measurements of torque in an applied 
magnetic field and in electron paramagnetic reso- 
nance. 
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Spin-glass ordering is usually detected by means 
of magnetic susceptibility measurements, although 
additional data are required to demonstrate the ab- 
sence of long-range order. Closely related suscep- 
tibility cusps can also be observed by using neu- 
tron diffraction. It is not generally agreed whether 
spin glasses undergo a phase transition or not. See 
MAGNETIC SUSCEPTIBILITY; NEUTRON DIFFRACTION; 
PHASE TRANSITIONS. 

Spin-glass transition. An example of a spin-glass 
ordering transition is shown in Fig. 1, where mag- 
netic susceptibility x is plotted as a function of ab- 
solute temperature T for the case of dilute alloys 
of iron in gold. The iron solute atoms carry a mag- 
netic moment and occupy sites at random in the gold 
lattice. Neighboring moments interact with one an- 
other via the Ruderman-Kittel-Kasuya-Yosida (RKKY) 
exchange interaction U, given byEq. (1), which is 


ij 


mediated by the conduction electrons of the gold. 
The exchange constant Jy oscillates rapidly in sign 
with increasing distance between spins $; and $j. 
Anisotropic interactions analogous to the RKKY ex- 
change as well as dipolar and crystal field inter- 
actions are also believed to have important conse- 
quences for spin-glass properties. The sharp suscep- 
tibility peak in Fig. 1 occurs at a temperature 7; such 
that kg; has the value of a typical exchange con- 
stant J;, where kz is Boltzmann’s constant. In this 
case the necessary disorder arises from the random 
placement of the magnetic moments. The resulting 
configuration of exchange couplings is such that at 
low temperatures the minimum energy state is one 
where the moment orientations are only correlated 
locally and are effectively random over macroscopic 
distances. 

As seen in Fig. 1, the sharpness of the transition 
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Fig. 1. Magnetic susceptibility versus absolute 
temperature for two different concentrations of iron alloyed 
with gold. The data curves indicate measurements in 
several applied field strengths as shown. 1 Oe = 79.6 units. 
Magnetic susceptibility in SI units = magnetic susceptibility 
in emu multiplied by 47 = 12.57. (After V. Cannella and J. A. 
Mydosh, Magnetic ordering in gold-iron alloys, Phys. Rev., 
B6:4220-4237, 1972) 


is destroyed by the application of a relatively small 
magnetic field, Hp) ~ 100 oersteds (1 Oe = 79.6 
ampere-turns/m). This is evidence that the transition 
is a cooperative effect involving a large number of 
moments, since the motion of an isolated moment 
would be largely unaffected by a field of 100 Oe at 
this temperature. 

Some of the basic elements of the spin-glass tran- 
sition are reproduced in a mean-field model calcula- 
tion. In this model one calculates the thermodynamic 
behavior of a system of classical spins which interact 
via exchange couplings having the form given by Eq. 
(1) with, however, a distribution of J; values which 
is taken to be a gaussian random function. Below the 
temperature T, the system is found to be ordered 
in the sense that each spin $; will retain some mem- 
ory of its orientation SD at time ¢, at a much later 
time f, even though OD) = 0, where ( ) denotes 
an average over all spins. The ordering effect is ex- 
pressed by the Edwards-Anderson order parameter 
q= (Si) G SiD). Their theory gives Eq. (2) for the 
magnetic susceptibility x. Here x, = a/Tis the Curie 


X=xXA-@D (2) 


susceptibility, where a is the Curie constant. For 
T> T;, q = 0; and for T < T;, q='/2 (,/T)?—1]. Thus 
XCD) is just the Curie susceptibility for T > T, and for 
T < Tris given by Eq. @) for T;— T < T, where bisa 
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constant. Equation (2) therefore exhibits a cusp sim- 
ilar to that shown in Fig. 1. The Edwards-Anderson 
model also predicts a singularity in the specific heat 
at T;. However, this is not observed experimentally. 
Spin-glass ordering. Spin-glass ordering has been 
found to occur in a wide variety of disordered mag- 
netic materials, exhibiting several distinct types of 
disorder. The following list is intended to be illus- 
trative but not exhaustive. The dilute alloy Au:Fe 
cited above is disordered as a consequence of ran- 
dom placement of iron atoms in the gold lattice. For 
similar reasons other dilute alloys with noble-metal 
hosts (for example, Cu:Mn, Ag:Mn, and Au:Mn) are 
also spin glasses. Some of these systems show spin- 
glass behavior over a wide range of composition, lim- 
ited only by the Kondo effect and possible ferromag- 
netic behavior at low and high concentrations of the 
magnetic ion, respectively. At low concentrations the 
exchange interactions of these alloys are well charac- 
terized as to their strength and distribution. Another 
type of metallic spin glass is one which contains a 
relatively high concentration of magnetic ions and is 
structurally amorphous. Examples of this are GdAl, 
and (Fe,Mn,_»)o075(P16BeAls)o.25. In the former case 
the gadolinium ions are the magnetic constituent. 
Special techniques are required to prepare these ma- 
terials in a structurally amorphous state. The second 
example cited is from a family of materials known as 
metglass and is a reentrant spin glass, as discussed 
below. See AMORPHOUS SOLID; KONDO EFFECT. 
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Fig. 2. Magnetic susceptibility versus absolute temperature 
for the compound (Pdbo,.9965F€o.0035)1_xMn, for three values of 
the manganese atomic fraction x. The scale for the curve 
marked x = 0.065 has been magnified four times. (After G. J. 
Nieuwenhys, B. H. Verbeek, and J. A. Mydosh, Towards a 


uniform magnetic phase diagram for magnetic alloys with 
mixed types of order, J. Appl. Phys., 50:1685-1690, 1979) 


There are also a number of interesting cases of 
insulating spin glasses. The best-characterized and 
most thoroughly studied insulating spin glass is the 
system Eu,Sr,_,S. This compound is a ferromagnet 
for x = 1 as a consequence of the ferromagnetic 
nearest neighbor exchange interaction among the 
europium, while the original crystalline structure is 
retained. At sufficient dilution (wv < 0.5) the system 
becomes a spin glass through the agency of antifer- 
romagnetic second-neighbor exchange couplings. At 
higher europium concentrations this system also be- 
comes reentrant. 

Ferromagnetic to spin-glass transition. A number of 
compounds are found to form ferromagnetic order 
at some Curie temperature 7,, then exhibit a col- 
lapse of that order as the temperature is lowered 
below T,. Such a system is termed a reentrant spin 
glass, because the spin-glass phase appears below a 
temperature where the magnetically ordered higher 
temperature phase collapses. An example of this be- 
havior is shown in Fig. 2, where magnetic suscepti- 
bility versus absolute temperature is plotted for three 
concentrations of manganese (Mn) in the compound 
(Pdo,996sF€o,0035)1—-»Mn,.. For x = 0.01 the compound 
is a ferromagnet, with x saturated for all tempera- 
tures below 13 K. For x = 0.05 the compound be- 
comes ferromagnetic at T ~ 9 K, but the ferromag- 
netism collapses into spin-glass behavior for T < 3 
K. This is the reentrant-spin-glass case. For higher 
manganese concentrations (for example, x = 0.065) 
the ferromagnetism disappears, and the material is 
a simple spin glass as in Fig. 1. Whether the spin- 
glass transition of the ferromagnetic to spin-glass 
transition is a true phase transition is a matter of 
some controversy. Reentrant-spin-glass behavior has 
been observed in a variety of other systems as well, 
including Eu,Sr;_,S, Au,_,Fe,, Fe,Cr,;_,, and amor- 
phous (Fe;_,Mn,)o 75(P16BoAls)o.25. The first of these 
is a nonconductor, and the others are magnetic al- 
loys. See FERROMAGNETISM. R. E. Walstedt 


= 
Spin label 


A molecule which contains an unpaired electron spin 
which can be detected with electron spin resonance 
(ESR) spectroscopy. Molecules are labeled when an 
atom or group of atoms which exhibit some unique 
physical property is chemically bonded to a molecule 
of interest. Groups containing unpaired electrons in- 
clude organic free radicals and a variety of types 
of transition-metal complexes (such as vanadium, 
copper, iron, and manganese). Molecules with un- 
paired electron spins are readily detected with elec- 
tron spin resonance spectroscopy. Through analysis 
of ESR spectra, rates of molecular motion whose mo- 
tion is restrained by surrounding molecules can be 
determined. Measurement of rates of molecular mo- 
tion and molecular orientation has proved to be very 
important in the study of a variety of types of biolog- 
ically important problems. 

Electron spin resonance. This is a spectroscopic 
technique which detects transitions between elec- 
tron spin energy levels. When a molecule possess- 
ing an unpaired electron spin is placed in an exter- 
nal magnetic field, the external field interacts with 
the magnetic moment of the unpaired electron spin, 
producing two distinct energy states. In one state the 
unpaired electron is oriented with its magnetic mo- 
ment along the direction of the external field, while 
the other state has the moment of the unpaired elec- 
tron oriented against the external field direction. The 
separation of these two energy states is about 3 cm7! 
in an external field of 3000 gauss (0.3 tesla), and 
an absorption of radiation can be observed when 
a sample is irradiated with microwave radiation of 
about 3 cm wavelength. This absorption is observed 
as a single line in an electron spin resonance spec- 
trum. The wavelength at which absorption of en- 
ergy is observed depends on the magnitude of the 
external field and on a molecular property called the 
g value. 

Crystals. If a crystalline sample is used, different g 
values may be observed along the various axes of the 
crystal. When directional dependence is observed 
the g value is anisotropic, with principal values along 
the x, y, and z axes of the crystal denoted by gx, Sy) 
and gzz. If 8x. = yy, the sample has an axial sym- 
metric g tensor, and the g values are denoted as gy 
and g,. The unpaired electron spin is often delocal- 
ized over two or more atoms in the spin label. If the 
nuclei have nuclear spin other than zero, the mag- 
netic moment of the nuclei can interact with the 
magnetic moment of the electron spin and split the 
electron spin energy levels. When this is the case, 
the ESR spectrum shows a series of lines that de- 
pend on the number and type of nuclear spins in the 
molecule. The interaction between the electron and 
nuclear spins is called a hyperfine interaction, and 
the separation of the energy levels and ESR lines is 
called the hyperfine splitting. The magnitude of the 
interaction between nuclear and electron spins may 
depend on the orientation of the molecule with re- 
spect to the external field. The components of the 
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electron-nuclei anisotropic splitting are denoted by 
Axx, Ayy, and Azz, Or A, and Aj when the molecule 
has axial symmetry. 

Solutions. When a spin-labeled molecule is free 
to rotate in solution, the anisotropic g value and 
anisotropic hyperfine interaction are averaged by 
rapid reorientation of the label, and one observes 
the isotropic (angle-independent) g value and hyper- 
fine splitting in the ESR spectrum. As rotation of the 
molecule is slowed, the anisotropic interactions start 
to contribute, and the ESR spectrum is changed. A 
continuous change in the ESR spectrum is observed 
as the sample goes from the case of free rotation 
to the case of a rigid sample where motion is very 
slow. Analysis of ESR spectra permits determination 
of motional correlation times from about 107! to 
10-3 s. In many cases, determination of the direc- 
tion of motion of the labeled group (that is, rotation 
in all directions or preferred rotation around a single 
axis) is also possible. 

Biological studies. Analysis of the rate and type of 
motion of a spin label is important for a wide vari- 
ety of biological problems. The type of label used in 
these studies is generally a nitroxide free radical. The 
unpaired electron spin in a nitroxide radical is delo- 
calized over an N-O group, and an electron-nuclei 
hyperfine splitting is observed from the spin = 1 ni- 
trogen nucleus. The oxygen atom has a spin of zero. 
In solution an isotropic nitrogen hyperfine splitting 
of about 15 gauss (1.5 millitesla) is observed, while 
in a crystalline solid anisotropic splitting is observed 
with Az, = Aj = 32 gauss (3.2 millitesla) and Axx = 
A, = 6 gauss (0.6 mT). The z direction is defined by 
the 2p orbital of the nitrogen atom. 

Enzymes. Nitroxide spin labels have been bonded 
to a number of enzyme substrates with connecting 
chains of varying length between the label and the 
substrate. The labeled substrate is allowed to form a 
complex with the enzyme, and the ESR spectrum is 
monitored. The receptor site on the enzyme is often 
in a type of well within the protein, within which the 
substrate must fit. If the nitroxide label is connected 
to the substrate by a short chain, it lies within the 
well and rotational motion is inhibited. When a suf- 
ficiently long connecting chain is used, the nitrox- 
ide lies outside the protein and rotational motion is 
relatively free. Changes in motion of the label can 
be monitored with ESR, and the transition from hin- 
dered to free rotation as a function of the length of 
the connecting chain can be determined. The dis- 
tance from the substrate to the label can be calcu- 
lated from molecular models for labeled molecules 
with different connecting chain lengths. If the dis- 
tance at which free rotation starts is determined, the 
depth of the receptor site well of the enzyme can 
be calculated. Spin-labeling studies of this type pro- 
vide a powerful technique for the study of the ge- 
ometry and dimensions of receptors in enzymes. See 
ENZYME. 

Membranes. Spin labels have been used extensively 
to study the structure of membranes. Biological 
membranes consist mainly of a phospholipid bi- 
layer with embedded proteins. Other molecules 


(for example, cholesterol) may also be found in the 
lipid bilayer, and a variety of types of molecules may 
be bound to the surface. The phospholipids have 
polar head groups (hydrophilic) which are on the 
outside of the membrane, and nonpolar aliphatic car- 
bon chains which are on the inside of the bilayer. 
The lipids form ordered structures which are read- 
ily oriented in an external magnetic field. Spin la- 
bels have been bonded to the carbon atoms along 
the aliphatic chains of the lipids to study internal 
motion in the membrane bilayer. The ESR spectra 
of spin labels bound to lipids which have been ori- 
ented in an external field show that the lipid can 
rotate around its long axis, but that head-to-tail rota- 
tion is very slow. The relative motional correlation 
times of labels bound at different positions along the 
chain can be determined from ESR spectra. These 
studies show that motion is freer on the inside of 
the lipid away from the polar head group, indicating 
greater fluidity on the inside. Membrane structures 
can be changed by a number of kinds of perturba- 
tions, and spin labels have been used to monitor 
the effect of the perturbation. The structure of the 
phospholipid bilayer changes with temperature, and 
spin-labeling studies have been conducted to deter- 
mine phase transition temperatures. Molecules such 
as cholesterol or ethanol change the fluidity of the 
membrane, and spin-labeling studies have been con- 
ducted to monitor the effect of chemical perturba- 
tions on the fluidity of membranes. See CELL MEM- 
BRANES. 

When nitroxide labels are relatively close to one 
another, the odd electron spins in the nitroxide 
groups may be interchanged between molecules. 
This spin exchange reaction dramatically changes 
the appearance of the ESR spectrum of the labels, 
since the energy states are averaged by this inter- 
action. This interaction may be used to study the 
rate of diffusion of labeled molecules from an area 
of high concentration (strong exchange) to an area 
of low concentration (weak exchange). This type of 
investigation has been conducted to determine the 
rate of lateral diffusion of lipids in membranes. A 
high concentration of labeled lipid is initially intro- 
duced into a membrane, and an ESR spectrum from 
strongly exchange-coupled spin labels is observed. 
As the labeled molecules diffuse apart, the ESR spec- 
trum changes to that characteristic of labels with no 
exchange coupling. By monitoring the change in the 
ESR spectrum of the label as a function of time, it is 
possible to determine how fast lipids diffuse through 
membranes. Spin-labeling studies of these types have 
provided important information about the structure, 
organization, and rates of motion in membranes. 

Other applications. Spin labels have also been used 
to study the structure and organization of synthetic 
polymers and to study phase transitions. Investiga- 
tions using deuterium substitution for hydrogen with 
nuclear magnetic resonance detection have been 
conducted to study problems similar to those inves- 
tigated with nitroxide labels. Spin-labeling studies 
allow investigation of a variety of types of prob- 
lems which cannot readily be studied with other 


techniques. See ELECTRON PARAMAGNETIC RESO- 
NANCE (EPR) SPECTROSCOPY; FREE RADICAL; TRIPLET 
STATE. Robert Kreilik 

Bibliography. L. J. Berliner, Biological Magnetic 
Resonance, vol. 14: Spin Labeling, 1998; L. J. 
Berliner (ed.), Spin Labeling: Theory and Applica- 
tions, vols. 1 and 2, 1976, 1979; C. P. Poole, Jr., Elec- 
tron Spin Resonance, 2d ed., 1997. 


ae 
Spinach 

A cool-season annual of Asiatic origin, Spinacia oler- 
acea, belonging to the plant order Caryophyllales. It 
is grown for its foliage (Fig. 1) and served as a cooked 
vegetable or as a salad. New Zealand spinach (Tet- 
rogonia expansa) and Mountain spinach (Atriplex 
hortense) are also called spinach but are less com- 
monly grown. Spinach plants are usually dioecious. 
See CARYOPHYLLALES. 

Cultivation and harvesting. Propagation is by seed, 
commonly planted in rows 10-20 in. (25-50 cm) 
apart. Cool weather favors maximum production. 
High temperatures and long daylight periods en- 
courage seedstalk formation and reduce vegetative 
growth. Fall-seeded spinach is overwintered and har- 
vested in the spring in areas where the weather is 
mild or the crop is protected by snow cover. See 
PHOTOPERIODISM; SEED. 

Varieties (cultivars) are classified according to seed 
(smooth or prickly), leaves (smooth or savoyed), 
and seeding tendencies (early or late, or long stand- 
ing). Popular varieties are Long Standing Bloomsdale, 
Early Hybrid 7, Hybrid 424, Savoy Hybrid 621, and 
Viroflay. 

Harvesting by hand or machine begins when the 
plants have reached full size and before seedstalks 
form, usually 40-50 days after planting. 

H. John Carew 

Disease. Diseases may reduce yield and quality, 
and be limiting factors in the production of spinach 
for processing and fresh market. Some of the most im- 
portant diseases of spinach are Fusarium wilt, caused 
by Fusarium oxysporum f. sp. spinaciae, white 
rust, caused by Albugo occidentalis; downy mildew, 
caused by Peronospora spinaciae; blights, mosaics, 
and stunts caused by viruses and yellows and decline 
(Fig. 2). Several factors may cause a yellows and de- 





Fig. 1. Broad, mature leaves of spinach plants in the field. 
(Editorial Photocolor Archives) 





Fig. 2. Four seedlings with root and hypocotyl decay (left) 
and three healthy seedlings (right). 


cline syndrome; these include soil-borne pathogenic 
fungi and nematodes, nutrient imbalances, excess 
soil moisture, abrupt temperature changes, and pro- 
longed periods of cloudy, wet weather. 

Crop rotations, with spinach being planted only 
once every 3 years, and the use of disease-free seed 
are the most effective control measures for white 
rust, downy mildew, and other leaf spot diseases 
that survive in seed, plant debris, and soil. The wilt 
pathogen will survive in the soil indefinitely, so crop 
rotation is not an effective method of control. 

Some varieties are resistant to downy mildew and 
blight, but market type and quality are not always ac- 
ceptable. Fungicides can be used to control downy 
mildew and white rust, but fungicides are not avail- 
able to control other foliage diseases of less eco- 
nomic importance, such as Cercospora leaf spot and 
anthracnose. To help reduce the spread of epidemics 
of foliage diseases, spring spinach should not be 
planted in the vicinity of winter spinach. 

Viral diseases may be produced by destroying vol- 
unteer spinach plants and weeds that serve as alter- 
native hosts. Controlling insect vectors may slow the 
spread of viruses within a field and between fields. 
Early spring crops can be grown before blight be- 
comes prevalent in the northern United States, but 
the practice is ineffective in southern Texas, where 
vectors may be blown in by northerly winds. See 
PLANT PATHOLOGY. Donald R. Sumner 

Bibliography. A. E Sherf and A. A. MacNab, Veg- 
etable Diseases and Their Control, 2d ed., 1986. 
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Spinal cord 


The portion of the central nervous system within 
the spinal canal of the vertebral column, that is, the 
entire central nervous system except the brain. The 
spinal cord extends from the foramen magnum at 
the base of the skull to a variable level of the spinal 
canal; it terminates at the lumbar level in humans 
and extends well into the caudal region in fishes. 
Paired spinal nerves enter the spinal canal between 
each pair of vertebrae and connect with the spinal 
cord. The number of spinal nerves varies widely in 
vertebrates; in humans there are 31 pairs (8 cervical, 
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12 thoracic, 5 lumbar, 5 sacral, and 1 coccygeal). In 
tetrapods there are spinal cord enlargements where 
the large spinal nerves extend to the limbs, a cervi- 
cal enlargement cranially, and a lumbosacral enlarge- 
ment caudally. 

The internal organization of the spinal cord can 
be seen in a transverse section (Fig. 1). The outer 
portion of the spinal cord is made up of nerve fibers, 
most of which are oriented longitudinally and carry 
information between parts of the spinal cord, be- 
tween spinal cord and brain, and between brain and 
spinal cord. The outer white matter is divided into 
dorsal, lateral, and ventral columns. The interior of 
the spinal cord consists of gray matter and is divided 
into a dorsal sensory horn (or column) and a ventral 
motor horn (or column). In the thoracic and lumbar 
regions of the cord there is also a small lateral horn 
(or column) which contains preganglionic sympa- 
thetic neurons. In the very center of the bilaterally 
symmetrical spinal cord is a small central canal, lined 
with ependymal cells and containing cerebrospinal 
fluid. See SYMPATHETIC NERVOUS SYSTEM. 

Each spinal nerve divides into a dorsal sensory 
root and a ventral motor root before entering the 
spinal cord. The dorsal root contains the spinal gan- 
glion (Fig. 2), and then its axons enter the dorso- 
lateral aspect of the cord. Upon entering the spinal 
cord, each dorsal root axon divides and ramifies to 
send terminal branches to one or all of the following 
neuron groups: sensory neurons of the dorsal horn; 
motor neurons of the ventral horn; and long terminal 
branches to neurons in the brain. The motor neurons 
of the ventral horn, in addition to receiving synapses 
from dorsal root axons, also receive synaptic endings 
from neurons in other parts of the spinal cord and 
from long axons coming from the brain. The axons 
of the ventral horn neurons leave the cord through 
the ventral root of the spinal nerve and run with pe- 
ripheral nerves to innervate the muscles of the body. 
With this complex synaptic and fiber organization, 
the spinal cord can act as the integrating center for 
spinal reflexes (such as the knee jerk reflex), send 
sensory information from the brain, and receive in- 
formation from the brain to initiate or inhibit muscu- 
lar activity. See MOTOR SYSTEMS; SENSATION. 

Other than size and shape differences, the spinal 
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Fig. 1. Human spinal cord sectioned to show the white and 
gray matter. (After G. M. Wyburn, The Nervous System, 
Academic, 1960) 
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Fig. 2. Brain and spinal cord, illustrated in ventral aspect. 
Nerve roots and chief autonomic ganglia are shown. (After 
H. Morris et al., Comparative Anatomy, Blakiston-McGraw- 
Hill, 1936) 


cords of vertebrates are basically similar. However, 
in fishes some or all of the visceral motor fibers pass 
through the dorsal roots rather than the ventral roots 
of spinal nerves. See AUTONOMIC NERVOUS SYSTEM; 
NERVOUS SYSTEM (VERTEBRATE). Douglas B. Webster 


C 
Spinal cord disorders 


In addition to those disorders common to the brain, 
the spinal cord is subject to certain lesions because of 
its position or structure. A few of the more important 
are mentioned. 

Injury. Spinal cord injury results from dislocations, 
fracture, or compression in many cases, but a special 
form, called spinal shock, may result from a severe 
blow without actual distortion of adjacent tissue. In 
this case, there is a temporary paralysis which grad- 
ually clears in a length of time related to the severity 
of the paralysis. In direct damage, the cord may be 
slightly, partially, or completely damaged at one or 
more levels. Typical motor and sensory losses follow, 
with a poor prognosis for recovery if the nerve tissue 
is severely injured. Occasionally, similar damage may 
result from a hemorrhage produced by injury. 

A fairly common type of potential cord injury is 
seen in a number of cases of slipped disks, in which 
the inner, soft part of the vertebral column extrudes 
into the spinal canal. If this compresses the cord, 
functional loss of temporary or permanent degree 


can follow; more often, pressure is exerted on spinal 
roots so that pain, numbness, and some type of mus- 
cle weakness intervene. 

Tumors. Spinal cord tumors are not infrequent and 
most of these are of two types, the metastatic, from 
a primary source elsewhere in the body, and the tu- 
mors of the meninges or connective tissue related to 
the cord. The latter include neurofibromas, menin- 
giomas, and gliomas, which occur most often. The 
signs and symptoms and the extent of damage relate 
largely to the physical compression of the cord at a 
particular level. See TUMOR. 

Congenital defects. A few of the more common 
congenital defects involving the cord include an un- 
closed neural canal, or spina bifida, and reduplicated 
or otherwise malformed cords, such as those caught 
in an external sac of other tissues, the meningomye- 
locoele. 

Inflammation. Inflammations may result from 
known or unknown agents and in meningitis may 
involve primarily the coverings; in myelitis, the cord 
itself. The meningococcus, pneumococcus, strepto- 
coccus, tubercle bacillus, and other microorganisms 
frequently cause meningitis. The most widely known 
cause of myelitis, of course, is the poliomyelitis 
virus group, although other agents will also produce 
inflammations, such as typhus, spotted fever, and 
Treponema pallidum, the cause of syphilis. Other 
diseases such as malaria, amebiasis, trichinosis, ele- 
phantiasis, toxoplasmosis, and blastomycosis may 
display their effects on the spinal cord. 

Vascular diseases. Not uncommonly motor or sen- 
sory deficits in the spinal cord may result from spe- 
cific vascular diseases and from such nonspecific 
conditions as injury, aneurysms, or congenital de- 
fects, and rarely, from blood vessel tumors. 

Nerve tract degeneration. An ill-defined group of 
disorders characterized by degeneration of nerve 
tracts or myelin sheaths of the cord is found more 
often than one would suspect. In these, some un- 
known or poorly understood mechanism causes the 
deterioration of cells and fibers so that function 
is altered, then lost, and the nervous tissue is ei- 
ther replaced by a scar or a softening of cystlike 
area remains. Multiple sclerosis, combined degen- 
eration associated with pernicious anemia, Parkin- 
son’s disease, postinfectious encephalomyelitis, and 
syringomyelia are examples. See MULTIPLE SCLEROSIS; 
PARKINSON’S DISEASE. 

Drugs. Finally, the effect of drugs or chemicals and 
metabolic disorders is often seen as cord depres- 
sion or stimulation with variable prognoses and per- 
manence of effects. These include alcoholism, car- 
bon monoxide poisoning, lead intoxication, vitamin 
deficiency, and certain familial diseases. See ALCO- 
HOLISM; POISON; SPINAL CORD; VITAMIN. 

Edward G. Stuart; N. Karle Mottet 


E III 
Spinicaudata 

An order of fresh-water branchiopod crustaceans for- 
merly included in the order Conchostraca. The entire 
body and its appendages are covered by a bivalve 


Spinning (textiles) 


carapace up to about 17 mm (0.7 in.) long, hence 
the name clam shrimp. The carapace is often bilat- 
erally compressed and usually displays several lines 
of growth. The head cannot be protruded. It usu- 
ally bears paired sessile eyes, but these are fused in 
Cyclestheria. Locomotion is by means of large an- 
tennae. The mandibles are of the rolling, grinding 
type. 

There are more than 16 pairs of similar foliaceous 
trunk limbs (at least 32 in some species); these 
draw in particles of food by beating in metachronal 
rhythm. In the male, the first two pairs (the first pair 
only in Cyclestheria) are modified as claspers. The 
trunk terminates in a robust telson that bears a pair 
of stout claws. 

Reproduction is usually bisexual. Usually, the fe- 
male carries a clump of numerous, very small, 
drought-resistant eggs, sometimes of complex shape, 
that are shed at the next molt. In Cyclestheria a few 
larger parthenogenetic eggs are usually brooded in a 
dorsal brood pouch beneath the carapace. Eggs, ex- 
cept the parthenogenetic eggs of Cyclestheria, hatch 
as nauplii. All species frequent temporary pools in 
many parts of the world, but Cyclestheria also oc- 
curs in permanent waters. See BRANCHIOPODA. 

Geoffrey Fryer 
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Spinning (metals) 
A production technique for shaping and finishing 
metal. In the spinning of metal, a sheet is rotated and 
worked by a round-ended tool, controlled manually 
or mechanically. The sheet is formed over a mandrel. 
Spinning operations are usually carried out on a 
special rigid lathe fitted only with a driving head- 
stock, tail spindle, and tool rest. Surface speeds of 
500-5000 ft/min (2.5-25 m/s) are used, depending 
on material and diameter. The work is rubbed with 
soap, lard, or a similar lubricant during working. The 
operation can be set up quickly, and thus is desirable 
for short runs or for experimental units subject to 
change. Spinning may serve to smooth wrinkles in 
drawn parts, provide a fine finish, or complete a 
forming operation as in curling an edge of a deep- 
drawn part. Other operations include smoothing, 
necking, bulging, burnishing, beading, and trim- 
ming. Spun products range from precision reflectors 
and nose cones to kitchen utensils. Such materials as 
steel, aluminum, copper, and their softer alloys are 
spun in thicknesses up to 18 in. G mm). It may be 
necessary to anneal the metal during the spinning. 
See SHEET-METAL FORMING. Ralph L. Freeman 


EE 
Spinning (textiles) 

The fabrication of yarn (thread) from either discon- 
tinuous natural fibers or bulk synthetic polymeric 
material. In a textile context the term spinning is ap- 
plied to two different processes leading to the yarns 
used to make threads, cords, ropes, or woven or knit- 
ted textile products. 
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Spiral 


Natural fibers, such as wool, cotton, or linen, are 
generally found as short, entangled filaments. Their 
conversion into yarn is referred to as spinning. After 
a carding operation on the raw material to disen- 
tangle the short filaments, the filaments are drawn 
(drafted) to promote alignment in an overlapping 
pattern and then twisted to form, by mechanical in- 
terlocking of the discontinuous filaments, a resistant 
continuous yarn. See COTTON; LINEN; WOOL. 

The term spinning is also used for the production 
of monofilaments from synthetic polymers—for 
example, polyamides or nylons, polyesters, and 
acrylics—or modified natural polymers, such as 
cellulose-rayon. The process involves forcing a pres- 
surized fluid—either a melt, solution, or chemi- 
cal intermediate—through many small orifices of a 
spinnerette and subsequently solidifying the emerg- 
ing monofilaments by cooling, drying, or coagula- 
tion. Generally the monofilaments are then stretched 
(drawn) to increase their strength by promoting 
molecular orientation and are wound as yarn which 
can be used directly for threads, cords, or ropes. 
Such yarn, however, is often cut into relatively short 
lengths (staple) and reformed by a process similar 
to that used for natural fibers into a yarn more suit- 
able, in terms of appearance and feel, for making 
certain textile products. See MANUFACTURED FIBER; 
NATURAL FIBER; TEXTILE. J. M. Charrier 

Bibliography. K. L. Hatch, Textile Science, 1993; 
M. L. Joseph et al., Joseph’s Introductory Textile Sci- 
ence, 6th ed., 1992; M. Raheel (ed.), Modern Tex- 
tile Characterization Methods, 1996; B. Wulfhorst, 
T. Gries, and D. Veit, Textile Technology, 2006. 


M] 
Spiral 

A term used generically to describe any geometrical 
entity that winds about a central point or axis while 
also receding from it. Spiral staircases, helices, non- 
planar loxodromes (curves that intersect those of a 
given class at a constant angle, for example, rhumb 
lines, in case the curves are on a sphere whose merid- 
ians form the given class) are examples of spirals 
whose windings do not lie in a plane. The planar spi- 
ral curve p = a0, in polar coordinates p and 6 (see 
illus.), was introduced by Archimedes in his book 
On Spirals. Other planar spiral curves are logarith- 
mic, p = e®, the tangents to which make a constant 
angle with the radii vectors drawn to the points of 


Pp | 





Planar spiral curve, or Archimedes’ spiral. 


contact; hyperbolic, p0 = a; and the lituus, p70 = 
a’. See HELIX. Leonard M. Blumenthal 


TE) 
Spiriferida 

An extinct order of brachiopods, in the subphylum 
Rhynchonelliformea, that inhabited shallow seas of 
the Paleozoic and early Mesozoic. It was the most di- 
verse group of spire-bearing brachiopods (those that 
contain a spirally coiled calcareous structure called 
the brachidium used to support the lophophore). 
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Spiriferida. (a-b) Dorsal and lateral views of Neospirifer, 
Permian, and (c) with dorsal valve removed showing spiral 
lophophore supports. (d) Diversity history of spiriferid 
families. (a-c after R. C. Moore, C. G. Lalicker, and A. G. 
Fischer, Invertebrate Fossils, McGraw-Hill, New York, 1952; 
d after J. J. Sepkoski, Jr., A Compendium of Fossil Marine 
Animal Families, 2d ed., Milwaukee Public Mus. Contrib. 
Biol. Geol., no. 83, 1992) 


Spiriferids also possess unequally biconvex valves 
that are externally smooth or radially ribbed. The 
valves are generally strophic (straight hinge) with a 
well-developed interarea commonly limited to the 
ventral valve (illus. a-c). Shells may be punctate 
(possessing small perforations filled by outer epithe- 
lial tissue) or impunctate. Punctate shells arose in 
stratigraphically younger spiriferids; the presence or 
absence of punctae serves to distinguish two major 
taxonomic groups. 

The evolutionary origin of spiriferids is unclear. 
One hypothesis has the spiriferids derived from 
the rhynchonellids via another spire-bearing group 
called the athyrids. A competing hypothesis has the 
spiriferids derived directly from the orthids and the 
other spire-bearing groups derived from the rhyn- 
chonellids. Clarification of this problem will require 
additional analyses of new fossil material. 

Spiriferids were sessile, attached, epifaunal sus- 
pension feeders. Most had a functional pedicle, used 
for attachment to the substrate. The earliest spir- 
iferids appeared in deep-water environments in the 
Late Ordovician and thus did not take part in the 
Ordovician radiation of marine life. Spiriferids first 
diversified in the Silurian following the Late Or- 
dovician mass extinction (illus. d). They, along with 
strophomenids and pentamerids, became dominant 
members of marine benthic communities, replacing 
the orthids. They reached their acme in diversity and 
dominance of communities in the Devonian. They 
suffered vast extinction in the Late Devonian, but re- 
bounded in diversity and were still dominant mem- 
bers of communities in the late Paleozoic. Spiriferid 
diversity dropped dramatically following the late Per- 
mian mass extinction and never returned to Paleo- 
zoic levels. Spiriferids suffered another big decline 
in the Late Triassic mass extinction, and finally died 
out in the Middle Jurassic. See BRACHIOPODA; ARTIC- 
ULATA (ECHINODERMATA). Mark E. Patzkowsky 

Bibliography. P. Copper and R. Gourvennec, Evolu- 
tion of the spire-bearing brachiopods (Ordovician- 
Jurassic), in P. Copper and J. Jin (eds.), Brachiopods, 
A. A. Balkema, Rotterdam, 1996. 


E=) 
Spirometry 

The measurement, by a form of gas meter, of vol- 
umes of gas that can be moved in or out of the lungs. 
The classical spirometer is a hollow cylinder (bell) 
closed at its top. With its open end immersed in a 
larger cylinder filled with water, it is suspended by a 
chain running over a pulley and attached to a coun- 
terweight (see illus.). The magnitude of a gas volume 
entering or leaving is proportional to the vertical ex- 
cursion of the bell. In another type of spirometer, 
the bell is wedge shaped and is constrained to rotate 
on an axis located along its apex. Volume changes 
are proportional to the arc traversed. 

Volume changes can also be determined from mea- 
surements of flow, or rate of volume change. Flow 
may be measured continuously by a pneumotacho- 
graph. A resistive element, such as a bundle of fine- 
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Diagram of a spirometer. 




















bore tubes, is inserted in the tube through which 
breathing occurs. The pressure drop across the re- 
sistance is proportional to flow and can be sensed 
and recorded continuously by a transducer that gen- 
erates an electrical signal. The flow signal can be 
continuously integrated to yield a volume trace. 

The volume of gas moved in or out with each 
breath is the tidal volume; the maximal possible value 
is the vital capacity. Even after the most complete ex- 
piration, a volume of gas that cannot be measured by 
the above methods, that is, the residual volume, re- 
mains in the lungs. It is usually measured by a gas 
dilution method or by an instrument that measures 
blood flow in the lungs. Lung volumes can also be 
estimated by radiological or optical methods. 

At the end of an expiration during normal resting 
breathing, the muscles of breathing are minimally ac- 
tive. Passive (elastic and gravitational) forces of the 
lungs balance those of the chest wall. In this state the 
volume of gas in the lungs is the functional residual 
capacity or relaxation volume. Displacement from 
this volume requires energy from natural (breathing 
muscles) or artificial (mechanical) sources. See RES- 
PIRATION. Arthur B. Otis 


re) 
Spirophorida 

An order of sponges of the class Demospongiae, sub- 
class Tetractinomorpha, with a globular shape and a 
skeleton of oxeas and triaenes as megascleres, and 
microspined, contorted, sigmalike microscleres. 

Internal buds or gemmules may be formed. The 
eggs become affixed to substratum by means of long 
pseudopodia and undergo cleavage and morphogen- 
esis without a free larval stage. 

Species of Tetilla may have basal spicule tufts 
functioning to anchor the sponge in sandy bottoms. 
Species of Cinachyra, with ostia and oscules re- 
stricted to surface depressions, are common inshore 
inhabitants of coral reef lagoons. Members of this 
order range down to depths of at least 5900 ft 
(1800 m). See DEMOSPONGIAE. Willard D. Hartman 


Spirophorida 
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Spirurida 
An order of nematodes in which the labial region is 
most frequently provided with two lateral labia or 
pseudolabia; in some taxa there are four or more 
lips; rarely lips are absent. Because of the variabil- 
ity in lip number, there is variation in the shape of 
the oral opening, which may or not be surrounded 
by teeth. The amphids are most often laterally lo- 
cated; however, in some taxa they may be located 
immediately posterior to the labia or pseudolabia. 
The stoma may be cylindrical and elongate or rudi- 
mentary. The esophagus is generally divisible into an 
anterior muscular portion and an elongate swollen 
posterior glandular region, where the multinucleate 
glands are located. Eclosion larvae are usually pro- 
vided with a cephalic spine or hook and a porelike 
phasmid on the tail. 

All known spirurid nematodes utilize an inver- 
tebrate in their life cycle; the definitive hosts are 
mammals, birds, reptiles, and, rarely, amphibians. 








Fig. 1. Thelazia platyptera, a parasitic spiruroid nematode. (a) Anterior end of male, lateral 
view; (b) caudal end of male, lateral view; (c) caudal end of male, ventral view; (d) anterior 
end of female, lateral view; (e) caudal end of female, lateral view; (f) egg. (Parasite 
Laboratory, Agricultural Research Service, Beltsville, Maryland) 


The order is divided among four superfamilies: 
Spiruroidea, Physalopteroidea, Filarioidea, and Drilo- 
nematoidea. 

Spiruroidea. The superfamily Spiruroidea com- 
prises parasitic nematodes whose life cycle always 
requires an intermediate host for larvae to the third 
stage. The definitive hosts are mammals, birds, fishes, 
reptiles, and rarely amphibians; and spiruroids may 
be located in the host’s digestive tract, eye, or 
nasal cavity or in the female reproductive system 
(Table 1). Morphologically (Figs. 1 and 2), the lip 
region is very variable in Spiruroidea, ranging from 
four lips to none. When lips are present, the lat- 
eral lips are well developed and are referred to as 
pseudolabia. The cephalic and cervical region may 
be ornamented with cordons, collarettes, or cuticu- 
lar rings. The stoma is always well developed and is 
often provided with teeth just inside the oral open- 
ing. In birds the nematodes are often associated with 
the gizzard, and the damage caused results in death, 
generally by starvation. When the muscles of the giz- 
zard are destroyed, seeds pass intact and cannot be 
digested. 

This superfamily contains the largest of all known 
nematodes, Placentonema gigantissima, parasitic 
in the placenta of sperm whales. Mature females at- 
tain a length of 26 ft (8 m) and a diameter of 1 in. 
(2.5 cm). They are unusual not only because of their 
great length but also because the adult female has 
32 ovaries, which produce great numbers of eggs. 
The eggs are not unusually large. Armand Maggenti 

Filarioidea. The superfamily Filarioidea contains 
highly specialized parasites of most groups of verte- 
brates. They are particularly common in amphibians, 
birds, and mammals. While they cannot be classified 
as completely harmless, most of the many hundreds 
of known species are not associated with any recog- 
nized disease. A limited number of species produce 
serious diseases in humans, and a few others pro- 
duce serious diseases in domestic or wild animals 
(Table 2). The filarial parasites of humans are found 
almost exclusively in the tropics, with some exten- 
sion into the subtropics. This is by no means the 
universal pattern. Heartworm disease in dogs has be- 
come established in the north temperate zone in the 
United States and Canada and in the south temperate 
zone in Australia. Several other enzootic species are 
known in northern amphibians, birds, and mammals. 
See HEARTWORMS. 

Morphology of adults. Among the Filarioidea there are 
no conspicuous divisions into distinct body regions. 
True lips are absent, and the mouth cavity is greatly 
reduced. The cuticule is devoid of prominent orna- 
ments found on many nematodes. The female gen- 
ital opening is usually located in the region of the 
esophagus-intestine junction in the anterior portion 
of the worm. Sexual dimorphism is the rule; in com- 
mon with other nematodes, the female filarioid is at 
least twice as long as the male, and often the dif- 
ference is much greater. Among the species found in 
humans or dogs, the females vary from 1 to 28in. (2.5 
to 70 cm) long. Sensory organs, the cephalic papil- 
lae, are reduced in size and number. The eggs in a 

















TABLE 1. Common spiruroid parasites of domestic animals 
Intermediate 
Parasite Definitive host host Distribution Pathology 

Ascarops strongylina Swine Beetles Cosmopolitan Inflammation of stomach 
(Rudolphi, 1819) mucosa 

Cheilospirura hamulosa Fowl, turkey Grasshoppers, N. America, S. America, Asia, Soft nodules in muscul- 
(Diesing, 1851)* beetles Europe ature of gizzard 

Dispharynx nasuta Fowl, turkey, Isopods N. America, S. America, Ulcers and gland des- 
(Rudolphi, 1819) pigeon, guinea Europe, Asia truction in proven- 

fowl, pheasant triculus 

Echinuria uncinata Duck, goose, Water fleas N. America, Europe, Asia, Inflammation and nodules 

(Rudolphi, 1819) swan N. Africa in proventriculus and 
gizzard 

Draschia megastoma Equines Musca sp. Cosmopolitan Tumors in stomach wall 
(Rudolphi, 1819) 

Draschia microstoma Equines Stable fly, Cosmopolitan Catarrhal gastritis, stom- 
(Schneider, 1866) housefly ach ulcers 

Habronema muscae Equines Musca sp. Cosmopolitan Catarrhal gastritis 
(Carter, 1861) 

Oxyspirura mansoni Fowl, turkey, Cockroaches Southern U.S., S. Asia, Lesions in eyes, blindness, 
(Cobbold, 1879) peafowl Australia, S. America loss of eyeball 

Tetrameres americana Fowl, turkey Grasshoppers, N. America, S. America, Africa, Irritation and inflammation 
(Cram, 1927) cockroaches Asia, Europe of proventriculus 

Thelazia californiensis Sheep, cat, dog Fannia thelazia California Scar tissue formation in 
(Price, 1930) eyes, blindness 

Thelazia callipaeda Dog, rabbit Unknown Far East Scar tissue formation in 
(Railliet and Henry) eyes, blindness 

Thelazia rhodesii Cattle, sheep, Musca sp. Europe, Asia, Africa Scar tissue formation in 
(Desmarest, 1828) goat, buffalo eyes, blindness 
“Occasional parasite of humans. 











Fig. 2. Oxyspirura mansoni, a common parasite of poultry. 
(a) Tail of male, lateral view; (b) tail of male, ventral view; 
(c) tail of female; (d) eggs. (Parasite Laboratory, 
Agricultural Research Service, Beltsville, Maryland) 


few species have thick shells but more commonly are 
covered only by a thin vitelline membrane. The eggs 
characteristically undergo partial embryonation in 
the uterus, and precociously active embryos (micro- 
filariae) are discharged. The adult worms are found 
in a wide variety of places in the body of the ver- 
tebrate host, but each particular species has its pre- 
ferred host and preferred location within that host 
(Table 2). 

Microfilariae. These precociously active embryos are 
devoid of any alimentary canal (Fig. 3). The differ- 
ent species range from less than 100 to more than 
700 micrometers long and 2 to 8 wm wide; they may 
or may not have a cephalic spear. The body is filled 
with a long column of somatic cells. Among them 
may be distinguished, with special staining, four cells 
variously termed R cells or G cells from which the 
alimentary canal and the reproductive system will 
develop. In some species the microfilariae are dis- 
charged still enclosed in the vitelline membrane and 
are termed “sheathed”; without the membrane they 
are “unsheathed.” Most of the known forms have 
microfilariae in the blood regardless of the tissue 
in which the adults occur. These microfilariae may 
continue to circulate for months to years depend- 
ing on the species and the level of host immune 
response. They do not necessarily circulate in con- 
stant numbers. Some species show a conspicuous 
daily periodicity, either nocturnal or diurnal, in the 
peripheral circulation. It has been shown with mi- 
crofilariae of Wuchereria bancrofti of humans and 
with the unidentified microfilariae in the crow that 
the periodicity can be reversed in less than a week by 
reversing the active period (work, feeding, light, and 
so on) and the resting period (sleeping, darkness). 
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TABLE 2. Some common and better-known Filarioidea 





Species 


Definitive host 


Location in the host 


Disease 





Wuchereria bancrofti 
Periodic 


Aperiodic 
Brugia malayi 
B. timori 
B. pahangi 


Human 


Human 
Human 
Human 


Wild felines, human 


Deep lymphatics 


Deep lymphatics 
Deep lymphatics 
Deep lymphatics 
Deep lymphatics 


Elephantiasis 


Elephantiasis 
Elephantiasis 
Elephantiasis 
Elephantiasis 





Onchocerca volvulus 
O. cervicalis 

Loa loa 
Stephanofilaria stilesi 


Elaeophora schneideri 


Human 


Horse 
Human 
Cattle 


Mule deer, sheep, 
moose, elk 


Subcutaneous lymphatics 


Ligamentun nuchae 


Subcutaneous tissues, eyes 


Skin 


Distal arteries 


Fibrous-nodules, blindness 

Slight irritation 

Calabar swellings (nonpitting edema) 
Dermatitis 


Dermatitis, “clear-eyed blindness” 





Dipetalonema perstans 


D. streptocerca 

D. reconditum 
Mansonella ozzardi 
Dirofilaria immitis 


Dirofilaria repens 
Litomosoides carini 





Human 


Chimpanzee 
Dog 

Human 

Dog 


Dog 
Cotton rat 


Peritoneal and pleural 
cavities 

Subcutaneous 

Subcutaneous tissues 

Body cavities 

Right heart 


Subcutaneous tissues 
Pleural cavity 


Minor allergies 


Slight edema 

Nonreported 

Very little 

Chronic debilitation (heartworm disease) 


Eczema 
Pleural adhesions 








Furthermore, an annual periodicity has been shown 
to occur with one species, the microfilariae of Diro- 
filaria immitis, in the dog, in the temperate zones; 
the microfilariae are most plentiful in the periph- 


spear sheath 
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Fig. 3. Microfilariae. (a) Wuchereria bancrofti. (b-g) Posterior ends of the six species 
normally found in humans: (b) W. bancrofti; (c) Brugia malayi; (d) Mansonella ozzardi; 
(e) Loa loa; (f) Dipetalonema (Acanthocheilonema) perstans; (g) Onchocerca volvulus. 
Of these, b-f are found in blood, g is found in skin. 


eral blood during the summer with its higher tem- 
peratures and longer periods of daylight. When 
the microfilariae are not in the peripheral circula- 
tion, they are sequestered in the capillary bed of 
internal organs, notably the lungs. They are inca- 
pable of further development as long as they remain 
in the definitive host. 

Life cycle. All the known filariae require a blood- 
sucking arthropod intermediate host, usually an in- 
sect and commonly a dipteran, in which to com- 
plete embryonation. The microfilariae are ingested 
as the arthropod feeds. After embryonation is com- 
pleted, the resulting infective larvae gain entrance 
into the definitive vertebrate in association with the 
next feeding of the arthropod (Fig. 4). 

Filariasis. Filariasis is a disease caused by filarioidea 
in humans or lower animals. More loosely, the term 
is used to indicate mere infection by such organ- 
isms. In human medicine, filariasis commonly refers 
to the disease caused by, or to infection with, 
one of the mosquito-borne, elephantoid-producing 
Filarioidea—most frequently W bancrofti, less fre- 
quently Brugia malayi, and more recently including 
B. timori. 

1. Diagnosis. Serological and intradermal tests do 
not give any clue to specific identification. The only 
specific laboratory aid to diagnosis is the detection 
and identification of the microfilariae. Brugia timori 
is a sheathed microfilaria, but the sheath does not 
stain with the common blood-parasite stain, Giemsa. 
This microfilaria has the two detached terminal nu- 
clei seen in B. malayi but is nearly half again as long 
as the latter, with the cephalic space nearly twice 




















Microfilariae 
Intermediate host Location Sheath* Periodicity Geographical distribution 
“Domestic” night-biting Blood + Nocturnal Nearly worldwide in tropics 
mosquitoes 
Sylvan day-biting mosquitoes Blood + Slightly diurnal South Pacific islands 
Sylvan night-biting mosquitoes Blood + Nocturnal Southern Asia 
Domestic night-biting mosquitoes Blood + Nocturnal Indonesia 
Sylvan night-biting mosquitoes Blood + Nocturnal Southern Asia 
Simulium Skin = = Tropical Africa, Mexico, Guatemala, 
Venezuela, Columbia 
Culicoides Skin = = Cosmopolitan 
Chrysops Blood + Diurnal Equatorial Africa 
Unknown Cutaneous + = Southern and western U.S. 
lymph vessels 
Unknown Skin = = Mountains of southwestern U.S. 
Culicoides Blood = = Tropical Africa, South and Central 
America 
Culicoides Skin = = Africa 
Fleas, ticks (?) Blood - - Southern Europe, North Africa, U.S. 
Culicoides Blood Central and South America, West Indies 
Mosquitoes Blood - Nocturnal and Tropics and subtropics, temperate zone 
seasonal in U.S. 
Mosquitoes Blood - - Southern Europe, North Africa, Vietnam 
Mites Blood E = Southern U.S. 





*+indicates presence of sheath. 











as long. The microfilariae of Dipetalonema strepto- 
cerca, a subcutaneous parasite of chimpanzees, are 
unsheathed and nearly half again as long as those 
of D. perstans. However, the posterior terminal col- 
umn of nuclei is more than twice as long (with 9- 
12 nuclei) as in the latter, and the posterior end is 
curved like a shepherd’s crook. The parasite has been 
reported frequently in humans in the rainforests of 
western Africa. Over 40% infection with microfilar- 
iae was shown by one set of skin snips from a small 
group of natives in Ghana. 

2. Distribution and_ history. The species 
Wuchereria bancrofti has worldwide distribution 
in the tropics and extends into both north and 
south temperate zones. However, the very high 
infection rates and the high prevalence of the 
resulting grotesque elephantiasis (characterized by 
cutaneous and subcutaneous tissue enlargement due 
to lymphatic obstruction) are localized at sea-level 
areas with year-round uncontrolled high mosquito 
densities in the tropics. Introduction into the tem- 
perate zones commonly results in a few secondary 
cases before the infection dies out. This parasite is 
important also in medical and biological history. A 
century ago Patrick Manson noted that the common 
occurrence of tropical elephantiasis was restricted 
to areas where W bancrofti was common (with 30- 
50% microfilaremia) but that microfilaremia com- 
monly did not occur in the older subjects with 
advanced disease symptoms. This was one of the 
earliest indications of host (immune) response to re- 
peated parasitic infections. Manson’s demonstration 
that the microfilariae after ingestion with a blood 


meal continued embryonic development within the 
mosquito vector was also the first incrimination 
of mosquito transmission of any disease-causing 
organism. 
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Fig. 4. Life cycles of the Filarioidea. 
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3. Periodicity, In most areas of the world, the 
microfilariae of W bancrofti have a conspicuous 
nocturnal periodicity. There are 50 to 150 times as 
many microfilariae in the midnight blood as in noon 
blood. Often none may be found in midday blood. 
However, this well-defined periodicity has been com- 
pletely reversed within a week by reversing the 
human quiescent period (sleep and darkness) and 
the active period (exercising, eating, and daylight). 
This periodic variety of W bancrofti is transmitted by 
domestic, anthropophilous, night-biting mosquitoes, 
which readily enter houses to feed. Culex fatigans 
(C. quinquefasciatus) and some of the important 
malaria vectors such as Anopheles darlingi, A. fu- 
nestus, and A. farauti are important vectors. Never- 
theless, even in areas dominated by the periodic par- 
asite, individual cases of the aperiodic form occur. dn 
1906 such a case was found in the Philippines, and 
since it was considered that the case might involve 
a distinct species, the specific name philippinensis 
was coined.) 

On some of the geologically newer islands of the 
South Pacific east of the 176th meridian (Samoa, 
Marquesas, Fiji, and the Gilbert and Ellice among 
them) the dominant mosquitoes are sylvan daytime 
feeders. Nocturnal mosquitoes are rare or absent. In 
this environment an evolutionary change has taken 
place. The microfilariae have either no periodicity 
or only a slight diurnal periodicity. Aedes polyne- 
siensis and close relatives are the only vectors, and 
the parasite has almost completely lost its capac- 
ity to develop in C. fatigans. This parasite is evi- 
dently evolving into a new species. Perhaps it has 
already evolved into distinct species, despite the fact 
that the loss of nocturnal periodicity is the only un- 
equivocal distinction. There is some suggestion that 
acute febrile attacks after repeated bites by the mos- 
quito intermediate host are more common than with 
the periodic forms. At any event, to the earlier spe- 
cific name, philippinensis, pacifica was added in 
1932. 

4. Control. The fundamental difference in the epi- 
demiology of the two forms is clearly depicted by the 
control measures. The periodic form has been suc- 
cessfully reduced or eliminated by use of bednets, 
household screening, residential insecticides, and 
control of the readily detected domestic breeding 
sites. The use of community-wide household spray- 
ing to eliminate malaria from Guyana also eliminated 
transmission of W bancrofti. Acquired resistance to 
insecticides has complicated this approach. 

Neither attempts at larval destruction nor at- 
tacks on adult mosquitoes have had any success in 
controlling the aperiodic infection. Only communi- 
tywide medication has had any success. Diethylcar- 
bamazine (Hetrazan) in various regima has reduced 
the pool of microfilariae for mosquito infection and 
is active against the infective larvae or early develop- 
mental stages. 

Brugia malayi occurs in both periodic and ape- 
riodic strains, but it does not appear to be as com- 
pletely isolated as is the case with W bancrofti. An- 
other parameter is added by the fact that species of 


Mansonia are the principal vectors. The larval stages 
of these mosquitoes are attached to submerged veg- 
etation; herbicides have some value in control. Di- 
ethylcarbamizine has also been used. It appears to 
be very successful in areas where B. timori has been 
introduced recently. 

Onchocerciasis. This disease is caused by Oncho- 
cerca volvulus in the subcutaneous lymphatics. It 
is characterized by subcutaneous nodules which are 
most conspicuous where the skin lies closely over 
bony structures, via cranium, pelvic girdle, joints, 
and shoulder blades. When they are on the head, 
the microfilariae reach the eyes. Ocular disturbances 
vary from mild transient bleary vision to total and 
permanent blindness. This is a major problem in 
the provinces of Oaxaca and Chiapas in Mexico and 
the nearby Pacific slopes of Guatemala, where infec- 
tion is an occupational hazard associated with coffee 
growing. 

Termed “river blindness,” onchocerciasis also 
affects field workers in the Ivory Coast, Ghana, 
Mali, Togo, and Upper Volta. The World Health Orga- 
nization has estimated that there are over a million 
cases in Upper Volta alone. Attempts to control the 
blackfly vector have had only some local successes. 
Surgical removal of the nodules is curative and on 
a communitywide basis eliminates or reduces trans- 
mission. 

Loa loa. The African eye worm, Loa loa, is the fi- 
larioidan worm most commonly acquired by Cau- 
casian immigrants, including missionaries, in Africa. 
Transmission is by daytime-feeding sylvan deer-flies, 
Chrysops. The only preventive measures are protec- 
tive clothing, including head nets. Repellents have 
some value. Fortunately, serious damage is rare even 
when the worm gets into the eye. The areas of pit- 
ting edema known as calabar swellings are painful 
and diagnostic. They commonly occur on the wrists, 
hands, arms, or orbital tissues. Adult worms move 
freely. When the worm is accessible, surgical removal 
is indicated. This is one of the few Filarioidea in 
which adult worms are killed by diethylcarbamazine. 
See NEMATA. Gilbert F. Otto 

Bibliography. H. W. Brown and E A. Neva, Basic 
Clinical Parasitology, 6th ed., 1996; L. S. Garcia and 
D. A. Bruckner, Diagnostic Medical Parasitology, 
áth ed., 2001; P. E. Sartwell (ed.), Maxcy-Rosenau’s 
Preventive Medicine and Public Health, 11th ed., 
1980. 
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Spitzer Space Telescope 

A high-performance infrared telescope that is one 
of the four Great Observatories launched by the Na- 
tional Aeronautics and Space Administration (NASA) 
between 1990 and 2003. It takes advantage of dra- 
matic advances in infrared detectors that have oc- 
curred over the last 20 years; it utilizes modern detec- 
tor arrays in space, where they are limited only by the 
faint glow of the zodiacal dust cloud. Ground-based 
infrared telescopes can operate only at the wave- 
lengths where the atmosphere is transparent, lying 


between 1 and 25 micrometers. Even within these 
windows, the thermal emission of the atmosphere 
is more than a million times greater than the dilute 
emission of the zodiacal cloud; there is additional 
foreground thermal emission from the telescope it- 
self. High-sensitivity detectors are blinded by these 
bright foreground signals. Operating in space elimi- 
nates the atmospheric absorption and emission; also, 
a telescope in the vacuum of space can be cooled suf- 
ficiently to virtually eliminate its emission. See ZODI- 
ACAL LIGHT. 

Spitzer was known for most of its life as SIRTF 
(Space Infrared Telescope Facility). SIRTF was 
launched on August 25, 2003, and was renamed after 
Lyman Spitzer, Jr., the first prominent astronomer to 
advocate putting telescopes into space. 

Development. The Infrared Astronomy Satellite 
(RAS) and the Infrared Space Observatory 
(SO) were important predecessors of the Spitzer 
Telescope. IRAS, launched in 1983, had four sets of in- 
frared detectors, operating at wavelengths of 12, 25, 
60, and 100 yum. The detectors and a 60-cm (24-in.) 
telescope were mounted inside a liquid-helium 
Dewar vessel that maintained them at a tempera- 
ture of ~2 K (—456°F) for the 10-month mission, al- 
lowing a very sensitive all-sky survey. The European 
Space Agency built JSO to follow up on the IRAS 
survey. ISO too had a 60-cm telescope mounted in 
a liquid helium Dewar. It carried four instruments 
designed for detailed study of individual objects 
through imaging, photometry, and spectroscopy. I$O 
was launched in 1995 and had a 28-month mission 
that returned a wealth of data over the 3-200-um 
spectral range. See DEWAR FLASK. 

The Spitzer Telescope was also conceived to fol- 
low up on IRAS. Although its science team was se- 
lected in 1984, the mission was repeatedly delayed. 
Throughout this period, the mission was revived re- 
peatedly with new technical concepts. Finally, in 
1996 the telescope got started officially toward con- 
struction under the direction of the Jet Propulsion 
Laboratory. By that time, NASA had downsized all its 
science missions under the guidelines of its “faster- 
better-cheaper” philosophy. Spitzer had been de- 
scoped by about a factor of 6 in cost from the original 
plans. 

Design. The approved concept introduces a num- 
ber of innovations developed in response to the pres- 
sure to reduce cost. It uses an Earth-trailing orbit 
to get far from the thermal radiation of the Earth 
(Fig. 1). The instruments are cooled inside a 
Dewar containing liquid helium, as with IRAS and 
ISO. However, the telescope is mounted on the out- 
side of this Dewar and was launched warm. The 
very cold environment away from the Earth allows 
the outer shell of the satellite to cool passively, 
greatly reducing the heat load on the telescope. 
The thermal loads on the helium Dewar are also 
very small. This concept of a telescope launched 
warm that cools by radiating into space provides 
the technical foundation for future, larger infrared 
telescopes such as the James Webb Space Telescope 
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Fig. 1. The orbit of Spitzer. The Earth orbits the Sun in a 
counterclockwise direction in this figure, and the frame of 
reference rotates with the Earth-Sun line to show how 
Spitzer trails behind. Viewed in a solar frame of reference, 
the Spitzer orbit is a normal Keplerian ellipse. In the 
Earth-centered frame here, the observatory makes a loop 
or kink every year when it partially overtakes the Earth. The 
observatory will fall farther and farther behind during its 
5-year mission. The circles centered on the Earth are 0.2, 
0.4, and 0.6 astronomical unit in radius, respectively. 


The 85-cm (33.5-in.) telescope and 360-liter 
(95-gallon) Dewar are held off the spacecraft by 
a thermally insulating truss, and a thermal shield 
blocks radiation from the spacecraft (Fig. 2). The 
solar panel is cantilevered off the spacecraft, and the 
telescope is protected from its heat by another ther- 
mal shield. The Dewar protected the liquid helium 
from thermal loads while the observatory was on 
the ground. The instruments are mounted on top of 
the helium vessel, and a vacuum-tight aperture door 
sealed out the air when on the ground. Upon reach- 
ing orbit, the aperture door was opened to allow light 
from the telescope to reach the instruments. The 
outer shell cooled to 34 K (—398°F) during the first 
month of the mission. Venting the cold helium vapor 
from the Dewar through heat exchangers cools the 
telescope further, to about 6° above absolute zero, 
or 6 K (—449°F). The extremely efficient thermal de- 
sign is expected to yield a 5-year lifetime at these 
temperatures. 

Spitzer has three instruments. The Infrared Array 
Camera ARAC) images in bands at wavelengths of 
3.6, 4.5, 5.8, and 8 um. The Infrared Spectrograph 
ARS) provides spectra from 5 to 40 um at low 
resolution (1-2%) and from 10 to 38 um at moder- 
ate resolution (about 0.15%). The Multiband Imag- 
ing Photometer for Spitzer (MIPS) provides imag- 
ing at 24, 70, and 160 um. The instruments use 
advanced infrared detector arrays that operate at or 
near the fundamental natural background limit in 
space. The IRAC arrays are 256 x 256 pixels in for- 
mat, using indium antimonide (nSb) photodiodes 
for the two shorter bands and silicon:arsenic (Si:As) 
impurity band conduction CBC) devices for the 
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Fig. 2. Cutaway drawing of Spitzer, showing the major subsystems. 


longer ones. IRS uses 128 x 128 arrays of Si:As and 
silicon:antimony (Si:Sb) IBC devices. MIPS uses a 
similar Si:As array at 24 um and germanium:gallium 
(Ge:Ga) arrays for the two longer bands. The high 
performance of these detector arrays allows Spitzer 
to deliver its anticipated breakthrough in science 
capability, despite its history of descopes. [SO had 
about 50 times more detectors than JRAS, and Spitzer 
has about 100 times more than ZSO. The sensitivity 
per detector has also improved by an order of mag- 
nitude with each new mission. 


Objectives. Spitzer is operated as a general-access 
observatory from the Jet Propulsion Laboratory and 
a science center at the California Institute of Technol- 
ogy. Eighty percent of the observing time is assigned 
by competitive application from the astronomical 
community. The science program includes investi- 
gations of star formation, planetary systems, and the 
assembly and evolution of galaxies. 

Star formation. Young stars form in dense interstellar 
molecular clouds, carrying a heavy burden of inter- 
stellar dust. Initially, forming stars are cold, only tens 


of degrees above absolute zero. Even after gravita- 
tional contraction has warmed them to more typ- 
ical stellar temperatures, they are hidden by the 
surrounding cocoons of interstellar gas and dust 
and can be seen only in the infrared. Spitzer is 
mapping many nearby star-forming regions to iden- 
tify very new stars that are too cold and faint to 
be known from other types of observation, docu- 
menting the very first stages of star formation. It is 
finding luminous massive stars and the outflows of 
gas that occur as accretion stops on these objects. 
It is also locating very-low-mass substellar brown 
dwarfs, revealing the processes by which stellar- 
mass clumps in molecular clouds can fragment into 
much smaller pieces before collapsing. See BROWN 
DWARF. 

Figure 3a shows a star-forming region about 
14,000 light-years away that has a classic structure. 
Within the core of a molecular cloud, clumps of gas 
have collapsed into massive stars. Winds from these 
stars have cleared away the remaining gas, punch- 
ing a hole into the cloud. From the Spitzer images, 
more than 300 newly formed stars have been iden- 
tified in the central hole and surrounding molecular 
cloud. The remains of the cloud are heated by the 
young stars and glow at wavelengths of 4.5, 5.8, and 
8 um in this image. See MOLECULAR CLOUD; STELLAR 
EVOLUTION. 

Planetary systems. Embryo planets are believed to 
form during the first few million years of the life of 
their star, from a protoplanetary disk made of dusty 
and gaseous material with too much angular momen- 
tum to fall into the star itself. This stage in the planet 
formation process stops as the disk is dissipated by 
evaporation of grains of material within it, by accre- 
tion of material into the star or the embryo planets, 
and by ejection of material from the system. There- 
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after, planets grow by accretion of material from the 
embryos that is released when they collide with each 
other. A by-product of these collisions is circumstel- 
lar disks of dust and debris. The dust grains in the 
circumstellar disk are warmed by the star and glow 
in the infrared, where Spitzer can detect them. Initial 
Spitzer measurements of debris disks show a broad 
variety of behavior, indicating a few cases that ap- 
pear to be dominated by recent and dramatic colli- 
sions. The disk signals damp down after about 150 
million years. These observations are reminiscent of 
theories for the formation of the planets in the solar 
system, and in particular of the extreme bombard- 
ment of the Earth over the first few hundred million 
years of its existence. See EXTRASOLAR PLANETS; IN- 
TERPLANETARY MATTER; PLANET; PROTOSTAR; SOLAR 
SYSTEM. 

Figure 3b and c shows Spitzer images of Fomal- 
haut, a bright star about 25 light-years away and 
about 200 million years old. The image of the star 
itself has been removed from the figure panels. 
Figure 3b, at a wavelength of 70 um, shows the dusty 
debris from a shattering collision of asteroid-sized 
embryo planets. A Hubble Space Telescope (HST) 
image of scattered light from this ring shows it to 
be off-center relative to the star, possibly maintained 
in this configuration by the gravitational action of a 
massive planet. Because the ring is closer to the star 
in the lower left of the 70-um image (Fig. 3b), the 
dust is warmer there and glows more brightly than 
elsewhere. At a wavelength of 24 wm (Fig. 30), the 
ring is filled in with warmer dust that is probably 
falling into the star. This dust is very dilute and is not 
seen in the Hubble image. 

Assembly and evolution of galaxies. In nearby galaxies, 
the shortest-wavelength Spitzer bands reveal the dis- 
tribution of the stars that dominate the visible mass, 


Fig. 3. Illustrative science results from the Spitzer observatory. (a) Star formation in the nebula RCW 49 (courtesy of 
NASA/JPL-Caltech; E. Churchwell, University of Wisconsin). (b) Fomalhaut circumstellar disk at wavelengths of 70 wm and 
(c) 24 um (courtesy of NASA/JPL-Caltech; K. Stapelfeldt, JPL). (d) Dusty, elliptical galaxy Centaurus A (courtesy of 


NASA/JPL-Caltech; J. Keene, Spitzer Science Center & Caltech). 
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improving understanding of the structure of the 
galaxies. The longer-wavelength bands glow in the 
infrared emission of various forms of interstellar dust, 
from tiny aromatic hydrocarbon powder heated to 
hundreds of degrees from the absorption of a sin- 
gle ultraviolet photon, to large grains at about 20 K 
(—424°F). 

Figure 3d shows how Spitzer can see into the 
nearest large elliptical radio galaxy, Centaurus A, 
about 10 million light-years away. A striking trape- 
zoidal structure glows in the 8-4m band, which 
traces the distribution of polyaromatic hydrocarbon 
powder. A computer model indicates that this beau- 
tiful structure remains from a spiral galaxy that has 
been engulfed by the larger elliptical galaxy, a pro- 
cess that may have been responsible for triggering 
the nuclear activity and radio emission. 

In the distant universe, Spitzer is locating a 
population of very luminous, dusty galaxies at red- 
shifts of z = 1-4. These galaxies are identified be- 
cause of their very red colors in the IRAC bands 
(resulting from the combination of high redshift and 
absorption by interstellar dust) or their detection in 
the MIPS bands, along with information on the red- 
shift from spectra, or inferred from the IRAC col- 
ors. They represent a stage when smaller galaxies 
were merging at a high rate to form what have 
become large elliptical galaxies and spiral galaxy 
bulges. Spitzer is also detecting galaxies and quasars 
at even higher redshifts and will contribute to knowl- 
edge of the most distant objects known in the uni- 
verse. See GALAXY, EXTERNAL; GALAXY FORMATION 
AND EVOLUTION; QUASAR. George Rieke 

Bibliography. J. C. Mather (ed.), Optical, Infrared, 
and Millimeter Space Telescopes, Proc. SPIE, 
vol. 5487, 2004; G. H. Rieke, The Last of the 
Great Observatories: Spitzer and the Era of Faster, 
Better, Cheaper at NASA, University of Arizona 
Press, Tucson, 2006; Spitzer Space Telescope Mission 
(special issue), Astrophys. J. Suppl. Ser.,154(1):1- 
474, September 2004. 
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Splachnales 


An order of the true mosses (subclass Bryidae), 
whose members are remarkable perennial plants 
that grow mainly on nitrogenous substrates, such as 
dung, and show considerable differentiation of neck 
tissue below the spore-bearing part of the capsule. 
Many of the species show adaptations that encour- 
age insect dispersal of spores. The order consists of 
two families with about eight genera. 

The plants are gregarious or dense-tufted, erect, 
and often forked. The leaves are soft, lanceolate or 
obovate, sometimes bordered at the margins, and 
commonly toothed. The single costa may be ex- 
current or terminate at or below the apex. The 
perichaetia are terminal, with bracts much like the 
upper stem leaves. The setae are elongate and the 
capsules erect with a noticeably differentiated neck. 
Stomata are very numerous in the neck, and have 


two guard cells. A single peristome is usually present, 
entire or forked, with 16 teeth sometimes paired or 
joined in twos and fours. In Splachnum the teeth are 
chambered since segments of the inner peristome lie 
opposite the teeth and are fused with them. The ca- 
lyptrae are generally mitrate and sometimes hairy. See 
BRYIDAE; BRYOPHYTA; BRYOPSIDA. Howard Crum 
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Spleen 


An organ of the circulatory system present in most 
vertebrates, lying in the abdominal cavity usually in 
close proximity to the left border of the stomach. 
It exhibits wide variation in size, shape, color, and 
location, depending upon species and age. 

Anatomy. In humans the spleen normally mea- 
sures about 1 x 3 x 5 in. (2 x 7 x 12 cm) and 
weighs less than !/ Ib (230 g). It is a firm organ with 
an oval shape and is indented on its inner surface to 
form the hilum, or stalk of attachment to the peri- 
toneum. This mesentery fold also carries the splenic 
artery and vein to the organ. The spleen is composed 
of a complex pulp lying between fibrous partitions, 
or trabeculae. These partitions project inwardly 
from the dense fibrous capsule that surrounds the 
organ. 

Blood circulation is unusual in that the splenic 
arterioles open into thin-walled dilations, called si- 
nusoids, which in turn drain into small veins. This 
arrangement of distensible blood-filled vessels, clus- 
ters of cells, and developing follicles accounts for the 
pulpy character of the spleen. 

The pulp consists of a white pulp and a red pulp. 
The white pulp Gdymphatic tissue) forms irregular 
nodules or follicles, often called Malpighian corpus- 
cles and consisting of aggregations of developing 
lymphocytes. These nodules form initially around 
the small branches of the splenic arteries. In addi- 
tion to the lymphocytes, other white cells such as 
monocytes, histiocytes, and giant cells are found in 
quantity. The red pulp consists of the sinusoids filled 
with blood and of splenic or Billroth cords filling 
the spaces between the sinusoids. Lymphocytes are 
found in the splenic cords that merge gradually into 
the white pulp. The distribution and amount of white 
and red pulp vary greatly, depending upon certain 
infections, intoxications, and disturbances in blood 
cell formation (anemia and leukemia) in the body. 
See ANEMIA; LEUKEMIA. 

Function. The spleen is an important part of the 
blood-forming, or hematopoietic, system; it is also 
one of the largest lymphoid organs in the body and 
as such is involved in the defenses against disease at- 
tributed to the reticuloendothelial system. This sys- 
tem consists of many different types of cells in the 
body having the power to neutralize or engulf for- 
eign particles or bacteria and sometimes to act as 
filters through which blood or lymph must pass. See 
HEMATOPOIESIS. 

Although the chief functions of the spleen appear 
to be the production of lymphocytes, the probable 


formation of antibodies, and the destruction of worn- 
out red blood cells, other less well-understood activ- 
ities are known. For example, in some animals it may 
act as a reservoir for red blood cells, contracting from 
time to time to return these cells to the bloodstream 
as they are needed. In the fetus and sometimes in 
later life, the spleen may be a primary center for the 
formation of red blood cells. See ANTIBODY. 

Another function of the spleen is its role in biligen- 
esis. Because the spleen destroys erythrocytes, it 
is one of the sites where extrahepatic bilirubin is 
formed. Bilirubin is the principal pigment of bile 
formed by the reduction of biliverdin. Bile is nor- 
mally present in the feces. See BILIRUBIN. 

Lower vertebrates. In the lower vertebrates splenic 
tissue appears as scattered lymphoid masses in the 
wall of the digestive tract and apparently acts as the 
primary blood-forming organ. Bone marrow appears 
in certain amphibians and in most higher forms, tak- 
ing over the formation of red cells in adults under 
normal conditions. The spleen then produces cells 
which are principally of the lymphoid type and en- 
gages in a number of other activities, some of which 
have been mentioned above. 

This organ is not necessary to life for any verte- 
brate because its activities may be shared or assumed 
by other organs under proper conditions. Many of 
the splenic functions, such as lymphocyte formation, 
are common to all lymphoid tissue. Splenectomy may 
result in a temporary depression of antibody forma- 
tion or by a recrudescence of latent or low-grade 
infection. See SPLEEN DISORDERS. Walter Bock 
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Spleen disorders 


The spleen is rarely the site of primary disorders 
except those of vascular origin, but it is frequently 
involved in systemic inflammations, metabolic dis- 
eases, and generalized blood disorders. 

Among vascular disturbances, acute and chronic 
congestion are prominent, particularly chronic con- 
gestion caused by cardiac failure, cirrhosis of the 
liver, and obstruction of the blood flow from the 
spleen by thrombi, scarring, or tumor tissue. Ob- 
struction of the splenic artery or its branches by 
thrombi may result in an infarct caused by either 
cardiac or blood disease. 

Inflammations include acute and chronic forms. 
The characteristic engorgement of blood often 
causes a marked enlargement of the organ. Bac- 
teremias frequently produce this enlargement, or 
splenomegaly, and inflammation, but any severe in- 
fectious disease such as diphtheria or pneumonia 
may do so. Tuberculosis, syphilis, typhoid fever, and 
malaria, as well as many other specific infections, 
may cause splenomegaly, often with characteristic 
gross or microscopic changes for each disease. See 
MALARIA; SYPHILIS; TUBERCULOSIS. 

The spleen is involved in certain types of pigment 
metabolism and is frequently the site of specific kinds 


of degeneration when amyloidosis, hemosiderosis, 
or argyrosis is present. 

Similarly, in disorders of lipid metabolism, abnor- 
mal kinds or amounts of fat may appear in connection 
with Gaucher’s disease, the hereditary Neimann-Pick 
disease, and others. See LIPID METABOLISM. 

Because the spleen normally aids in the destruc- 
tion of worn-out red blood cells and, under certain 
conditions, in the formation of red cells, it is not un- 
common to find splenic involvement in blood disor- 
ders such as sickle-cell anemia, congenital hemolytic 
jaundice, polycythemia, and Mediterranean anemia. 
See HEMATOLOGIC DISORDERS. 

The leukemias, especially when of the lympho- 
cytic or neutrophilic varieties, cause some of the 
most prominent cases of splenomegaly as well as 
other changes. In myeloid leukemia, for instance, 
spleen weight of 13-18 Ib (6000-8000 g) is not 
rare and the spleen may fill the entire abdomen. See 
LEUKEMIA. 

Hypersplenism is an obscure disorder in which 
one or more of the blood cell types are destroyed 
to excess. Primary hypersplenism results from un- 
known causes; secondary hypersplenism may follow 
inflammatory diseases, chronic congestion, or tumor 
invasion. 

Tumors originating in the spleen are rare and usu- 
ally limited to such benign growths as hemangiomas, 
lymphangiomas, and fibromas, but malignant lym- 
phomas and lymphosarcomas also occur. 

Secondary tumors, which originate elsewhere and 
metastasize to the spleen, are not uncommon, partic- 
ularly the lymphoma group. Other varieties are seen 
less often and include carcinomas, particularly from 
a bloodstream invasion or growth in the neighboring 
stomach or intestine. See TUMOR. 

Trauma to the spleen is more common than sus- 
pected, and the consequences of rupture of the 
organ are often tragic because this may follow an 
apparently moderate abdominal injury. 

Many other conditions produce splenic change, 
notably splenomegaly of some degree. The presence 
of an enlarged spleen indicates that a thorough diag- 
nostic evaluation should be made. See SPLEEN. 

Edward G. Stuart; N. Karle Mottet 
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Splines 
A series of projections and slots used instead of a key 
to prevent relative rotation of cylindrically fitted ma- 
chine parts. Splines are several projections machined 
on the shaft; the shaft fits into a mating bore called a 
spline fitting. Splines are made in two forms, square 
and involute (see illus.). Since there are several pro- 
jections (integral keys) to share the force in transmit- 
ting power, the splines can be shallow, thereby not 
weakening the shaft as much as a standard key. 
Square splines have 4, 6, 10, or 16 splines. The 
external part (shaft) may have the splines formed 
by milling and the internal part (bore) by broach- 
ing. Three classes of fits are used: sliding (as for gear 
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Spodumene 


(a) (b) 


Diagrams of (a) square spline and (6) involute spline profile. 





shifting) under load, sliding when not loaded, and 
permanent fit. Square splines have been used exten- 
sively for machine parts. In the automotive industry, 
square splines have been replaced generally by invo- 
lute splines, which cost less to make accurately for 
good fit and interchangeability. 

Involute splines are used to prevent relative ro- 
tation of cylindrically fitted machine parts and have 
the same functional characteristics as square splines. 
The involute spline, however, is like an involute gear, 
and the spline fitting (internal part) is like a mating 
internal gear. Profiles are the same as for gear teeth 
of the stub (fractional pitch) form with 30° pressure 
angle. 

Involute splines on the shaft are generated by a 
hob or a gear shaper, and internal splines are formed 
by a broach or a gear shaper. Three classes of fit are 
standard: sliding, close, and press. 

Involute serrations are similar to involute splines 
except that the pressure angle is 45° and, while there 
are three standard fits loose, close, and press), serra- 
tions are usually press-fitted and used for permanent 
assembly. They are used for both parallel and tapered 
shafts. See MACHINE KEY. Paul H. Black 
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Ham et al., Mechanics of Machinery, 4th ed., 1958; 
C. R. Mischke and J. E. Shigley, Mechanical En- 
gineering Design, 6th ed., 2000; G. H. Ryder and 
M. D. Bennnett, Mechanics of Machines, 2d ed., 
1990. 
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Spodumene 


The name given to the monoclinic lithium pyroxene 
LiAl(SiO3)2. Spodumene commonly occurs as white 
to yellowish prismatic crystals, often with a “woody” 
appearance, exhibiting the 87° pyroxene (110) cleav- 
ages. It is easily identified during heating in a flame 
by the red color given off, accompanied by a marked 
swelling of the fragment. Spodumene usually con- 
tains an appreciable quantity of hydrogen substitut- 
ing for lithium. At 720°C (1330°F), spodumene in- 
verts to a tetragonal form, 6-spodumene, which is ac- 
companied by a 30% increase in volume. Spodumene 
is capable of forming immense crystals in nature. 
A single crystal 47 ft (14.3 m) in length and 5 ft 
(1.5 m) in diameter, as well as others almost as 
large, has been found at the Etta mine in South 


Dakota. This implies the remarkable ability of a sin- 
gle crystal to replace a large variety of preexisit- 
ing minerals and yet maintain the integrity of a sin- 
gle crystal, a crystal growth unequaled elsewhere in 
nature. 

Spodumene is usually found as a constituent in cer- 
tain granitic pegmatites in association with quartz, 
alkali feldspars, mica, beryl, phosphates, and a large 
variety of rare minerals. It is also known to occur 
as disseminated grains in some granite gneisses. Spo- 
dumene often alters to a fibrous mass composed of 
eucryptite LiAISiO, and albite, or eucryptite and mus- 
covite. The emerald-green variety, hiddenite, and a 
lilac variety, kunzite, are used as precious stones. 
Spodumene from pegmatites is used as an ore for 
lithium. See LITHIUM; PYROXENE. George W. DeVore 


es 
Spongiomorphida 

An extinct group of genera of Mesozoic fossils. Infor- 
mally known as the spongiomorphs, they were estab- 
lished as a family of scleractinian corals by Fritz Frech 
in 1890 on the basis of specimens from the Zlam- 
bach beds of late Triassic age in Austria. He grouped 
the fossils into four genera (Spongiomorpha, Hep- 
tastylopsis, Heptastylis, and Stromatomorpha). The 
fossils consist of closely spaced longitudinal rods of 
calcite or aragonite that are perpendicular to the 
growth surface and that give off lateral expansions 
which join the rods and in some forms unite to form 
laminae parallel to the growth surface (see illustra- 
tion). Frech noted the similarity of his group of fos- 
sils to the stromatoporoids, particularly to the genus 
Actinostroma, and named one of the species Hep- 
tastylis stromatoporoides. In tangential section, the 
rods of the spongiomorphs have a central dark dot, 
if round, or a line, if oval, and concentric microstruc- 
ture. Their fibrous microstructure suggests that they 





Spongiomorpha acyclica (Triassic, Austria), length 8 cm. 
(From J. P. Smith, Upper Triassic Marine Invertebrate 
Faunas of North America: Spongiomorphs, U.S. Geol. Surv. 
Prof. Pap. 1927) 


were originally aragonite, and their water-jet appear- 
ance in longitudinal section is similar to that of the 
septa of scleractinian corals. Some spongiomorphs 
from the middle Triassic Cassian beds still preserve 
the aragonite mineralogy of the living corals. 

In some spongiomorphs, the rods in tangential 
sections seem to be grouped in sixfold symmetry 
about a central rod with their axial dark lines radi- 
ating away from the central rod. This radial pattern 
was thought by some investigators to be homologous 
to the astrorhizal systems of the Paleozoic stromato- 
poroids. To those who believed the stromatoporoids 
were hydrozoans, this radial arrangement was taken 
as evidence that the spongiomorphs were hydro- 
zoans. However, the radial pattern is not produced 
by convergent canals, as in the stromatoporoids, but 
by the alignment of rods. Further, the hexameral ra- 
dial symmetry is typical of the Mesozoic corals, not 
of hydrozoans. Frech recognized the similarity of his 
specimens to the scleractinian coral genus Astraeo- 
morpha of the family Poritidae. In this family of liv- 
ing corals, the septa tend to break down into a se- 
ries of rods, and the boundaries of the corallites dis- 
appear to produce a structure very similar to the 
spongiomorphs. The affinity of Frech’s genera to the 
poritid corals has been confirmed by the work of P. 
Gautret and J.-P. Cuif on the microstructure of these 
fossils. 

The term spongiomorph should thus be used only 
in an informal sense for the group of genera in which 
the septa are reduced to closely spaced rods and the 
radial structure and individuality of the corallites is 
obscure. 

Spongiomorph corals are known from the Mid- 
dle Triassic of the Alps; the Upper Triassic of Aus- 
tria, Bulgaria, Romania, the United States (California, 
Alaska, Oregon), Turkey, Morocco, Iran, Japan, south 
China, central Asia, and Tajikistan; the Upper Juras- 
sic of China (Tibet), Austria, Spain, and Japan; and 
the Upper Cretaceous of Sweden and the West In- 
dies. See SCLERACTINIA; SCLEROSPONGE; SPHAERACTI- 
NOIDEA; STROMATOPOROIDEA. C. W. Stearn 
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minérologiques de Spongiomorpha acyclica Frech, 
1890, Geobios, 25(3):345-355, 1992; J. P Smith, 
Upper Triassic marine invertebrate faunas of North 
America: Spongiomorphs, US. Geol. Surv. Prof. 
Pap., 141:132-135, 1927; G. D. Stanley and M. T. 
Whalen, Triassic corals and spongiomorphs from 
Hell’s Canyon, Wallowa Terrane, Oregon, J. Paleon- 
tol., 63:800-819, 1989. 


[= ———————aamn| 
Sporozoa 

A subphylum of Protozoa, typically with spores. The 
spores are simple and have no polar filaments. There 
is a single type of nucleus. There are no cilia or 


flagella except for flagellated microgametes in some 
groups. In most sporozoa there is an alternation of 
sexual and asexual stages in the life cycle. In the sex- 
ual stage, fertilization is by syngamy, that is, the union 
of male and female gametes. All sporozoa are para- 
sitic. See PROTOZOA. 

Taxonomy. The subphylum is divided into 
three classes—Telosporea, Toxoplasmea, and 
Haplosporea. 

Telosporea. The class Telosporea is by far the largest 
and most important. It contains two subclasses: 
members of the subclass Gregarinia, which is 
composed of three orders (Archigregarinida, Eu- 
gregarinida, and Neogregarinida), are parasites of 
insects and other invertebrates; members of the sub- 
class Coccidia, which is composed of two orders 
(Protococcida and Eucoccida), are parasites mostly 
of vertebrates, although a few are found in inverte- 
brates. See INSECT DISEASES; TELOSPOREA. 

The order Eucoccida is the largest in the sub- 
class Coccidia. Two of its suborders (Eimeriina and 
Haemosporina) are of great importance. The subor- 
der Eimeriina contains several hundred species of 
coccidia, which cause disease in various domestic 
and wild animals. Most of them multiply in the cells 
of the intestine and cause diarrhea, dysentery, and 
even death. They are monoxenous, multiplying asex- 
ually by schizogony in their host cells and eventually 
producing gametes which unite to form oocysts that 
pass out in the feces, mature on the ground, and in- 
fect other animals when eaten. A typical species is 
Eimeria tenella, which occurs in the ceca of chick- 
ens and may cause their death. 

The suborder Haemosporina includes the malaria 
parasites and related forms. Their life cycle is basi- 
cally similar to that of the Eimeriina, but they gener- 
ally multiply asexually in vertebrate blood cells and 
produce sexual stages in invertebrate hosts. A typical 
species is Plasmodium vivax, which causes benign 
tertian malaria in humans (called benign because it 
seldom kills its victims, in contrast to P. falciparum, 
which causes malignant tertian malaria and has a rel- 
atively high death rate). See MALARIA. 

Spores are present in the Telosporea, and repro- 
duction is typically both sexual and asexual, with 
alternation of generations. Some species occur only 
in a single type of host Gnonoxenous), while oth- 
ers require two types of host (heteroxenous), with 
sexual development in one and asexual in the other. 
Some groups have flagellated microgametes; if these 
are missing, locomotion is by body flexion or glid- 
ing; pseudopods are ordinarily absent, but if pres- 
ent are used for feeding, not locomotion. The name 
Telosporea comes from Greek words which indicate 
that spore formation occurs at the end of the life 
cycle. 

Toxoplasmea. The class Toxoplasmea contains 
“sporozoa” which lack spores and sex. There are no 
pseudopods or flagella at any stage, and locomotion 
is by body flexion or gliding. Cysts or pseudocysts 
(with a wall formed by the host cell) are formed 
and contain many trophozoites. Species in this class 
have a single type of host. 
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Fig. 1. Fine structure of a coccidian merozoite (Eimeria miyairii from the rat). (a) Longitudinal section. Cross sections at 
levels of (b) nucleus, (c) Golgi complex, (d) large dense granules, (e) cytostome, (f) paired organelle, and (g) conoid. (After J. 


Andreassen and O. Behnke, J. Parasitol., 54:150-163, 1968) 


There seem to be only three genera in this class, 
all of which parasitize vertebrates. Toxoplasma oc- 
curs mostly in mammals. By far the most impor- 
tant species is T. gondii, which is common through- 
out the world and causes toxoplasmosis in humans 
and many lower mammals. Most infected individu- 
als show no symptoms, but the disease has various 
manifestations ranging up to acute, fatal encephali- 
tis in newborn infants. Its normal mode of transmis- 
sion is still unknown. Sarcocystis is found in many 
mammals and birds, especially sheep, cattle, swine, 
and ducks. The cysts, which may be visible to the 
naked eye, are found in the muscles. Hunters some- 
times see them in wild ducks, where they make the 
breast muscles appear striated. They are harmless 
when cooked and eaten, but they are esthetically un- 
desirable, and heavily infected ducks are rarely eaten. 
The third genus, Besnoitia, is relatively uncommon, 
although one species may cause a skin disease of cat- 
tle and another may pit the bones of reindeer. See 
TOXOPLASMEA. 

Haplosporea. It is not certain whether the last class, 
Haplosporea, really belongs in this subphylum. Its 
members have spores but apparently do not repro- 
duce sexually. They have no flagella, but pseudopods 
may be present. Some people think that this class be- 
longs with the Cnidospora, even though its members 
lack a polar filament, while others think it may be re- 
lated to the fungi. At any rate, there are only a few 
species in the group, and they are mostly parasites 
of mollusks, annelids, and other invertebrates. See 
CNIDOSPORA; HAPLOSPOREA. 

Structure. The structure of the Sporozoa varies 
with the species and stage of development, and 


the variations are too diverse to be given here. At 
one stage or another all (except perhaps the Hap- 
losporida) have a characteristic conoid and polar 
ring at the anterior end. Subpellicular fibrils or mi- 
crotubules extend almost the whole length of the 
body just beneath the pellicle and are presumably 
responsible for locomotion. There is a Golgi appa- 
ratus which presumably has to do with secretion or 
excretion, one or more mitochondria which have to 
do with metabolism, a number of micronemes, rod- 
shaped granules, convoluted tubules, sarconemes or 
lankesterellonemes extending backward in the ante- 
rior part of the organism, a paired organelle or se- 
ries of toxonemes originating inside the conoid and 
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Fig. 2. Mature (sporulated) oocyst of a coccidium (Eimeria). 
(After N.D. Levine, Protozoan Parasites of Domestic 
Animals and Man, Burgess, 1961) 


running backward in the cytoplasm, one or more mi- 
cropores (cytostomes, micropyles) in the side of the 
body, several types of granule in the cytoplasm, and 
endoplasmic reticulum (Fig. 1). The electron micro- 
scope is needed to see most of these structures. 

The Telosporea have an oocyst which is produced 
by fusion of male and female zygotes. Its structures 
can be seen with the light microscope. The char- 
acters of the oocyst vary with the species. In the 
Eimeriina there are generally one or more sporocysts 
within the oocyst, and one or more sporozoites in 
each sporocyst. The oocyst may have a micropyle, 
which may be capped. There may be a polar granule 
within the oocyst and its wall may be composed of 
one or more layers. There may be residual material 
in the oocyst and sporocyst, left over after formation 
of the sporocysts or sporozoites (Fig. 2). 

Life cycle. The life cycle of a typical telosporean 
sporozoon involves alternation of sexual and asexual 
generations. The sexual forms may occur in one type 
of host and the asexual forms in another, or there may 
be only a single type of host. The latter situation is 
characteristic of the eimeriin Eucoccida. A typical 
example is Eimeria tenella, which lives in the ep- 
ithelial cells of the chicken cecum (Fig. 3). When a 
chicken eats a mature (sporulated) oocyst, the oocyst 
breaks in the gizzard, releasing the sporozoites. A 
sporozoite (stage 1 in Fig. 3) enters an intestinal 
epithelial cell (2) and rounds up and turns into a 
first-generation schizont (3), which forms a number 
of merozoites (4) by multiple fission. These break 
out of the host cell (5), enter new host intestinal 
epithelial cells (6), and turn into second-generation 
schizonts (7, 8), which produce second-generation 
merozoites (9, 10), which break out of the host cell 
(11), and may form either third-generation schizonts 
(12, 13), which produce third-generation merozoites 
(14, 15), or gamonts. All third-generation schizonts 
produce gamonts. The gamonts are either male or fe- 
male. The male ones (16, 17) produce a large number 
of flagellated microgametes (18), while the female 
ones simply grow, becoming macrogametes (19, 20). 
The microgametes fertilize the macrogametes, and 
the resultant zygote (21) rounds up, secretes a wall 
around itself to become an immature oocyst, and 
leaves the host cell (22). It passes out of the chicken 
and becomes mature (sporulates) on the ground. The 
sporont undergoes meiosis and becomes haploid, 
throws off a polar body, and forms four sporoblasts 
(23), each of which forms a sporocyst containing two 
sporozoites. When this sporulated oocyst is eaten by 
the chicken, the sporozoites are released in the gut 
and the cycle begins again. 

Eucoccida. In the suborder Eimeriina, since each 
species has a definite number of asexual genera- 
tions, the infection is self-limiting. This is not true 
in the suborder Haemosporina, which contains the 
malaria parasites. In this group the number of asex- 
ual generations is indefinite, a moderate number of 
microgametes is formed, asexual stages occur in a 
vertebrate host, and sexual stages and sporozoite for- 
mation occur in an invertebrate, usually an insect. 
In human malaria, for instance, the vertebrate host 
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Fig. 3. Life cycle of the chicken coccidium Eimeria tenella. The numbered steps are 


explained in the text. (After N. D. Levine, Protozoan Parasites of Domestic Animals and 


Man, Burgess, 1961) 


is a human and the invertebrate host an Anopheles 
mosquito. The vertebrate host is infected when the 
invertebrate host bites it. In the third suborder of this 
group, the Adeleina, few microgametes are formed 
and they are produced when the male and female ga- 
monts are attached to each other, that is, when they 
are in syzygy. 

Gregarinia. In the second large telosporean subclass, 
the Gregarinia, the mature trophozoites (vegetative 
stages) are large and extracellular. They occur in 
the digestive tract or body cavity of invertebrates. 
Typically (Gin the order Eugregarinida), no schizonts 
are formed, the trophozoites producing gamonts and 
then gametes, which fuse to form a zygote which pro- 
duces an oocyst and then a number of sporozoites. 

Toxoplasmea. In the class Toxoplasmea there is no 
sexual reproduction. The trophoxoites divide by ex- 
ternal or internal binary or multiple fission, and cysts 
or pseudocysts with many naked trophozoites are 
formed. Norman D. Levine 
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Sports medicine 


A branch of medicine concerned with the effects 
of exercise and sports on the human body, including 
treatment of injuries. Sports medicine can be divided 
into three general areas: clinical sports medicine, 
sports surgery, and the physiology of exercise. Clin- 
ical sports medicine includes the prevention and 
treatment of athletic injuries and the design of ex- 
ercise and nutrition programs for maintaining peak 
physical performance. Sports surgery is also con- 
cerned with the treatment of injuries from contact 
(human or object) sports. Exercise physiology, a 
growing field of sports medicine, involves the study 
of the body’s response to physical stress. It comprises 
the science of fitness, the preservation of fitness, and 
the role of fitness in the prevention and treatment of 
disease. 

The various aspects of sports medicine touch on 
many different branches of medicine and surgery, 
including cardiology, neurology, orthopedics, pul- 
monary medicine, and endocrinology. Nevertheless, 
sports medicine is becoming a distinct discipline. 

Advances in instrumentation also led to increased 
understanding of cardiac function and its response to 
exercise. The pressure drum and sphygmograph me- 
chanically transmit and record arterial pulses. The 
modern electrocardiogram and other techniques 
now give information on the mechanical and elec- 
trical properties of heart action at rest and during 
exercise. 

Investigations of the effects of exercise on systems 
other than the heart and lungs were initiated in the 
1930s. The notion of exercise as a route to good 
health became widespread during the latter part of 
the nineteenth century. 

Diagnosis and therapy. The health problems arising 
from sports include injuries resulting from overuse, 
such as ill-defined aches and pains, tendinitis, and 
chondromalacia (softening of the cartilage), one 
cause of “runner’s knee.” Such injuries are becom- 
ing increasingly common as more people engage in 
amateur athletics; they also afflict some musicians 
who develop overuse of muscle-tendon units result- 
ing in pain of the hand and fingers, commonly seen 
in string players. Sports medicine also treats acute 
ailments such as dehydration and heat stroke, injury 
to the musculoskeletal system, the eyes, the brain 
and spinal cord, and (rarely) cardiac arrest. The usual 
therapy for overuse injuries is rest. Injuries caused 
by contact can require urgent surgical care. See PER- 
FORMING ARTS MEDICINE. 

Strategies to prevent injuries and maintain peak 
performance have become an important part of 
sports medicine. Careful attention to diet and fluid 
intake is routine for professional and amateur ath- 
letes during both training and sports events. (The use 
of performance-enhancing drugs, such as steroids 
and stimulants, by some athletes is an unfortunate 
result of the competition for improved athletic per- 
formance.) Studies of biomechanics have led to 
preventive measures such as improved helmets, 


shoes, and supportive clothing. See BIOMECHANICS; 
STEROID. 

Exercise physiology. This subdivision of sports 
medicine is involved with the responses of healthy 
individuals to physical stress and aging in all parts 
of the body. Hormonal, cardiovascular, renal, neuro- 
logic Gncluding mental function), musculoskeletal, 
and other bodily systems are studied. Modern tech- 
nologies are applied, because many are noninvasive 
and allow repeated measurements. 

Exercise and wellness. The role of sports in the pre- 
vention of disease and the improvement of health is 
a major concern of sports medicine. It is widely be- 
lieved that physical fitness can help ward off many 
of the illnesses common in industrial society, mainly 
obesity and cardiovascular disease but also osteo- 
porosis (brittleness of bones) and such chronic com- 
plaints as back pain and headaches. In addition, it 
is believed that a regimen in which regular exercise 
is accompanied by beneficial habits, such as a diet 
that is high in fiber and low in fats, moderate con- 
sumption of alcohol, and an avoidance of smoking, 
can lower the incidence of some illnesses, including 
cancers. 

Conclusive evidence of exercise’s capacity to pre- 
vent disease is lacking, but the circumstantial evi- 
dence is strong. Weight loss, lowered cholesterol, 
and improved neurologic, gastrointestinal, and gen- 
eral body function all seem to contribute to this ef- 
fect. Otto Appenzeller 

Bibliography. O. Appenzeller (ed), Sports 
Medicine: Training, Fitness, Injuries, 3d ed., 
1988; E. Jokl, Running, psychology and culture, 
Ann. N. Y. Acad. Sci., 301:970-1001, 1977. 
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Spot welding 

A resistance-welding process in which coalescence 
is produced by the flow of electric current through 
the resistance of metals held together under pres- 
sure. A low-voltage, high-current energy source is re- 
quired (Fig. 1). Usually the upper electrode moves 
and applies the clamping force. Pressure must be 
maintained at all times during the heating cycle to 
prevent flashing at the electrode faces. Electrodes are 
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Fig. 1. Spot-welding circuit. When electrodes are closed on 
the workpiece, the circuit is completed for welding. 
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Fig. 2. Distribution of temperature in local (numbered) 
elements of a spot-welding operation. 


water-cooled and are made of copper alloys because 
pure copper is soft and deforms under pressure. See 
RESISTANCE WELDING. 

The electric current flows through at least seven 
resistances connected in series for any one weld. 
They are (1) upper electrode, (2) contact between 
upper electrode and upper sheet, @) body of 
upper sheet, (4) contact between interfaces of 
sheets,(5) body of lower sheet, (6) contact between 
lower sheet and electrode, and (7) lower electrode 
(Fig. 2). Heat generated in each of the seven sections 
will be in proportion to the resistance of each. The 
greatest resistance is at the interfaces in 4, and heat 
is most rapidly developed there. The liquid-cooled 
electrodes 1 and 7 rapidly dissipate the heat gener- 
ated at the contact between electrodes and sheets 2 
and 6, and thus contain the metal that is heated to 
fusion temperature at the interfaces. 

After the metals have been fused together, the elec- 
trodes usually remain in place sufficiently long to 
cool the weld. An exception is in welding quench- 
sensitive metals, where it is desirable to remove the 
electrodes as soon as possible to allow the heat to 
be conducted in the surrounding metal, preventing 
steep quench gradients. See WELDING AND CUTTING 
OF MATERIALS. Eugene J. Limpel 

Bibliography. L. Jeffus, Welding: Principles and Ap- 
plications, 5th ed., 2002; H. Zhang and J. Senkara, 
Resistance Welding, 2005. 
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Spray flow 

A special case of a two-phase (gas and liquid) flow 
in which the liquid phase is the dispersed phase and 
exists in the form of many droplets. The gas phase is 
the continuous phase, so abstract continuous lines 
(or surfaces) can be constructed through the gas at 
any instant without intersection of the droplets. The 
droplets and the gas have velocities that can be dif- 
ferent, so both phases can move through some fixed 
volume or chamber and the droplets can move rela- 
tive to the surrounding gas. See GAS; LIQUID; PARTIC- 
ULATES; TWO-PHASE FLOW. 


Applications. Spray flows have many applications. 
Sprays are used to introduce liquid fuel into the 
combustion chambers of diesel engines, turbojet 
engines, liquid-propellant rocket engines, and oil- 
burning furnaces. They are used in agricultural and 
household applications of insecticides and pesti- 
cides, for materials and chemicals processing, for fire 
extinguishing, for cooling in heat exchangers, for ap- 
plication of medicines, and for application of coat- 
ings (including paint and various other types of lay- 
ered coatings). Common liquids (such as water, fuels, 
and paints) are used in sprays. It is sometimes useful 
to spray uncommon liquids such as molten metals. 
In the various applications, the approximately spher- 
ical droplets typically have submillimeter diameters 
that can be as small as a few micrometers. See METAL 
COATINGS. 

Formation. Sprays are formed for industrial, com- 
mercial, agricultural, and power generation pur- 
poses by injection of a liquid stream into a gaseous 
environment. In addition, sprays can form naturally 
in a falling or splashing liquid. Injected streams of 
liquid tend to become unstable when the dynamic 
pressure (one-half of the gas density times the square 
of the liquid velocity) is much larger than the coef- 
ficient of surface tension divided by the transverse 
dimension. Typically, the liquid stream disintegrates 
into ligaments (coarse droplets) and then into many 
smaller spherical droplets. The breakup (or atomiza- 
tion) process is faster at higher stream velocity, and 
the final droplet sizes are smaller for higher stream 
velocities. Spray droplet sizes vary and typically are 
represented statistically by a distribution function. 
The number of droplets in a spray can be as high 
as a few million in a volume smaller than a liter. See 
ATOMIZATION; JET FLOW; SURFACE TENSION. 

Drag force on droplets. The droplets experience a 
drag force as they move relative to the gas. The mag- 
nitude and direction of the drag depend on the mag- 
nitude and direction of the relative velocity between 
the gas and the droplet. The drag force is propor- 
tional to the square of the droplet diameter, while 
the droplet mass is proportional to the cube of the 
diameter; therefore, the acceleration, which equals 
the drag per unit mass, is inversely proportional to 
the diameter. Drag causes the relative velocity to de- 
crease faster for smaller droplets. Droplets of only a 
few micrometers in size move at the gas velocity. See 
FLUID-FLOW PRINCIPLES. 

Flow in and around droplets. A droplet moving rel- 
ative to the surrounding gas behaves similarly to an 
aerodynamic body. An observer fixed to the droplet 
would see that the gas flow first decelerates as it 
moves toward the droplet and then accelerates as it 
passes the droplet. The illustration shows a cross 
section of the flow around and within a droplet. The 
frictional force at the gas-liquid interface retards the 
flow in the gas layer at the droplet surface. The ap- 
proximate thickness of this boundary layer is pro- 
portional to the droplet radius divided by the square 
root of the Reynolds number. The nondimensional 
Reynolds number is the triple product of the gas 
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Cross section of a vaporizing droplet, showing the relative gas-droplet motion and 


internal circulation. 


density, relative velocity, and droplet radius divided 
by the coefficient of gas viscosity. For droplets, the 
range of the Reynolds number is 0-200. The flow 
stream in the boundary layer can separate behind 
the droplet surface, causing a wake with recircu- 
lating flow in the gas. The wake formation causes 
a pressure drop across the droplet. This pressure 
difference and the friction force result in the drag 
force. The friction force exerted on the liquid sur- 
face of the droplet causes the internal liquid to cir- 
culate. The streamlines in the illustration show the 
spherical vortex pattern that emerges, with recir- 
culation through a region called the internal wake. 
The convective current affects the transport of heat 
to the droplet interior from its surface region. See 
BOUNDARY LAYER FLOW; REYNOLDS NUMBER; WAKE 
FLOW. 

Droplettrajectories. The trajectories of the droplets 
in a spray depend on the aerodynamic forces (lift 
and drag) and the force of gravity that they experi- 
ence. The aerodynamic forces on a droplet are mod- 
ified by the proximity of neighboring droplets. The 
drag force is reduced for a droplet moving in tandem 
behind another droplet. The wake behind the first 
droplet is moving in the same direction as the second 
droplet, thereby reducing the relative velocity and 
the drag force. (This same phenomenon occurs for 
trucks moving in a convoy.) Droplets moving side by 
side can experience lift forces, causing them to either 
attract or repel each other. The lift force is caused by 
the modification of the pressure due to the Bernoulli 
effect as the gas flows between the two droplets. 
As the two droplets come closer, the gas velocity 
first increases, reducing the inside pressure below 
the outside pressure, thereby pushing the droplets 
toward each other. As the droplets come still closer, 
blockage of the inside flow occurs, increasing the in- 
side pressure above the outside pressure and push- 
ing the droplets apart. The aerodynamic forces can 
cause droplets to collide but, generally, the droplets 
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in a spray are diverging. The conduction and con- 
vection of heat and the diffusion of vapors are also 
affected by neighboring droplets. See AERODYNAMIC 
FORCE. 

Heat transfer and vaporization. In many applica- 
tions (such as combustors and heat exchangers), a 
high-temperature environment exists in the gas sur- 
rounding each droplet of a spray. Substantial heat 
transfer to the liquid surface results; some of the heat 
is required to vaporize the liquid at the surface, and 
the remainder of the heat is transferred to the droplet 
interior, thereby raising its temperature with time. 
The droplet lifetime is defined as the time required to 
vaporize the total droplet after its introduction to the 
hot environment. The droplet temperature is lower 
than the ambient gas temperature, with continuous 
increase in the gas temperature through a thermal 
gas layer surrounding the droplet. An increase in the 
relative velocity, or equivalently in the Reynolds num- 
ber, causes the thickness of the thermal layer to de- 
crease, thereby increasing the heating rate and the 
vaporization rate and decreasing the droplet lifetime. 
See HEAT TRANSFER. 

The temperature of a rapidly vaporizing droplet 
(for example, a fuel droplet in a combustor) will 
vary throughout its lifetime in a transient manner. 
The internal liquid circulation provides convective 
augmentation to the conduction of heat from the 
droplet surface toward the interior. A temperature 
variation through the liquid interior can remain dur- 
ing the lifetime of the droplet. For a droplet vaporiz- 
ing slowly in a low-temperature gas environment, the 
droplet temperature can be approximately uniform 
through the interior at the theoretical maximum 
temperature, which is called the wet-bulb temper- 
ature. A submillimeter-size water or fuel droplet in a 
gas environment with a temperature above 1000 K 
(1340°F) can vaporize in milliseconds. Vaporization 
time varies with droplet diameter to a power be- 
tween 1.5 and 2.0. 

The vapor that comes from the droplet is typi- 
cally different chemically from the surrounding gas. 
It must convect and diffuse away from the droplet 
and mix with the surrounding gas. Sometimes the 
vapor also reacts with the surrounding (for example, 
fuel vapor in a heated air environment). See EVAPO- 
RATION. 

Supercritical behavior. At very high pressures and 
temperatures, the distinction between a vapor and its 
liquid disappears. That is, above the critical tempera- 
ture and critical pressure values, the addition of heat 
results in a continuous change in density rather than 
the discontinuous phase change (vaporization) that 
is experienced at lower pressures and temperatures. 
In many applications, a liquid stream is injected into a 
chamber at supercritical pressure. If the initial liquid 
temperature is subcritical but the pressure through 
the gas film and the ambient gas temperature are 
supercritical, heating of the liquid can result, with 
a continuous change of density as the temperature 
increases from subcritical to supercritical values. 
The gas mixture in the film surrounding the droplet 
can have substantially higher critical pressures than 


either the pure vapor or the ambient gas. Sometimes 
the gas film is subcritical even if the ambient gas is 
supercritical. Vaporization, with a discontinuous 
density change, can occur in this situation. As the liq- 
uid droplet approaches the critical temperature, sur- 
face tension forces are substantially reduced. There- 
fore, distortion of the droplet shape and secondary 
atomization become more likely. See CRITICAL 
PHENOMENA; SUPERCRITICAL FLUID. 
William A. Sirignano 
Bibliography. A. Frohn and N. Roth, Dynamics of 
Droplets, Springer-Verlag, New York, 2000; A. H. 
Lefebvre, Atomization and Sprays, 1989; W. A. 
Sirignano, Fluid Dynamics and Transport of 
Droplets and Sprays, 1999; Z. Zapryanov and S. 
Tabakova, Dynamics of Bubbles, Drops and Rigid 
Particles, vol. 50 of Fluid Mechanics and Its Appli- 
cations, 1999. 
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Spread spectrum communication 


A means of communicating by purposely spreading 
the spectrum (frequency extent or bandwidth) of 
the communication signal well beyond the required 
bandwidth of the data modulation signal. Spread 
spectrum signals are typically transmitted by elec- 
tromagnetic waves in free space, with usage in both 
nonmilitary and military systems. 

Motivation for using spread spectrum signals is 
based on the following: 

1 Spread spectrum systems have the ability to re- 
ject hostile as well as unintentional jamming by in- 
terfering signals so that information can be commu- 
nicated. 

2 Spread spectrum signals have a low probability 
of being intercepted or detected since the power in 
the transmitted wave is “spread” over a large band- 
width or frequency extent. 

3 Since these signals cannot be readily demodu- 
lated without knowing the code and its precise tim- 
ing, a level of message privacy is obtained. 

4 The wide bandwidth of the spread spectrum sig- 
nals provides tolerance to multipath (reflected waves 
that typically take longer to arrive at the receiver than 
the direct desired signal so that the two can be dis- 
tinguished). 

5 A high degree of precision in ranging (distance 
measuring) can be obtained by using one type of 
spread spectrum signaling called direct sequence, 
with applications to navigation. 

6 Multiple access, or the ability to send many in- 
dependent signals over the same frequency band, 
is possible in spread spectrum signaling. See BAND- 
WIDTH REQUIREMENTS (COMMUNICATIONS); COMMU- 
NICATIONS SCRAMBLING; ELECTRICAL INTERFERENCE; 
ELECTRONIC WARFARE; RADIO-WAVE PROPAGATION. 

There are four generic types of spread spectrum 
signals: direct sequence (DS) or pseudonoise (PN), 
frequency hopping (FH), linear frequency modula- 
tion (chirp), and time hopping (TH). The first two 
methods are much more commonly used today than 
the other two. 
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Chirp modulation. Chirp modulation is a spread 
spectrum method which was developed for radar 
use in the mid-1940s. The basic idea is to transmit 
a long rectangular pulse whose carrier frequency is 
linearly increased. The frequency-modulated signal 
returned from the target passes through a filter in 
the receiver at a velocity of propagation propor- 
tional to frequency. The result is a pulse that is 
much shorter in time duration than the transmitted 
pulse with a larger peak power. Unchirped pulses 
such as interference or jamming pulses do not “com- 
press” at the receiver, so that this method yields a 
processing gain or advantage for the chirped signal. 
See RADAR. 

Direct sequence systems. Direct sequence modu- 
lation is characterized by phase-modulating a sine 
wave by an unending string of pseudonoise code 
chips (symbols of much smaller time duration than 
a bit). This unending string is typically based on a 
pseudonoise code that generates an apparently ran- 
dom sequence of code chips that repeats only after 
the pseudonoise code period. If T, is the duration of 
a bit expressed in seconds, then typically the dura- 
tion of a pseudonoise code chip, T,, would be much 
smaller than 7,, so that many chips would occur dur- 
ing one bit (typically from hundreds to millions). An 
important parameter in direct sequence systems is 
the processing gain (PG), which is given by the ratio 
T,/T,. Larger values of processing gain give greater 
immunity to interference. 

Transmitter. In many systems both the data and 
the pseudonoise code phase-modulate the carrier 
by 0° or 180°, which is called binary phase- 
shift keying (BPSK). In a basic binary phase- 
shift keyed, pseudonoise-encoded spread spectrum 
system (Fig. 1a), digital data representing the 
information to be transmitted are binary phase-shift 
keyed onto the carrier. Then the pseudonoise code 
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Fig. 1. Equipment for a basic direct-sequence (pseudonoise-encoded), binary phase-shift 


keyed, spread spectrum system. (a) Transmitter. (b) Receiver. 
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Fig. 2. Example of how a narrow-band jammer is despread in the front end of a receiver. 
(a) System model. (b) Transmitted signal and jammer. (c) Despread signal and jammer. 


generator also binary phase-shift keys the carrier, 
and the composite signal is transmitted. See ELECTRI- 
CAL COMMUNICATIONS; MODULATION; PHASE MODU- 
LATION. 

Receiver. For the receiver to function, it must first 
align the receiver pseudonoise code generator tim- 
ing with the received pseudonoise code signal timing 
(Fig. 1b). This is accomplished in two distinct steps, 
code acquisition and code tracking. Code acquisi- 
tion is process of searching in time by correlating 
the locally generated code against the received signal 
plus noise. The search continues until the correlation 
process produces an output above a predetermined 
threshold. This occurs either when the local and re- 
ceived codes are in time alignment or there is a false 
alarm due to noise. In either case the code loop is 
engaged and given a fixed length of time to track 
the received code. When true tracking occurs, the 
received signal is said to be despread, leaving only 
the data to be demodulated. 

The code tracking circuit, which is called a delay 
lock loop, tracks the code (keeps the local receiver 
code aligned with the received code) once acquisi- 
tion has been achieved. When the received code is 
being tracked, it can be multiplied by the locally gen- 
erated pseudonoise code which then removes the 
code on the received signal. 

At this point a BPSK-data-modulated carrier signal 
remains to be demodulated. In order to do this, a car- 
rier tracking loop must be utilized, such as a Costas 
loop or a squaring loop. Their function is to track the 
carrier and thereby allow phase-coherent demodula- 
tion of the data, that is, multiplication by a signal ex- 
actly in phase with the received carrier, which leaves 
only the baseband data to be detected. 

A bit synchronizer then tracks (provides bit tim- 
ing for) the data bit sequence, and a data demodula- 
tor utilizes a matched filter to detect the data. The 
matched filter for binary phase-shift keyed data is a 
T,-second integration over each bit time. The output 
of the integrator produces either a positive or a neg- 


ative output, which is then interpreted as a binary 1 
or a 0. This bit sequence of 0’s and 1’s determines 
the message being sent. The data bits transmitted 
are either 1 or —1, which maps to either a binary 
O or a 1. If the signal-to-noise ratio is sufficiently 
large, very few errors will occur in the bit detection 
process. 

The above discussion has obviously been oversim- 
plified. For example, the reconstructed pseudonoise 
code is never perfectly aligned with the received 
code due to noise and filtering effects, and the locally 
generated carrier likewise is not exactly in phase 
with the received carrier for similar reasons. Degra- 
dations therefore occur to the system when com- 
pared to an idealized system. Other modulations be- 
sides BPSK are used. 

Protection against jamming. It is instructive to see how 
the effects of a narrow-band jammer (narrow-band 
compared to the spreading-code bandwidth) are di- 
minished at the receiver due to the despreading ac- 
tion in a direct sequence system. A jamming sig- 
nal, which interferes with the reception of the de- 
sired signal, may be intentional or unintentional. 
Figure 2 illustrates a simplified direct-sequence 
transmitter and receiver including a signal with a 
narrow-band jammer. For this example it is assumed 
that the data and direct spread spectrum signal are 
BPSK-modulated onto the carrier and transmitted 
via the transmitter antenna. A jammer transmits its 
narrow-band jammer in the direction of the receiver 
antenna. At the receiver the sum of the signal and the 
jammer are captured at the receiver antenna and de- 
spread by a synchronous despreading signal, PN. 
The process of synchronously despreading (math- 
ematically multiplying) the received signal returns 
the signal to the prespread bandwidth. However, the 
process of despreading the jammer, since it is unre- 
lated to the signal, spreads the jammer to the band- 
width of the original spread signal, and only a small 
amount of the jammer power enters the receiver 
in the data bandwidth. Thus the processing gain of 
the direct-sequence system (PG = 7;/T,) greatly re- 
duces the effect of a narrow-band jammer on its per- 
formance. Furthermore, it follows that the spectral 
height of the spread signal is much lower than if it 
were not spread; hence, it is much harder to detect 
by an unauthorized user in the constant presence of 
the receiver noise. See JAMMING. 

Attributes. The six attributes of spread spectrum 
systems that were discussed above apply in this 
direct-sequence, pseudonoise spread spectrum sys- 
tem. Suppose an unmodulated tone at the carrier 
frequency is present at the receiver along with the 
spread signal. Then in the pseudonoise code de- 
spreading process, the undesired tone is spread or 
multiplied by the local pseudonoise code, thereby 
spreading its spectrum to a bandwidth of 2/T,. How- 
ever, the received signal is collapsed to a bandwidth 
of 2/T, (Tp > TƏ so that the data detector “sees” 
only a small fraction (7,/T,) of the original tone en- 
ergy. Low probability of intercept is achieved by 
virtue of the fact that the spread signal is very wide 
band and has low spectral height, and therefore 


cannot be easily detected in a noise background. Mes- 
sage privacy is enhanced if the actual pseudonoise 
code sequence used by the transmitter and receiver 
is cryptographically derived and is unknown to un- 
wanted listeners. Multipath tolerance is achieved by 
searching first with minimum delay and then expand- 
ing the search to larger delays. The desired signal 
will have less delay than the multipath signal and 
will be acquired first. Additional processing can be 
used to greatly minimize the multipath problems. If 
a set of essentially orthogonal codes, such as Gold 
codes (a special type of pseudonoise code), are used, 
many users can occupy the same bandwidth and 
only transmitter-receiver pairs with the same code 
can communicate. Since the pseudonoise chips are 
very short in time, ranging can be made accurate to 
a small fraction (5%) of a chip time. The higher the 
chip rate, the greater the range accuracy. See CRYP- 
TOGRAPHY. 

Time hopping. In time hopping, a time interval T 
is selected to be large compared to the data bit 
time, T,, and is divided into, say, K slots. Then, dur- 
ing one slot time (T/K seconds), T/T, bits are all 
burst-transmitted. The slot time that is transmitted 
is selected pseudorandomly so that only the trans- 
mitter and the receiver know the slot time code. In 
time hopping, the power increases by K times com- 
pared to what a continuous signal would require, 
but the time-hopping signal is on for only 1/K of the 
time, resulting in the same average power. The pro- 
cessing gain is K for BPSK modulation in this case, 
since bits are transmitted at K times the rate before 
time hopping. This scheme has high bandwidth effi- 
ciency, but high peak power and critical timing are 
required. 

Frequency-hopped systems. In a direct-sequence 
system, the phase of the carrier changes pseudoran- 
domly with the pseudonoise code. In a frequency- 
hopping system, the frequency of the carrier changes 
according to a pseudonoise code with a consecutive 
group of pseudonoise code chips defining a partic- 
ular frequency. Typically either multiple frequency- 
shift keying (MFSK) or differential phase-shift key- 
ing (DPSK) is used as the data modulation. Multi- 
ple frequency-shift keying is a modulation scheme in 
which one of a number of tones (2, 4, 8, and so forth) 
is transmitted at a given time according to a group of 
consecutive data bits (7 bits produce 2+” tones). At 
each hop frequency one of the 2*” tones is selected 
according to the n bits, and one of 2+” correspond- 
ing frequencies, centered about the hop frequency, 
is transmitted. In conjunction with frequency hop- 
ping, multiple frequency-shift keying would imply, at 
each instant of time, a given carrier frequency that 
depends on the hop-pseudonoise code sequence and 
the consecutive group of the most recent n data bits. 
Differential phase-shift keying is similar to phase-shift 
keying except that only the differences of the phases 
(not the actual phases) are encoded and noncoher- 
ent techniques (not requiring a carrier loop) can be 
employed at the receiver. 

Frequency-hopped systems are classified as fast or 
slow. Slow hopping occurs if one or more data sym- 
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bols are transmitted in the time interval of one hop 
(T,). Fast hopping occurs if two or more hops occur 
per data symbol. Although fast hopping is more com- 
plex to implement, it offers greater jammer protec- 
tion. The processing gain for a frequency-hopped 
system is defined by PG = W/R,, where W is the 
hopped bandwidth and R, is the data rate. 

Figure 3 illustrates a simple frequency-hopped 
system with binary frequency-shift keying (BFSK) 
data modulation. In this example a data bit 1 is de- 
noted by a positive offset of Af hertz, and a 0 by 
a negative offset of Af hertz from the selected hop 
frequency. In Fig. 3 a 0 is sent in the first hop period 
since it occurs in the lower portion of the frequency 
bin, and in the second hop period a 1 is sent (at a dif- 
ferent hop frequency). This process continues indef- 
initely. A well-designed hopping pattern will cover 
all parts of the hopping bandwidth with the same 
frequency of occurrence. 

A device called a frequency synthesizer achieves 
the actual frequency selection. For example, a 12- 
bit segment of the pseudonoise code may corre- 
spond to one of 2” different frequencies, so that 
one of approximately 4000 (217) frequencies is se- 
lected each hop time. Frequency synthesizers are 
used in both the transmitter and the receiver (Fig. 4). 
The transmitter modulates the data by typically using 
either multiple frequency-shift keying or differen- 
tial phase-shift keying modulation, which in turn 
is frequency-hopped by the frequency synthesizer. 
At the receiver, an acquisition process is utilized 
to synchronize the receiver frequency synthesizer 
with the received hopping signal, and then a track- 
ing system maintains synchronism. Finally a bit syn- 
chronizer provides timing for the data demodula- 
tor which demodulates the original, transmitted data 
bits. 

Hybrid systems. Besides the four basic systems dis- 
cussed above, hybrid schemes combining two or 
more of these systems have been used. The most 
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Fig. 3. Example of a frequency-hopped signal pattern. 
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Fig. 4. Frequency-hopping equipment for (a) transmitter and (b) receiver. 


common hybrid system is frequency hopping in 
conjunction with direct sequence; that is, each fre- 
quency is “spread” by a direct-sequence code. 

Code division multiple access. An important as- 
pect of spread spectrum communications is multi- 
ple access. Code-division multiple access (CDMA) 
is a method by which spread spectrum signals are 
utilized to allow the use of multiple signals over 
the same frequency band. The two common types 
are direct-sequence CDMA (DS/CDMA) systems and 
frequency-hopped CDMA (FH/CDMA) systems. See 
MULTIPLEXING AND MULTIPLE ACCESS. 

DS/CDMA. This method is based on modulating the 
carrier phase via the multiple access code and the 
data modulation. Multiple codes can be used over 
the same channel because the different codes are 
nearly uncorrelated. That is to say, when any distinct 
pair of codes are multiplied together and the prod- 
uct is compared chip by chip over one data symbol, 
the code chips (+1’s or —1’s) have about the same 
number of “plus ones” as “minus ones.” Thus two 
distinct codes are uncorrelated. However, when the 
same code is correlated against itself, its product is all 
ones, and thus is completely correlated. This prop- 
erty of different codes being uncorrelated is required 
for each code pair used in the multiple access system 
in order to minimize crosstalk or mutual interference 


effects. When multiple CDMA signals exist on the 
same channel, CDMA noise (background noise) oc- 
curs. See CROSSTALK. 

FH/CDMA. This method uses spread spectrum codes 
to control the hop frequencies at various values 
across the spread bandwidth in such a way that, 
most of the time, they do not collide. Typically the 
data modulation is multiple frequency-shift keying, 
as discussed above. In many applications it is not pos- 
sible to construct codes that will not collide (or will 
have zero correlation in the DS/CDMA case). These 
occasional collisions cause a degradation in the bit 
error rate performance. However, coding and inter- 
leaving/deinterleaving mitigate the effects of the col- 
lisions. The more channels that are used on the same 
band, the greater the interference level, which is 
sometimes viewed as background noise. See ELEC- 
TRICAL NOISE. 

Personal communication systems. In 1983 an ana- 
log cellular radio system, called the Advanced Mobile 
Phone System (AMPS), was introduced in the United 
States. Other analog systems were later introduced in 
Europe and Asia. In 1993 a CDMA spread spectrum 
personal cellular service was introduced in North 
and South America, Asia, and elsewhere. This stan- 
dard, called TIA IS-95, is a second-generation (2G) 
standard. Each mobile user is assigned one member 
of a set of 64 orthogonal functions know as Walsh 
functions. The Walsh functions are generated as rows 
of special square matrices called Hadamard matrices. 
On the forward channel (from the base station to the 
mobile user) the Walsh functions are used for spread- 
ing, and on the reverse channel (user to base station) 
they are used as orthogonal functions for the mod- 
ulation. Third-generation (G) CDMA systems are in 
the beginning stages, and offer more features and 
greater data rates than second-generation systems. 
See MOBILE COMMUNICATIONS. 

Other applications. Although the early evolution of 
spread spectrum systems was motivated primarily 
by military interests, nonmilitary applications have 
enjoyed considerable development. One important 
example that has both military and nonmilitary users 
is the Global Positioning Systems (GPS), which is a 
direct-sequence, CDMA, spread spectrum system for 
transmitting the satellite ranging codes. See SATEL- 
LITE NAVIGATION SYSTEMS. 

The space shuttle utilizes a direct-sequence spread 
spectrum communication system on its forward link. 
It relays data through the geostationary Tracking 
and Data Relay Satellites (TDRS) by using either 
S-band or Ku-band spread spectrum links. Another 
example in the military arena is the Milstar sys- 
tem, which utilizes frequency-hopping spread spec- 
trum communication over a very large bandwidth to 
achieve considerable immunity from unfriendly jam- 
ming signals. See MILITARY SATELLITES; SPACE COM- 
MUNICATIONS; SPACE SHUTTLE. 

Globalstar is a commercial satellite system that uti- 
lizes a CDMA signal structure along with a bent-pipe 
transponder to provide communications much like a 
cell phone, except that Globalstar is satellite-based. 
Its signal structure is similar to IS-95. Currently the 


handsets are considerably more expensive than cell 
phones. However, these phones have the advantage 
that they can be reached beyond the range of cell 
phones. See COMMUNICATIONS SATELLITE. 

The main reception source of news. stories 
for radio and newspapers is small spread spec- 
trum ground stations. Another application of code- 
division multiple access is transmission directly by 
satellite from a bank’s automatic teller machine to 
that bank’s computer facility. A home security system 
using spread spectrum techniques imposed on the 
ac power line has been used in Japan. Many cordless 
telephones utilize direct-sequence spread spectrum 
techniques. Jack K. Holmes 

Bibliography. S. Glisic and B. Vucetic, Spread Spec- 
trum CDMA Systems for Wireless Communications, 
Artech House Publishers, 1997; J. K. Holmes, Cober- 
ent Spread Spectrum Systems, Krieger Press, 1990; 
R. L. Peterson et al., Introduction to Spread Spec- 
trum Communications, Prentice Hall, 1995; M. K. 
Simon et al., Spread Spectrum Communications 
Handbook, tev. ed., 1994. 
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Spring (hydrology) 

A place where groundwater discharges upon the 
land surface because the natural flow of ground 
water to the place exceeds the flow from it. Springs 
are ephemeral, discharging intermittently, or perma- 
nent, discharging constantly. 

Springs are usually at mean annual air tempera- 
tures. The less the discharge, the more the temper- 
ature reflects seasonal temperatures. Spring water 
usually originates as rain or snow. Meteoric water 
compositions for the deuterium (D) and oxygen-18 
('80) are given by the empirical equation 


5D = 86180 + 10 


where 6 is the isotope composition in parts per thou- 
sand referred to standard mean ocean water. 

Standard mean ocean water is defined as zero in 
both O18 and D. Meteoric water from evaporation of 
seawater is depleted in D and !8O for kinetic reasons 
and hence the values of 5D and 5180 are negative and 
increasingly so the farther from the Equator. 

Hot springs. Hot-spring water may differ in com- 
position from meteoric water through exchange be- 
tween the water and rocks. Common minerals con- 
sist of component oxides. Oxygen of minerals has 
more '8O than meteoric water. Upon exchange, the 
water is enriched in !8O. Most minerals contain little 
deuterium, so that slight deuterium changes occur. 
Water in hot rock is buoyant relative to surround- 
ing water and isues as hot springs. Temperatures of 
the deep water may be estimated. Estimates stem 
from mineral solubilities with temperature. If satu- 
ration with a mineral is assumed and the chemical 
composition of the water known, a temperature may 
be calculated. The solubility of quartz, SiO2, is com- 
monly used for estimating the deep temperatures of 
hot springs. Common minerals also contain silica, 
so that temperature estimates are ambiguous. Some 
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hot-spring waters are acid from the oxidation of hy- 
drogen sulfide to sulfate. 

Mineral springs. Mineral spring waters have high 
concentrations of solutes. Mineral springs have long 
been used for therapy; the study of such use of 
springs is balneology. Pliny the Younger reported 
mineral springs in Belgium and Anatolia, which are 
still in use. Although mineral waters are still valued 
for therapy in much of the world, in North America 
such use has declined greatly since the early twenti- 
eth century. 

Mineral springs have wide ranges in chemistry and 
temperatures, and hot mineral springs may be clas- 
sified as hot springs as well as mineral springs. Most 
mineral springs are high either in sodium chloride 
or sodium bicarbonate (soda springs) or both; other 
compositions are found, such as a high percentage 
of calcium sulfate from the solution of gypsum. The 
water and the solutes may be of different origins. 
The stable isotope compositions of the water are the 
best guides to the sources of the water. For the min- 
eral springs of Europe, the United States, and Asia 
Minor, deuterium and '8O compositions show that 
the waters are meteoric. 

Carbon dioxide. Carbon dioxide is commonly found 
dissolved in or issuing through mineral water, and is 
identifiable by its °C composition. Carbon of carbon- 
ate minerals deposited in oceans is near zero in °C 
compared to an international standards, the Peedee 
belemnite. The carbon of the carbon dioxide in most 
of the mineral springs of Europe, the United States, 
and Asia Minor is close to zero in °C isotope compo- 
sition. It is inferred that the carbon dioxide results 
from the breakdown (metamorphism) of the carbon- 
ate minerals that are present in rocks of marine sed- 
imentary origin. 

Carbon dioxide from mineral springs of volcanic 
islands of the Atlantic Ocean seems to come from the 
source of the volcanic rocks, the Earth’s mantle, be- 
cause of the similarity of the isotope compositions of 
the carbon dioxide in the springs and in carbon diox- 
ide bubbles in volcanic rocks erupted on the deep 
ocean floor in the —4 to —8 per thousand range. 
Carbon dioxide from great depth also discharges 
through springs in the Sierra Nevada, California, and 
the Alaska peninsula. In mineral springs from the 
Rocky Mountains, carbon dioxide comes from both 
very deep sources and the shallower chemical reac- 
tions of marine rocks. 

A third source of carbon dioxide is organic mate- 
rial. Springs in the Coast Ranges of California, along 
the Copper River of Alaska, and in New Zealand yield 
mixtures of carbon dioxide and methane. The car- 
bon dioxide is depleted in carbon-13 and probably 
comes from the breakdown of organic (woody) ma- 
terials in the sedimentary rocks. 

Salts. Other solutes in the mineral springs are in- 
ferred to come from the rocks, but isotope data do 
not show origins. Salt deposits are sources for solutes 
in some instances. There are many mineral springs 
for which there is no possible salt deposit source for 
the solutes. Chemical compositions may be used to 
infer the sources of the solutes. Chloride-rich waters 
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in marine sedimentary rocks also contain bromide, 
iodide, and boron. During metamorphism of the rock 
the waters may be retained as fluid inclusions or as 
films along grain boundaries. Flushing of the retained 
brines may result in the saline springs occasionally 
found in metamorphic terraines. 

Chemical equilibrium. The chemical compositions 
of spring waters are seldom in chemical equilib- 
rium with the air. Ground waters whose recharge 
is through grasslands may contain a thousand times 
as much CO, as would be in equilibrium with air, and 
those whose recharge is through forests may contain 
a hundred times as much as would be in equilibrium 
with air. The high CO, content, along with other so- 
lutes, makes some spring waters quite nutrient to 
aquatic plants. If the CO2-rich ground water has dis- 
solved calcite, loss of CO, in a spring (chiefly by 
photosynthesis) may cause the calcite to precipitate 
to make travertime. Ground water flowing through 
rocks or sediments containing organic material or 
reduced minerals such as sulfides loses its dissolved 
oxygen by oxidizing the organic matter and the sul- 
fides. When the dissolved oxygen is lost, iron enters 
the groundwater as ferrous ion, Fe?+. The Fe?* is 
not chemically compatible with air, and springs con- 
taining Fe?* lose it as Fe(OH); upon oxidation. Thin 
films of Fe(OH); on the water surface are iridescent 
and resemble oil films. Unlike oil films, the Fe(OH); 
films are brittle and if ruptured do not reform. Ac- 
cumulations of the iron precipitates may lead to bog 
iron ores. Sulfate in ground water may be reduced in 
the presence of organic matter to H2S, giving some 
springs the odor of rotten eggs. See GEYSER; GROUND- 
WATER HYDROLOGY. Ivan Barnes 

Bibliography. D. K. Todd and L. W. Mays, Ground- 
water Hydrology, 3d ed., 2004; P. Younger, Ground- 
water in the Environment: An Introduction, 2006. 
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Spring (machines) 

A machine element for storing energy as a function of 
displacement. The flywheel, in contrast, is a means 
for storing energy as a function of angular velocity. 
Force applied to a spring member causes it to de- 
flect through a certain displacement thus absorbing 
energy. See FLYWHEEL. 

A spring may have any shape and may be made 
from any elastic material. Even fluids can behave 
as compression springs and do so in fluid pressure 
systems. Most springs take on specific and familiar 
shapes such as helix, flat, or leaf springs. All me- 
chanical elements behave to some extent as springs 
because of the elastic properties of engineering ma- 
terials. 

Uses. Energy may be stored in a spring for many 
uses: to be released later, to be absorbed at the instant 
the energy first appears, and so on. 

Motive power. One of the early and still the most 
frequent uses of springs is to supply motive power 
in a mechanism. Common examples are clock and 
watch springs, toy motors, and valve springs in auto 
engines. In these, energy is supplied to, and stored 


in, the spring by applying a force through a suitable 
mechanism to deflect or deform the spring. The en- 
ergy is released from the spring by allowing it to push 
(as in the valve) or twist (as in a clock) a mechanism 
through a required displacement. 

Return motion. A special case of the spring as a source 
of motive power is its use for returning displaced 
mechanisms to their original positions, as in the door- 
closing device, the spring on the cam follower for an 
open cam, and the spring as a counterbalance. To a 
certain extent the springs in vehicles of transporta- 
tion are in this category. They are designed to keep 
the car at a certain level with respect to the road 
or rails, returning the vehicle to this position if dis- 
placed by applied forces. 

Shock absorbers. Frequently a spring in the form of a 
block of very elastic material such as rubber absorbs 
shock in a mechanism. For example, the four legs 
of a punch press rest on four blocks of rubber. The 
rubber pads prevent the die-closing inertia forces of 
the press from transferring down through the legs to 
the floor with impact or hammer blow proportions. 
With the rubber pads under the press legs, the force 
on the floor builds up relatively slowly and no shock 
is evident. As the acceleration of the die block goes 
to zero, the inertia force goes to zero and the rubber 
pad springs, which were deflected by the press blow, 
are relaxed and ready for the next stroke of the press 
die. The whole press moves up and down relative to 
the floor, but by proper selection of the rubber pad 
the elastic constant is such that this motion is small. 
See SHOCK ABSORBER. 

Vibration control. Springs serve an important function 
in vibration control by supplying the necessary flexi- 
bility in the support of the vibrating mechanism and 
the required opposing forces as a result of their de- 
flection. In controlling vibration, the body or mass 
of the mechanism must be freely supported so as to 
generate forces opposing the vibrating forces. These 
opposing forces tend to bring the sum of the forces 
on the vibrating body to zero. Vibration-absorbing 
mounts are available in a wide range of sizes and 
spring constants to meet most requirements. These 
do not prevent vibration, which is a function of 
speed and the balance of the mechanism, but rather 
they minimize its effect on the machine frame or the 
mounting. See SHOCK ISOLATION. 

Force measurement. Springs have long been used in 
simple weighing devices. Accurate weighing is usu- 
ally associated with dead-weight devices or balances, 
but modern spring scales have received wide ac- 
ceptance and certification for commercial use. Ex- 
tremely accurate springs for heavy loads are used to 
calibrate testing machines, and they are employed 
in scales over crane hooks for weighing material as 
it is hoisted. Carefully calibrated springs are used 
in instruments such as electric meters and pressure 
gages. See BALANCE. 

Retaining rings. A relatively modern machine part in 
which the spring function is used as a holding means 
is the retaining or snap ring. This device is a split 
ring of square, rectangular, or special cross section. 
It fits in a groove on a cylindrical surface or in a bore 
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Fig. 1. Flat or leaf spring responds to perpendicular forces 
in either direction. 


and stays in place by spring force. A retaining ring 
prevents or restrains relative axial motion between 
a shaft or bore and the components on the shaft or 
in the bore. See SNAP RING. 

Types. Springs may be classified into six major 
types according to their shape. These are flat or leaf, 
helical, spiral, torsion bar, disk, and constant force 
springs. 

Flat or leaf spring. A leaf spring is a beam of cantilever 
design with a deliberately large deflection under a 
load (Fig. 1). One end ofa leaf spring is usually firmly 
anchored to the frame of the machine and the other 
end is linked to the moving machine elements by 
a two-force (pin-ended) link. Force may be tension 
or compression with no modification of the design. 
This push-pull feature is the great advantage of a leaf 
spring, plus the fact that a relatively large amount of 
energy can be stored in a small space. 

Helical spring. The helical spring consists essentially 
of a bar or wire of uniform cross section wound into 
a helix. The last turn or two at each end of the spring 
is modified to a plane surface perpendicular to the 
helix axis, and force can then be applied to put the 
helix in compression. The ends of the spring helix 
may be modified into hooks or eyes so that a force 
may be applied in tension. In general it is necessary 
to design helical springs so that force may be trans- 
mitted either in tension or in compression but not 
both ways for the same spring (Fig. 2). Where revers- 
ing forces occur in a spring mechanism, it is better 
to use a bar or leaf spring and in some cases a disk 
spring. 

Spiral spring. In a spiral spring, the spring bar or wire 
is wound in an Archimedes spiral in a plane. Each end 
of the spiral is fastened to the force-applying link in 
the mechanism. 

A spiral spring is unique in that it may be deflected 
in one of two ways or a combination of both of them 
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(Fig. 3). If the ends of the spiral are deflected by 
forces perpendicular to the spiral plane, the spiral is 
distorted into a conical helix. For better stability and 
ease of applying the forces, the spring is often made 
as a conical helix to start with and is deflected into 
a plane spiral. 

A spiral spring may also have the forces act tangent 
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Fig. 2. Helical spring. (a) Wound tight to extend under axial tension. (b) Wound loose to 
contract under axial compression. 
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Fig. 3. Spiral spring is unique in responding to torsional or 
translational forces. 
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to the spiral as in a clock spring. The spiral is wound 
quite open and the tangential force in tension on the 
spiral tends to close the gaps between successive 
turns. The spring behaves like a beam, bending to a 
shorter radius of curvature and thus storing energy. 

Torsion bar. A torsion bar spring consists essentially 
of a shaft or bar of uniform section. It stores energy 
when one end is rotated relative to the other. It is 
used in the spring system of the chassis of modern 
motor cars. See TORSION BAR. 

Disk spring. Where large forces are present and 
space is at a premium, the disk spring may be used, 
although it is usually expensive to design and build. 
It consists essentially of a disk or washer supported 
at the outer periphery by one force (distributed by a 
suitable chuck or holder) and an opposing force on 
the center or hub of the disk (Fig. 4). For greater de- 
flections several disks may be stacked with the forces 
transmitted from one to the next by inner and outer 
collars. 

Constant-force spring. Many mechanisms require that a 
constant force be applied regardless of displacement. 
The counterbalancing of vertically moving masses 
against the force of gravity is a typical example. The 
Neg’ator spring of Hunter Spring Co. provides such 
a constant force; it uses a tight coil of flat steel spring 
stock. When the outer free end is extended and the 
coil allowed to rotate on its shaft or pintle, the spring 
presents a constant restoring force. 

Design. Each type spring has its special features 
and design refinements. Common to all forms of 
springs are the basic properties of elastic materi- 
als. Within the elastic range of a material, the ratio 
of applied force to resulting deflection is constant. 
Spring systems can be designed to have a variable 
ratio. The ratio is the spring rate or scale and has 
the dimension of force per unit length. See HOOKE’S 
LAW. 

In a helical spring the elastic action stresses the 
wire in torsion. The following variables are subject 
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(c) Belleville spring; (d) slotted curved washer; (e) finger 
washer; (f) wave spring washer. 
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Fig. 5. Correction factor for curvature and shear. (After 
A. M. Wahl, Mechanical Springs, Penton, 1944) 


to the designer’s action and decision: 


free length L spring rate k 
mean diameter D modules of elasticity in 

wire diameter d torsion G 
allowable stress § number of active turns N 
applied load p deflection at load P F 


These variables are related by Eq. (1), which 


s= 8PD a) 

~ wa 
neglects the effect of coil curvature on the stress, and 
by Eq. (2). For springs subject to frequent cycling, as 


8PD°>N 


Ga = 


a valve spring, the correction factor for wire curva- 
ture must be included. The generally used correction 
is the factor proposed by A. M. Wahl. Introducing 
this factor changes Eq. (1) to the form of Eq. @), in 


8PD (= 1 ~~) 


md? fond * G ©) 


max = 
which C = D/d. The portion in brackets is Wahl’s 
factor K whence maximum stress is K times the tor- 
sional stress (Fig. 5). 

Various combinations of spring dimensions for the 
variables listed above will produce an acceptable 
spring. Usually the design is chiefly limited by allow- 
able stress of materials and space. 

Where compression and tension springs are cy- 
cled at very high frequencies, surging may cause high 
local stress and result in early fatigue failure. Surging 
is the inability of all parts of the spring to deflect 
at the same rate due to the inherent inertia in the 
coils. This phenomenon is closely associated with 
the natural frequency of the spring; springs should 
be designed so that their natural frequency and cyclic 
rate are as far apart as practical. 

L. Sigfred Linderoth, Jr. 

Bibliography. Society of Automotive Engineers, 
Spring Design Manual, 2d ed., 1995; Spring Man- 
ufacturers Institute, Handbook of Spring Design, 


rev. ed., 2002; A. M. Wahl, Mechanical Springs, 
2d ed., reprinted 1991. 
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Spruce 


Evergreen tree belonging to the genus Picea of the 
pine family. The needles are single, usually four-sided, 
and borne on little peglike projections; the cones 
are pendulous. Resin ducts in the wood may be seen 
with a magnifying lens, but they are fewer than in 
Pinus. See EVERGREEN PLANTS; PINALES; PINE. 

Eastern species. The white spruce (P glauca), 
ranging from northern New England to the Lake 
States and Montana and northward into Alaska, is 
distinguished by the somewhat bluish cast of its nee- 
dles, small cylindrical cones (Fig. 1c), and gray or 
pale-brown twigs without pubescence (hairs). Of 
erect pyramidal habit, it usually attains a height of 
60-70 ft (18-21 m) with trunk diameters of 1-2 ft 
(0.3-0.6 m), but occasionally in British Columbia and 
Alberta the tree reaches heights of 80-140 ft (24- 
43 m) with diameters up to 4 ft (1.2 m). When 
bruised, the leaves of some individuals emit a dis- 
agreeable odor, hence the names cat or skunk 
spruce. The pale, straw-colored wood is soft and 
straight-grained without different coloration of the 
heartwood. The wood is used for paper pulp, con- 
struction, and interior finish. The evenly grained 
wood is used in the manufacture of musical instru- 
ments. 

Red spruce (P rubens) is a similar tree but with 
greener foliage; smaller, more oval cones (Fig. 1b); 
and more or less pubescent twigs. Occurring natu- 
rally with white spruce in the northeastern United 
States and adjacent Canada, red spruce extends 
southward along the Appalachians into North Car- 
olina. As in white spruce, the greatest use of the 
wood is for paper pulp. 

Black spruce (P mariana) ranges from northern 
New England and Newfoundland to Alaska. How- 
ever, it occurs sparingly in the Appalachians to West 
Virginia. The cones are smaller than in the white 
and red species and are egg-shaped or nearly spher- 
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Fig. 1. Cones of eastern and western spruce. (a) Black 
spruce (Picea mariana). (b) Red spruce (P. rubens). 

(c) White spruce (P. glauca). (d) Colorado blue spruce 
(P. pungens). (e) Norway spruce (P. abies). (Brooklyn 
Botanic Garden) 





Fig. 2. Blue spruce (Picea pungens) in Bryce Canyon 
National Park, Utah. (National Park Service) 


ical (1-1!/⁄ in. or 2.5-3.8 cm long) and persistent 
(Fig. 1a). The twigs are pubescent. Known also as 
bog or swamp spruce in the southern part of its 
range, black spruce appears at high elevations in 
the north. The long-fibered wood is ideal for paper 
pulp and is also used for paddles, oars, construction, 
and shipbuilding. It is short-lived as an ornamental 
tree. 

Western species. Blue spruce (P pungens) (Fig. 2), 
also known as Colorado blue spruce, is probably the 
best known of the western species (Fig. 1d) because 
of its wide use as an ornamental tree. The twigs are 
glabrous (without pubescence). The tree is popu- 
lar as an ornamental in northern Europe, as well 
as throughout the United States. Individuals vary in 
the depth of blue coloration, the most pronounced 
being that of the Koster variety. Most of the orna- 
mental specimens come from grafts made on Norway 
spruce. 

Engelmann spruce (P engelmanni) has needles 
usually of a deep blue-green color, sometimes much 
like those of the blue spruce, but the young twigs 
are slightly hairy. The cones, although cylindrical, 
are smaller than in blue spruce, being about 11⁄2- 
3 in. (3.8-7.6 cm) long. This species is also a Rocky 
Mountain tree like the blue spruce, but it is more 
widely distributed from British Columbia to Arizona 
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and also in the mountains of Oregon and Washing- 
ton. It is an important timber species of the Rocky 
Mountain region. 

Sitka spruce (P sitchensis) is the largest spruce in 
the Northern Hemisphere. The leaves have a pun- 
gent odor, are considerably flattened, and stand out 
from the twig in all directions. Ranging from Alaska 
to northern California, it occupies a coastal strip 
about 40-50 mi (64-80 km) wide. Mature trees are 
180-200 ft (55-61 m) tall and 31/,-6 ft (1.1-1.8 m) 
in diameter, but in some instances larger dimen- 
sions have been recorded. Timber volume in the 
United States is nearly all in the states of Washington 
and Oregon. The wood is used for furniture, doors, 
blinds, and paper pulp, and in piano manufacture of 
sounding boards. 

The Norway spruce (P abies), the common spruce 
of Europe, is much planted in the United States for 
timber, as well as for ornamental purposes. It can 
be recognized by the dark-green color of the leaves; 
glabrous, pendent, short branchlets; and cones 
4-6 in. (10-15 cm) in length (Fig. 1e), usually near 
the top of the tree. An important source of paper 
pulp, it has often been planted for that purpose in the 
northern and eastern United States. It is also used for 
shelter belts and Christmas trees, although it is 
not as desirable for the latter purpose as the firs 
and Douglas-fir, since it sheds its needles quickly 
in a warm room. There are many cultivated varieties. 
See FOREST AND FORESTRY; TREE. 

Arthur H. Graves; Kenneth P. Davis 
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Sputtering 

The ejection of material from a solid or liquid sur- 
face following the impact of energetic ions, atoms, 
or molecules. Sputtering was first noted in 1853 by 
W. R. Grove and in 1854 by M. Faraday as the oc- 
currence of a metallic coating on the glass walls of 
a discharge tube. E. Goldstein in 1902 showed that 
the sputtering effect was caused by positive ions of 
the discharge striking the cathode and ejecting cath- 
ode material. Sputtering now is the basis of a large 
variety of methods for the synthesis and analysis of 
materials. 

Classification. Sputtering can be classified accord- 
ing to the mode of energy loss of the incident (pri- 
mary) particle. Nuclear stopping involves billiard 
ball-like atomic collisions in which a significant mo- 
mentum transfer occurs; it dominates for incident 
ion energies below about 1-2 keV per nucleon. Elec- 
tronic stopping involves collisions in which little 
momentum is transferred, but significant electronic 
excitation is caused in the target; it dominates for 
energies above about 10 keV per nucleon. 


Sputtering has also been classified into physi- 
cal and chemical sputtering. Physical sputtering in- 
volves a transfer of kinetic energy from the incident 
particle to the surface atoms leading to ejection, 
while chemical sputtering occurs when the incident 
species react chemically with the target surface lead- 
ing to the formation of a volatile reaction product 
which evaporates thermally from the surface. 

The main models for physical sputtering are the 
thermal sputtering and the collision cascade mod- 
els. The thermal sputtering model suggests that the 
ion impact instantaneously heats a small region of 
the surface to a temperature well above the boiling 
point of the material, so that a small amount of ma- 
terial can evaporate before the heat is dissipated to 
the surrounding solid. The collision cascade model 
suggests that the impinging ion collides with surface 
atoms, transferring kinetic energy to these atoms 
which then collide with their neighbors, and so on. 
Atoms at or near the surface which are set in motion 
out of the surface with a kinetic energy great enough 
to overcome the chemical binding to their neighbors 
are sputtered. There is little evidence that thermal 
sputtering is important for most materials; in con- 
trast, the collision cascade model accounts for the 
vast majority of observed sputtering phenomena. In 
particular, sputtering of single crystals leads to ejec- 
tion in certain preferred directions correlated with 
the crystal lattice, which clearly would not occur in 
a random disordered hot zone. See CRYSTAL STRUC- 
TURE. 

Yields. In the nuclear stopping regime, sputtering 
yields, Y, can be expressed as in the equation 


Y x Sn atom/ion 

U 

where S, is the nuclear stopping coefficient (energy 
deposited in the solid per unit distance traversed) 
and U is the surface binding energy of the target ma- 
terial (assumed equal to the heat of sublimation per 
atom). Sputtering threshold energies—the incident 
ion energy at which sputtering is just detectable— 
are on the order of 10 times the sublimation energy, 
or some tens of electronvolts, for most elements. 
Sputtering yields are a maximum for incident ion en- 
ergies around 200-300 eV per nucleon, where the 
nuclear stopping coefficient is a maximum. For argon 
ions (atomic mass 40) this corresponds to energies 
around 10 keV, at which energy sputtering yields for 
most materials under argon ion bombardment are in 
the range 1-10 atoms sputtered per incident ion. For 
incident ion energies below about 10 keV, a major 
fraction (80-90%) of the sputtered atoms come from 
the outermost surface atom layer, most of the remain- 
der from the second layer, while essentially all the 
sputtered atoms come from no deeper than about 
four atomic layers; this is the basis of the use of sput- 
tering in surface analysis. 

Sputtering yields in the electronic stopping regime 
cannot be expressed simply, as mechanisms are 
still obscure. Hard, “billiard-ball” collisions are rare 
at high energies, because the incident ion travels 
through the solid as a bare nucleus, or certainly 
with many of its electrons stripped away, and is thus 


extremely small. However, highly energetic particles 
can deposit up to several thousand electronvolts per 
nanometer into electronic excitation and ionization 
in the substrate. Mechanisms suggested for coupling 
of this electronic excitation to the nuclear motion 
necessary for sputtering include Coulomb repulsion 
of ionized atoms from one another along the par- 
ticle track, or excitation to antibonding (repulsive) 
electronic states. 

Complex materials. Sputtering of complex 
materials—metal alloys, inorganic and organic 
compounds and polymers, and minerals—can 
produce complex results. The relative efficiencies 
with which different elemental species are ejected 
following ion impact can differ, giving rise to 
preferential sputtering. When preferential sputter- 
ing occurs, the species sputtered with the lower 
efficiency accumulates to a higher concentration 
at the surface. Subsurface collisions of the incident 
ion, which penetrates several nanometers into the 
solid at kiloelectronvolt energies, cause atomic 
motion leading to ion beam-induced atomic mixing 
of surface and subsurface layers over the ion pene- 
tration depth. Chemical bonds can be broken, and 
sometimes new bonds can be formed. Sputtering 
of solids which have multiple phases, or which are 
polycrystalline, leads to the development of surface 
roughness due to the differences in sputtering yields 
between different regions. See ION BEAM MIXING. 

Semiconductor devices. Sputtering is widely used 
in the manufacture of semiconductor devices; sput- 
ter deposition is used to deposit thin films with a high 
degree of control by sputtering material from a target 
onto a substrate; sputter etching is used to remove 
unwanted films in a reversal of this process. Most re- 
moval and deposition systems generate the energetic 
sputtering ions in radio-frequency (rf) or direct cur- 
rent (dc) glow discharges in argon gas. Reactive ion 
etching is a chemical sputtering process in which 
chemically active sputtering species form volatile 
compounds with the target material leading to sig- 
nificantly higher etch rates and great selectivity. For 
example, fluorine-containing compounds etch sili- 
con rapidly by forming volatile silicon tetrafluoride 
(SiF,) but do not etch aluminum or other metals used 
to make electrical interconnections between devices 
on a semiconductor chip because the metal fluorides 
are involatile. Sputter etching and reactive ion etch- 
ing have the useful advantage of being anisotropic— 
that is, they etch only in one direction (usually nor- 
mal to the surface) so that very fine surface features 
can be delineated. See CRYSTAL GROWTH; GLOW DIS- 
CHARGE; INTEGRATED CIRCUITS. 

Materials characterization. In materials character- 
ization, sputtering is used to remove surface ma- 
terial controllably, allowing in-depth concentration 
profiles of chemical composition to be determined 
with a surface-sensitive sampling technique such as 
Auger electron spectroscopy, x-ray photoelectron 
spectroscopy, secondary ion mass spectrometry, or 
ion scattering spectrometry. Sputtering ejects not 
only atomic but also molecular species from surfaces, 
and a fraction of these ejected species are ionized as 
a result of the energetic collisions leading to ejection. 


Analysis of these secondary ions forms the basis of 
secondary ion mass spectrometry. Inorganic materi- 
als can be analyzed by identification of atomic sec- 
ondary ions; organic materials can give characteristic 
molecular ions. Degradation of organic materials by 
the ion beam irradiation is reduced by dispersal in 
a liquid matrix or by operation at very low irradi- 
ation rates. Projectile energies both in the nuclear 
stopping and in electronic stopping regimes result 
in ejection of very large molecular ions up to 5000 
atomic mass units for nuclear stopping and over 
15,000 atomic mass units for electronic stopping, 
allowing analysis of very large biomolecules by mass 
spectrometry. See AUGER EFFECT; SURFACE PHYSICS. 
Peter Williams 
Bibliography. R. Behrisch and W. Eckstein (eds.), 
Sputtering by Particle Bombardment, 2007; B. 
Chapman, Glow Discharge Processes: Sputtering 
and Plasma Etching, 1980; M. Ohring, The Mate- 
rials Science of Thin Films, 2d ed., 2001; K. Wasa, 
M. Kitabatake, and H. Adachi, Thin Films Materi- 
als Technology: Sputtering of Compound Materials, 
2004. 
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Squalene 


A Czo triterpenoid hydrocarbon. Squalene is made 
up of six (trans-1,4)isoprene units linked as two far- 
nesy1 (head-to-tail) groups that are joined tail to tail 
in the center (Fig. 1). 

Squalene can be isolated in large quantities from 
the liver oils of the shark (Squalus spp.) and other 
elasmobranch fishes, and is a relatively inexpensive 
compound. It undergoes oxidation reactions typical 
of a polyene. Complete hydrogenation of the liver oil 
gives the saturated hydrocarbon squalane, which is 
used in lotions and skin lubricants. 

The major significance of squalene is its role as a 
central intermediate in the biogenesis of all steroids 
and triterpenoids. Thus, although squalene accumu- 
lates in a narrow phylogenetic group of primitive 
fish, it is a ubiquitous metabolite in all organisms 
that contain steroids. 

The biosynthetic pathway begins with acetic acid 
units and leads via mevalonate and isopentenyl py- 
rophosphate to the sesquiterpene farnesol. Two far- 
nesyl pyrophosphate units are then coupled tail to 
tail. This coupling involves the formation of presqua- 
lene followed by rearrangement and reduction of the 
cyclopropane ring (Fig. 2a). 

The transformation of squalene to polycyclic triter- 
penoids and steroids is initiated by epoxidation of 
one double bond followed by cyclization. The prod- 
uct depends on the conformation, or folding, of the 


Squalene 


Fig. 1. Structure of squalene; the tail-to-tail joining is indicated by T. 
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Fig. 2. Biosynthetic pathways. (a) Formation of squalene. (b) Transformation of squalene 


to cholesterol. 


chain. The pathway from squalene oxide to choles- 
terol is indicated in Fig. 2b. A concerted sequence 
of bond formation and methyl group rearrangements 
leads to the triterpenoid lanosterol, which is a major 
constituent of sheep wool fat. The final steps to 
cholesterol include oxidative loss of three methyl 
groups. Further enzymatic oxidations in specific 
tissues give rise to the several classes of steroid 
hormones. See STEROID; TERPENE; TRITERPENE. 
James A. Moore 
Bibliography. S. V. Bhat, B. A. Nagasampagi, and 
S. Minakshi, Chemistry of Natural Products, 2005; 
T. W. Graham Solomons and C. B. Fryhle, Or- 
ganic Chemistry, 8th ed., 2003; R. K. Murray et al., 
Harper’s Biochemistry, 25th ed., 1999. 


A 
Squaliformes 

An order of sharks comprising six families, 24 genera, 
and 97 species, commonly known as dogfish sharks. 
The order differs from all other sharks by the follow- 





Spiny dogfish (Squalus acanthias). (Photo courtesy of Alaska Fisheries Science Center, 
National Marine Fisheries Service, National Oceanic and Atmospheric Administration, 
U.S. Department of Commerce) 


ing combination of characters: two dorsal fins, with 
or without spines; no anal fin; spiracle behind eye; 
and lower nictitating eye membrane absent. For the 
most part, squaliform sharks inhabit continental and 
insular shelves and slopes, and sea mounts. Some 
species live at great depths. So far as known, squali- 
forms are ovoviviparous and produce from one to 22 
pups per litter, depending on the species. The dwarf 
lantern shark (Etmopterus perryi), at 17 cm (6.7 in.) 
in length, probably is the world’s smallest shark; the 
largest squaliform is the Pacific sleeper shark (Som- 
niosus pacificus), which is up to 7 m (23 ft) in length. 
The diet consists of bony fishes, cephalopods, crus- 
taceans and other invertebrates, mammals, and other 
sharks. See ELASMOBRANCHII; SELACHII. 

Squalidae (dogfish sharks). Squalidae has a spine 
(without grooves) preceeding both dorsal fins, teeth 
in the lower jaw only slightly larger than those in 
the upper jaw, usually a precaudal pit, and a lateral 
keel on each side of the caudal peduncle. The family 
comprises two genera. Cirrhigaleus (two species) 
is very distinctive in having a robust body and bar- 
bels in the form of large nasal flaps that reach the 
mouth. By comparison, Squalus, comprising eight 
species, has a more slender body and much smaller 
nasal flaps. The species S. acanthias (spiny dogfish) 
(see illustration) is probably is the world’s most 
abundant shark and also the largest of the family, at- 
taining a length of 160 cm (63 in.), although most 
are much smaller. Squalids are antitropical (found in 
both hemispheres but not in equatorial regions) in 
the eastern and western Atlantic and Pacific oceans, 
but apparently are absent in the Indian Ocean. 

Centrophoridae (gulper sharks). The family Cen- 
trophoridae is distinguished from other squaliforms 
by having grooved spines in both dorsal fins, teeth in 
the lower jaw larger than those in the upper jaw, and 
precaudal pit and lateral keels absent from the cau- 
dal peduncle. Gulper sharks comprise two genera 
and 14 species, which are represented in the east- 
ern Atlantic from Iceland to South Africa, and in the 
Indian and western Pacific oceans. They range in size 
from about 79 to 160 cm (1 to 63 in.). 

Etmopteridae (lantern sharks). Etmopteridae is a 
family of small deep-water sharks comprising five 
genera and 41 species. The family is characterized by 
the following combination of characters: both dor- 
sal fin spines grooved; spines usually quite large, with 
second spine larger and both spines free of fin; upper 
lobe of caudal fin with subterminal notch; and lumi- 
nous organs usually present on body. Species range 
in adult total length from 17 to 84 cm ©.7 to 33 in.). 
Lantern sharks occur on continental and insular 
slopes from tropical to temperate zones of the 
Atlantic (northward to Iceland), Indian, and Pacific 
oceans. Etmopterus, the principal genus, comprises 
31 species. 

Somniosidae (sleeper sharks). The family Somniosi- 
dae comprises seven genera and 17 species of rela- 
tively small sharks, ranging in length from about 49 to 
114cm (19 to 45 in.). Sleeper sharks differ from other 
squaliform sharks by usually lacking dorsal fin spines; 
however, if present, spines are small and in both fins. 


They also have a lateral ridge on each side between 
the pectoral and pelvic fins, as well as luminous or- 
gans in most species. Sleeper sharks occur mostly on 
continental and insular shelves and slopes (some are 
oceanic) of the Atlantic, Indian, and Pacific oceans, 
and on Arctic and sub-Antarctic shelves. One of the 
sleeper sharks, the Portuguese shark (Centroscym- 
nus coelolepis), which occurs in the western North 
Atlantic and eastern Atlantic from Iceland to South 
Africa, ranges in depth from 270 to 3675 m (885 to 
12,055 ft). 

Oxynotidae (rough sharks). Oxynotidae, compris- 
ing one genus (Oxynotus) and five species, is the 
most bizarre family of squaliform sharks. The fam- 
ily is characterized as follows: trunk deep and com- 
pressed, triangular-shaped in cross section; promi- 
nent ventrolateral ridge between pectoral and pelvic 
fins; dorsal fins saill-ike, with the first extending 
forward over the pectoral fins and gill slits; both dor- 
sal fins with spines embedded in the fin, with only 
tips exposed; skin very rough; and most species with 
luminous organs. The family occurs on continental 
and insular slopes of the western and eastern Atlantic 
(including the Mediterranean Sea) and the western 
Pacific at depths of 500 to 720 m (1640 to 2360 ft. 
The species range in length from 49 to 150 cm 
(19 to 59 in.). 

Dalatiidae (kitefin sharks). Dalatiidae, comprising 
seven genera and 10 species (perhaps more), is char- 
acterized as follows: small size, ranging from 25 to 
50 cm (10 to 20 in.); dorsal fins without spines ex- 
cept in genus Squaliotus, which may have small 
spines in the first dorsal; and luminous organs, mostly 
on belly. Kitefin sharks occur on continental and 
insular shelves and slopes of tropical and temper- 
ate zones of the Atlantic, Indian, and Pacific oceans. 
The pygmy shark (Euprotomicrus bispinatus), only 
27 cm (10.6 in.) in maximum length, occurs at or 
near the surface at night and may make lengthy daily 
vertical migrations of 1500 m (4900 ft) or more, 
a round-trip of almost 2 mi. Species of the genus 
Isistius are called cookiecutter sharks (after the 
cookie-shaped bite wounds that they inflict on the 
bodies of prey). They have a short bulbous snout, and 
their large, fleshy suctorial mouth allows the shark 
to attach itself to prey, bony fishes and cetaceans, 
much in the manner employed by lampreys. When 
attached, the cookiecutter shark bites circular 
chunks of flesh from its prey. Its lower teeth, quite 
different from the uppers, are very large (in fact, 
in proportion to its size, the largest teeth of all 
sharks), erect, bladelike, and interlocking, making 
a formidable weapon for this relatively small shark. 

Herbert Boschung 

Bibliography. L. J. V. Compagno, FAO Species Cat- 
alogue, vol. 4: Sharks of the World: An Annotated 
and Illustrated Catalogue of Shark Species Known 
to Date, Part 1, Hexanchiformes to Lamniformes, 
FAO Fish. Synop. no. 125, pp. 212-249, 1984; L. J. V. 
Compagno, Checklist of Chondrichthyes, pp. 503- 
547 in W. C. Hamlett (ed.), Reproductive Biology 
and Phylogeny of Chondrichthyes: Sharks, Batoids, 
and Chimaeras, Science Publishers, Enfield, NH, 


2005; J. S. Nelson, Fishes of the World, 4th ed., Wiley, 
New York, 2006. 


SS 
Squall 


A strong wind with sudden onset and more gradual 
decline, lasting for several minutes. Wind speeds in 
squalls commonly reach 30-60 mi/h (13-27 m/s), 
with a succession of brief gusts of 80-100 mi/h 36- 
45 m/s) in the more violent squalls. Squalls may be 
local in nature, as with isolated thunderstorms, or 
may occur over a wide area in the vicinity of a well- 
developed cyclone, where the squalls locally rein- 
force already strong winds. Because of their sud- 
den violent onset, and the heavy rain, snow, or hail 
showers which often accompany them, squalls cause 
heavy damage to structures and crops and present 
severe hazards to transportation. 

The most common type of squall is the thunder- 
squall or rain squall associated with heavy convective 
clouds, frequently of the cumulonimbus type. Such a 
squall usually sets in shortly before onset of the thun- 
derstorm rain, blowing outward from the storm and 
generally lasting for only a short time. It is formed 
when cold air, descending in the core of the thun- 
derstorm rain area, reaches the Earth’s surface and 
spreads out. Particularly in desert areas, the thun- 
derstorm rain may largely or wholly evaporate be- 
fore reaching the ground, and the squall may be dry, 
often associated with dust storms. See DUST STORM; 
SQUALL LINE; THUNDERSTORM. 

Squalls of a different type result from cold air 
drainage down steep slopes. The force of the squall is 
derived from gravity and depends on the descending 
air which is colder and more dense than the air it re- 
places. So-called fall winds of this kind are common 
on mountainous coasts of high latitudes, where cold 
air forms on elevated plateaus and drains down fiords 
or deep valleys. Channeling of the air through nar- 
row valleys increases their force. Squall winds (bora) 
of 100 mi/h (45 m/s) or more are observed in win- 
ter, when arctic air masses from Russia intermittently 
spill over the mountains of former Yugoslavia into the 
warm coastal regions of the Adriatic. 

Violent squalls also characterize the warm foehn 
winds of the Alps and the similar chinook winds on 
the eastern slopes of the Rocky Mountains. Some- 
times destructive, especially in winter, they are char- 
acteristically very gusty and variable over short dis- 
tances. Strong surface wind speeds are a response 
to the distribution of pressure established in a stand- 
ing breaking wave, somewhat analogous to standing 
waves in water flowing over and around submerged 
obstacles. Momentum transferred to the surface lay- 
ers is dissipated in small-scale turbulence, an effec- 
tive dissipative process affecting jet streams world- 
wide. See CHINOOK; JET STREAM; STANDING WAVE; 
WIND. Chester W. Newton; Edwin Kessler 

Bibliography. C. D. Ahrens, Meteorology Today: An 
Introduction to Weather, Climate, and the Environ- 
ment, 6th ed., 1999; C. D. Whiteman, Mountain Me- 
teorology: Fundamentals and Applications, 2000. 
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Squall line 


Sa SEE 
Squall line 


A line of thunderstorms, near whose advancing edge 
squalls occur along an extensive front. The thundery 
region, 12-30 mi (20-50 km) wide and a few hun- 
dred to 1250 mi (2000 km) long, moves at a typi- 
cal speed of 15 m/s (30 knots) for 6-12 h or more 
and sweeps a broad area. In the United States, se- 
vere squall lines are most common in spring and 
early summer when northward incursions of mar- 
itime tropical air east of the Rockies interact with 
polar front cyclones. Ranking next to hurricanes in 
casualties and damage caused, squall lines also sup- 
ply most of the beneficial rainfall in some regions. 
See SQUALL. 

A squall line may appear as a continuous wall of 
cloud, with forerunning sheets of dense cirrus, but 
severe weather is concentrated in swaths traversed 
by the numerous active thunderstorms. Their pas- 
sage is marked by strong gusty winds, usually veer- 
ing at onset, rapid temperature drop, heavy rain, 
thunder and lightning, and often hail and tornadoes. 
Turbulent convective clouds, 6-9 mi (10-15 km) 
high, present a severe hazard to aircraft, but may 
be circumnavigated with use of radar. See AIRBORNE 
RADAR. 

Formation requires an unstable air mass rich in 
water vapor in the lowest 0.6-2 mi (1-3 km), such 
that air rising from this layer, with release of heat 
of condensation, will become appreciably warmer 
than the surroundings at upper levels. Broad-scale 
flow patterns vary; Fig. 1 typifies the most intense 
outbreaks. In low levels, warm moist air is carried 
northward from a source such as the Gulf of Mex- 
ico. This process, often combined with high-level 
cooling on approach of a cold upper trough, can 
rapidly generate an unstable air mass. See THUNDER- 
STORM. 
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Fig. 1. Successive locations of squall line moving eastward 
through unstable northern portion of tongue of moist air 
(dark area) in warm sector of a cyclone. Black arrows show 
general flow in low levels; colored arrows, axes of 
strongest wind at about 0.6 mi (1 km) above ground and at 
6-7 mi (10-12 km). 
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Fig. 2. Section through squall-line-type thunderstorm. 
Precipitation is represented by black broken lines. 
1 km = 0.6 mi. 
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The instability of this air mass can be released 
by a variety of mechanisms. In the region down- 
stream from an upper-level trough, especially near 
the jet stream, there is broad-scale gentle ascent 
which, acting over a period of hours, may suffice; 
in other cases frontal lifting may set off the con- 
vection. Surface heating by insolation is a contrib- 
utory mechanism; there is a preference for forma- 
tion in midafternoon although some form at night. 
By combined thermodynamical and mechanical pro- 
cesses, they often persist while sweeping through a 
tongue of unstable air, as shown in Fig. 1. Squall lines 
forming in midafternoon over the Plains States often 
arrive over the midwestern United States at night. 
See STORM. 

Figure 2 shows, in a vertical section, the simpli- 
fied circulation normal to the squall line. Slanting of 
the drafts is a result of vertical wind shear in the 
storm environment. Partially conserving its horizon- 
tal momentum, rising air lags the foot of the updraft 
on the advancing side. In the downdraft, air entering 
from middle levels has high forward momentum and 
undercuts the low-level moist layer, continuously re- 
generating the updraft. Buoyancy due to release of 
condensation heat drives the updraft, in whose core 
vertical speeds of 66-132 mi/h (30-60 m/s) are com- 
mon near the midlevel of the storm. Rain falling from 
the updraft partially evaporates into the downdraft 
branch, which enters the storm from middle levels 
where the air is dry, and the evaporatively chilled 
air sinks, to nourish an expanding layer of dense air 
in the lower 0.6-1.2 mi (1-2 km) that accounts for 
the region of higher pressure found beneath and be- 
hind squall lines. In a single squall-line thunderstorm 
about 12 mi (20 km) in diameter, 1-2 x 107 1b/s 
(5-10 kilotons/s) of water vapor may be condensed, 
half being reevaporated within the storm and the 
remainder reaching the ground as rain or hail. See 
MESOMETEOROLOGY. Chester W. Newton 

Bibliography. C. D. Ahrens, Meteorology Today: An 
Introduction to Weather, Climate, and the Environ- 
ment, Oth ed., 1999; E. Kessler (ed.), Thunderstorm 
Morphology and Dynamics, 2d ed., 1992; E K. 
Lutgens, E. J. Tarbuck, and D. Tasa, The Atmo- 
sphere: An Introduction to Meteorology, 10th ed., 
2006. 
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Squamata 


The dominant order of living reptiles, composed of 
the lizards and snakes. The group first appeared in 
Jurassic times and today is found in all but the cold- 
est regions. Various forms are adapted for arboreal, 
burrowing, or aquatic lives, but most squamates are 
fundamentally terrestrial. There are over 7700 ex- 
tant species: 4800+ lizards and 2900+ snakes. See 
REPTILIA. 

The order is readily distinguished from all known 
reptiles by its highly modified skull. The immediate 
ancestors of primitive lizards are members of the 
diapsid order Eosuchia, in which a pair of tempo- 
ral foramina are present on both sides of the head. 
A similar pattern is found among living reptiles in 
the tuatara, order Sphenodonta (Rhynchocephalia), 
and the crocodylians, order Crocodylia. In the Squa- 
mata, however, in the case of the quadratojugal and 
jugal bones, which border the lower margin of the 
inferior temporal opening, the former has been lost 
completely and the latter greatly reduced. In con- 
sequence, there is only a single temporal opening 
present, and the quadrate has become enlarged and 
movable. Even this temporal opening is lost or re- 
duced in many forms. No other reptiles show these 
modifications, which allow for great kinesis in the 
lower jaw since it articulates with the quadrate. In 
addition, the order is distinct from other living reptile 
groups because its members have no shells or sec- 
ondary palates, and the males possess paired penes 
(hemipenes). Another distinguishing characteristic 
of the squamates is the synchronous shedding of the 
outer layer of the integument, called ecdysis. The rate 
of this shedding is not cyclic, but is based fundamen- 
tally on the rate of growth of the individual organ- 
ism, which in turn is based on resource acquisition. 
See CROCODYLIA; EOSUCHIA; LEPIDOSAURIA; SPHEN- 
ODONATA. 

Traditionally, the order Squamata has been di- 
vided into two major subgroups: the lizards, subor- 
der Sauria (= Lacertilia), and the snakes, suborder 
Serpentes. The latter group is basically a series of 
limbless lizards, and it is certain that snakes are de- 
rived from some saurian ancestor. There are many dif- 
ferent legless lizards, and it has been suggested that 
more than one line has evolved to produce those 
species currently grouped together as snakes. Be- 
sides this possibility, there is the additional problem 
of separating the two suborders. No single attribute 
will suffice for this purpose, but the features in the 
table, used in combination, will definitely allocate 
any questionable form to one or the other group. 


Sauria 


The majority of lizards are insectivorous, but a few 
species feed on plants. Others, notably the Varanidae 
and allies, feed on larger prey, including birds and 
mammals. The largest living lizard is the Komodo 
dragon (Varanus komodoensis) [Fig. 1], which at- 
tains a length of 10 ft G m) and a weight of 250 Ib 
(113 kg). Several small geckos (Fig. 2) and African 
chameleons are about 1.5 in. (4 cm) long when 





Fig. 1. Komodo dragon (Varanus komodoensis), Komodo 
National Park, Indonesia. 


fully grown. See CHAMELEON; GECKO. 

Physiology. The senses of lizards include the usual 
ones found in all terrestrial vertebrates. Thermal, tac- 
tile, and pain receptors are distributed over the body 
surface. 

Olfaction. The principal chemoreceptors are the 
nasal organ proper, and the organ of Jacobson or 
the vomeronasal organ, a specialized derivative of 
the olfactory apparatus. The nasal organs open 
into the mouth cavity through the internal nares and 
are connected to the outside by the external nares. 
Inspired air passes through the organ on its way to 
the throat and lungs. Small particles of substances in 
the air are trapped by the nasal mucosa and analyzed 
by sense cells in this lining. The paired organs of 
Jacobson lie anterior to the olfactory sacs and are sep- 
arated from them. They open only into the mouth, 
and particles in watery solution from this region are 
analyzed by them. In many lizards the tongue has a 
divided tip, and the two divisions are used to pick 
up particles outside the mouth and bring them back 
into it. The tips are then rubbed against the open- 
ings of the vomeronasal organs. Jacobson’s organs 
are indicated as separate areas of the nasal organ in 
Amphibia and turtles, are partially separated from 
the main sac in Rhynchocephalia, and are vestigial in 
crocodiles, birds, and most mammals. See CHEMORE- 
CEPTION; OLFACTION; SENSE ORGAN. 

Hearing. The auditory apparatus of lizards shows re- 
markable variation correlated with the mode of life. 
In the majority of forms, a well-developed eardrum 
is present in a slight recess at the posterior mar- 
gin of the head near the quadrate bone. A ligament 
attaches the small middle-ear ossicle, the stapes, to 
the inner face of the tympanum, and the other end 





Fig. 2. Western banded gecko (Coleonyx variegatus). 
(Photo by NPS/Tonto National Monument, Arizona) 
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Squamata 





Comparison of Sauria and Serpentes 








Determining 
structure Sauria Serpentes 
Limbs Usually two or four; sometimes none None 
Eyelids Usually movable; sometimes immovable Immovable 
External ear opening Usually present; sometimes lacking None 
Pectoral girdle Usually present; rarely none None 


Braincase Usually open anteriorly, bounded by connective 
tissue sheath instead of bone 
Mandible Two halves, usually articulating 


Completely bony anteriorly 


Two halves connected by elastic ligament, 
not articulating 








fits into an opening into the inner-ear region. Sound 
is transmitted from the eardrum along the stapes 
to the fluid of the inner ear. In many burrowing 
lizards, the eardrum is partially or completely cov- 
ered by skin and scales or is lost entirely. Most of 
these forms are insensitive to airborne sounds and 
receive their auditory stimuli from the substratum 
through the lower jaw to the quadrate and then to 
the stapes. See EAR (VERTEBRATE); HEARING (VERTE- 
BRATE); PHONORECEPTION. 

Vision. Lizards have moderately good vision except 
for burrowing forms in which the eyes are reduced 
or covered over by skin, scales, or even bone. Most 
lizards with unreduced eyes have color vision, but 
some nocturnal families such as the Gekkonidae 
(geckos) and Xantusiidae (night lizards) are color- 
blind. In certain forms, the movable eyelids have a 
transparent window through which an image may 
be seen even when the eyes are closed. In several 
groups, the two eyelids have fused and are com- 
pletely transparent. The fused lids form a protective 
spectacle in geckos, night lizards, certain burrowing 
skinks, and other forms. See COLOR VISION; EYE (VER- 
TEBRATE); VISION. 

Locomotion. The majority of lizards are quadrupedal 
in locomotion and are usually ambulatory scamper- 
ers or scansorial (adapted for climbing) [Fig. 3]. 
Some forms are bipedal, at least when in haste. Per- 
haps the most famous of the bipedal forms is the 
Jesus Cristo lizard, Basiliscus, of Middle America 
which runs for some distance across the surface of 
small streams when frightened. In many burrowing 
lizards, the limbs are reduced so that sometimes only 
the anterior pair is present or all are gone. In these 
species, locomotion is of a serpentine, curvilinear 





Fig. 3. Tree lizard (Urosaurus ornatus). (Photo by 
NPS/Tonto National Monument, Arizona) 


type. One genus of Asian lizards, Draco, has large 
membranes between its limbs that can be used as 
gliding surfaces to increase the distances of leaps 
and to reduce the rate of descent. 

Thermoregulation. Lizards, in common with all other 
reptiles, are dependent upon external sources for 
maintenance of their body temperatures. They are 
ectothermic. Thermoregulation is attained by means 
of gross behavioral adjustments. A lizard moves into 
shade if its temperature rises, and moves into areas 
of higher temperatures as its temperature drops. In- 
dividuals are most active between temperatures of 
60 and 108°F (16 and 42°C) with optimum activ- 
ity between 80 and 90°F (27 and 32°C). Tempera- 
tures above 110°F (43°C) are lethal, and no lizard can 
survive in direct desert sunlight for more than 15- 
30 min. See TEMPERATURE ADAPTATION; THERMOREG- 
ULATION. 

Respiration and excretion. Respiration is achieved by 
means of paired lungs. The excretory product of pro- 
tein metabolism is an insoluble substance, uric acid. 
The feces are dark, oblong, and capped by a white 
mass of uric acid when deposited. See EXCRETION; 
RESPIRATION; RESPIRATORY SYSTEM. 

Coloration. The coloration of each species of lizard 
is characteristic. Most forms exhibit marked differ- 
ences in coloration between the sexes, at least during 
the breeding season, and frequently the young are 
markedly different from the parents. Color changes 
occur in rapid fashion among some species, and 
all are capable of metachrosis or changing color to 
a certain extent. Contrary to popular legend, the 
changes are not made to match the color of the 
background; they seem, rather, to be under neuro- 
hormonal control and to fluctuate with temperature 
and light and with the activities of the lizard. See 
CHROMATOPHORE; SEXUAL DIMORPHISM. 

Sound production. Many lizards make sounds ranging 
from hisses that are produced by simple expulsion 
of air through the glottis, to squeaks of small forms 
and loud cluckings of the geckos. Sounds are used 
for defensive bluffing and probably for recognition 
as well. 

Defense mechanisms. The basic defense against 
discovery utilized by most lizards is “freezing” mo- 
tionless in position. The majority are protectively 
colored and when still are extremely difficult to see. 
However, if they are located, rapid flight becomes the 
second line of defense. Even if a predator catches a 
lizard, the game is not over, because many forms bite 
and claw. Among the more interesting specialized 


defensive devices of saurians are the bluffing tech- 
niques of certain forms. The Australian frilled lizard, 
Chlamydosaurus, of the family Agamidae, can ex- 
pand a huge nuchal collar which when presented 
with open mouth to a predator is an effective deter- 
rent to aggression. The agamid, Uromastix, of Africa 
and the iguanid, Ctenosaura, of Middle America 
have spiny tails that may lash out to injure a poten- 
tial enemy. The horned lizards, Phrynosoma, of the 
United States and Mexico are armed with numerous 
spines, particularly on the head, and make a rather 
unpleasant mouthful. In addition, they have the ca- 
pacity of squirting blood from ruptured vessels in the 
lower eyelids into the mouth or face of a predator. 
The blood contains chemical substances, secreted 
into it in the eye region, that are extremely repugnant 
to coyotes and dogs. A great many lizards are able to 
throw off their tails voluntarily (tail autotomy), and 
reflexes in the tail make it thrash about to distract a 
predator from its real objective. In many cases, the 
tail is brightly colored to add to the illusion. The tail 
will regenerate on the lizard, but the new tail will 
lack a bony support. See PROTECTIVE COLORATION; 
REGENERATION (BIOLOGY). 

Venomous species. There are only two species of 
venomous lizards, both members of the genus Helo- 
derma, in the family Helodermatidae. The Gila mon- 
ster (H. suspectum) [Fig. 4] is found in western New 
Mexico, Arizona, extreme southern Nevada, south- 
western Utah, eastern California, and northwestern 
Mexico. The beaded lizard (H. horridum) is a Mexi- 
can species. Venom is produced in glands along the 
lower jaw and penetrates wounds inflicted by the an- 
terior recurved teeth (that is, teeth that curve back 
toward the animal). No mechanism for injection of 
the venom into the wound is present, contrary to 
the situation in most snakes. Although occasionally 
fatal, the bites of these large lizards, of which a length 
of 3 ft (0.9 m) is not uncommon, are not as danger- 
ous as those of snakes, nor are human beings bitten 
frequently. 

Rhythmic activity. Saurians show rather definite 
rhythms in their activities. In temperate regions, sea- 
sonal cycles are correlated with temperature and 
sexual activity, and hibernation occurs in the win- 
ter. General activity continues year-round in tropic 
climates. Daily activity cycles are typical of most 
species and include a definite daily pattern of feed- 





Fig. 4. Gila monster (Heloderma suspectum). (Photo by 
NPS/Tonto National Monument, Arizona) 


ing, drinking, resting, basking, and excreting. Most 
lizards are diurnal, and their daily cycles are usually 
correlated with temperature. Certain families, no- 
tably the Gekkonidae and Xantusiidae, tend strongly 
toward nocturnal habits. 

Territoriality. Most lizards spend their entire life 
within a rather restricted area and return at the end 
of each day to the same site to sleep. The males of 
the majority of forms exhibit territorial behavior and 
defend a central portion of their home range against 
trespass by other males of the same species. Defense 
of the territory usually involves much bluffing and 
noise, and the “defending” lizard is usually the vic- 
tor. Various color patches, scale crests, and other dis- 
tinctive marks play a part in intimidation and display 
at these times. Fights frequently occur and generally 
consist of attempts at biting. Male monitor lizards, 
Varanidae, rear up on their hindlegs and bite and 
slash with the front claws. No truly gregarious lizards 
are known, and none are migratory. See TERRITORI- 
ALITY. 

Courtship activity. Males usually breed with any fe- 
male that they meet in moving about their home 
ranges during the breeding season. A complex 
courtship pattern involving a chase and considerable 
prenuptial nuzzling and biting is typical. During cop- 
ulation the male frequently grasps the head or neck of 
the female in his mouth, and the tails of the breeding 
pair are entwined so that their cloacas are brought 
into contact. The male has paired hollow hemipenes 
which normally lie retracted into the base of the tail. 
When engorged with blood, these organs are everted 
to be used singly. 

Reproduction. Most species produce eggs that 
have leathery or calcareous shells. The eggs are 
buried in the soil or hidden in decaying logs or under 
bark. In many cases, the female guards the eggs 
against predators. Some species are ovoviviparous 
and retain the eggs within the body until they hatch. 
Others are truly viviparous with a seroallantoic pla- 
cental connection between mother and embryo. 
Gestation or incubation is generally 5-8 weeks. In 
oviparous and ovoviviparous forms, at least, the 
young have a special egg tooth which grows up from 
the tip of the upper jaw. This tooth is used to cut 
through the egg membranes and the shell at the time 
of hatching and is lost shortly thereafter. 

Classification of lizards. The following list indicates 
the major evolutionary lines, families, and distribu- 
tion of lizards. Families indicated by an asterisk con- 
tain limbless, snakelike species (based on M. J. Ben- 
ton, 2005). 


Iguania line 
Family Iguanidae: the Americas, Madagascar, Fiji 
Agamidae: Africa, southern and central 
Asia, Australia 
Chamaeleonidae: Africa, Madagascar, 
southern India, Near East, southern 
Spain 
Gekkota line 
Family Gekkonidae: circumtropical, all continents 
and most continental and oceanic islands 
Pygopodidae: Australia, New Guinea 
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Scincomorpha line 
Family Xantusiidae: North and Middle America, 
Cuba 
Teiidae: the Americas 
Lacertidae: Africa, Eurasia 
*Gerrhosauridae: Africa, Madagascar 
*Scincidae: cosmopolitan, except frigid 
areas 
Gymnophthalmidae: Central and South 
America 
Cordylidae: southern Africa 
Annulata line 
*Family Amphisbaenidae: tropics of Africa and 
America, Iberian Peninsula 
*Bipedidae: Mexico 
Trogonophidae: northern Africa and 
Middle East 
Rhineuridae: Florida (U.S.) 
Anguimorpha line 
“Family Anguidae: the Americas, Eurasia 
*Anniellidae: the Californias 
Xenosauridae: southeastern China, 
Central America 
Helodermatidae: North America 
Varanidae: Africa, southern Asia, Australia 
Lanthonotidae: North Borneo 


Serpentes 


Snakes (Figs. 5 and 6) are basically specialized, limb- 
less lizards which probably evolved from burrow- 
ing forms but have now returned from subterranean 
habitats to occupy terrestrial, arboreal, and aquatic 
situations. In addition to those features already men- 
tioned as distinguishing the snakes from lizards, the 
following characteristics are typical of all serpents. 
There is no temporal arch so that the lower jaw and 
quadrate are very loosely attached to the skull. This 
gives the jaw even greater motility than is the case in 
lizards. The body is elongate with 100-200 or more 
vertebrae, and the internal organs are elongate and 
reduced. A spectacle covers the eye, and there is no 
tail autotomy. In most features of their biology, ser- 
pents resemble lizards; the two groups are closely 
related. 

The largest living snake is the Indian python 
(Python reticulatus), which reaches 30 ft (9 m) in 
length and a weight of 250 Ib (113 kg). The largest 





Fig. 5. Western coral snake (Micruroides euryxanthus). 
(Photo by NPS/Tonto National Monument, Arizona) 





Fig. 6. Eastern ribbon snake (Thamnophis sauritus). (Photo 
by NPS/Little River Canyon National Preserve, Alabama) 


venomous snake is the king cobra (Ophiophagus 
hannab) of southern Asia, which is known to attain 
a length of 18 ft (5.5 m). A number of small tropi- 
cal burrowing snakes are vastly smaller by compari- 
son, being about 5 in. (12.5 cm) in length when fully 
grown. The longevity of snakes is not well known, 
but one captive cobra lived 28 years and many of the 
larger snakes probably live at least 25 years. 

Physiology. The senses of snakes are fundamen- 
tally similar to those of lizards. Great dependence 
is placed upon olfaction and the Jacobson’s organs. 
The tongue of all snakes is elongate and deeply bi- 
furcated. When not in use it can be retracted into 
a sheath located just anterior to the glottis, but it is 
protrusible and is constantly being projected to pick 
up samples for the organs of Jacobson from the sur- 
rounding environment. Snakes are deaf to airborne 
sounds and receive auditory stimuli only through the 
substratum via the bones of the head. The eyes are 
greatly modified from those in lizards, and there is no 
color vision. Some groups are totally blind and have 
vestigial eyes covered by scales or skin. 

Thermoreception. Specialized thermoreceptors, not 
found in any other vertebrates, are present in many 
boas and pythons (family Boidae, and in all pit vipers) 
(subfamily Crotalinae). In the boids, these receptors 
form a series of depressions in the lip scales and are 
called labial pits. In the crotalids, there is a single, 
large loreal pit located about midway between the 
eye and nostril and somewhat below them on the 
side of the head. These organs are sensitive to tem- 
perature differences of about 2°F (1.1°C) and are ap- 
parently used to locate prey whose temperature may 
be higher or lower than that of the surrounding en- 
vironment. 

Locomotion. Four basic patterns of locomotion are 
found in snakes, and several may be used by a partic- 
ular individual at different times. The most familiar 
type is serpentine curvilinear (lateral undulatory). In 
this pattern, the snake moves forward by throwing 
out lateral undulations of the body and pushing them 
against any irregularity in the surface. Snakes using 
rectilinear locomotion move forward in a straight 
line, without any lateral undulations, by producing 
wavelike movements in the belly plates. Laterolin- 
ear locomotion, or sidewinding, is used primarily on 
smooth or yielding surfaces and is very complex. In 
essence, the snake anchors a portion of the body in 
the substratum and lifts the rest out laterally to a new 


position. When the lifted part is anchored again, the 
portions of the body left behind are lifted to the new 
position. By a constant lifting and anchoring of al- 
ternate parts, the snake moves in a lateral direction. 
Concertina locomotion movement resembles the ex- 
pansion and contraction of that musical instrument. 

Other activities. The basic bodily functions of 
snakes—thermoregulation, respiration, excretion, 
and elimination—are the same as those in lizards. 

Defense mechanisms. Snakes and lizards are alike 
in utilizing procrypsis and flight as their principal 
means of defense. Protective coloration is of prime 
importance in the first of these responses. Although 
all snakes are capable of some color change, none 
show the marked changes characteristic of many 
lizards. Snakes are excellent bluffers, and hissing and 
striking, sometimes with the mouth closed, may be 
used to intimidate or discourage possible predators. 
Among the most effective deterrents to attack are 
the specialized warning devices of the rattlesnakes 
(Fig. 7). Rattlesnakes, however, are not bluffing, be- 
cause they can back up their threats with an injection 
of venom. These rattles are actually an extension of 
the integument; moreover, each time the individual 
organism sheds (ecdysis), a new rattle is added. Con- 
trary to popular lore, the age of the snake cannot be 
determined by the number of rattles. Some snakes 
may shed once over a year period, while other snakes 
may shed three times. 

Ecology. The seasonal and daily activity patterns 
of snakes are similar to those described above for 
lizards. More snakes are nocturnal than diurnal, and 
they generally have a larger home range than do 
lizards. Home sites are found in most species, but 
no territorial behavior is known. A “combat dance” 
in which the males rear upright and push and weave 
around one another occurs frequently. The dance 
has often been described as being a courtship dis- 
play, but it always involves two males. Its exact func- 





Fig. 7. Arizona black rattlesnake (Crotalus viridis cerberus). 
(Photo by NPS/Tonto National Monument, Arizona) 


tion remains a mystery. Snakes are generally nongre- 
garious, although the sea snakes, Elapidae, travel in 
large schools. Migrations occur in temperate regions 
to and from den sites, and the sea snakes migrate 
annually to breeding grounds. 

Courtship. Courtship is distinctive for each form 
but usually involves tracking of the female by the 
male, considerable nuzzling and rubbing of the fe- 
male by the male, and a wrapping of the bodies of the 
breeding pair tightly around one another for mating. 
Male snakes utilize their hemipenes one at a time. 
Eggs are buried or hidden and are leathery or cal- 
careous. Development is oviparous, ovoviviparous, 
or viviparous. Incubation or gestation takes about 
60-100 days. 

Nutrition. Food is almost always of animal material 
such as amphibians, reptiles, birds and their eggs, 
and mammals. However, some forms eat lizard and 
snake eggs to a large extent, as well as insects, mol- 
lusks, or fish; most snakes probably eat carrion. The 
majority of species are rather specific in food pref- 
erence. Prey may simply be grabbed and swallowed, 
or specialized techniques may be employed, depend- 
ing upon the species. Constriction of the victim is 
practiced by many forms. In this case, the food or- 
ganism is large and is grasped in the snake’s mouth 
while coils of the muscular body are wrapped around 
the prey. The coils are gradually tightened until the 
victim’s heart stops or suffocation occurs. Venom 
may be injected into the body of the prey by certain 
forms. Food articles are always swallowed whole. 
The process is complex but consists primarily of a 
series of coordinated movements of the head and 
jaws. The lower jaws are connected to one another 
by an elastic ligament which allows each half to be 
moved independently of the other. The teeth on the 
jaws are recurved so that by alternately moving the 
right half of the jaw forward and then the left and 
hooking the teeth into the prey the snake pulls itself 
forward over the prey’s body. 

Venomous snakes. The vast majority of living 
snakes are harmless to humans, although a num- 
ber are capable of inflicting serious injury with their 
venomous bites. The venom apparatus has evolved 
principally as a method of obtaining food, but it 
is also advantageous as a defense against attackers. 
Venomous snakes may be placed in three categories 
on the basis of their venom apparatus. Opistho- 
glyphodont snakes have fangs in the rear of the 
mouth. Proglyphodont snakes have fangs in the front 
of the mouth. This group can be subdivided into the 
proteroglyphodont in which the anterior fangs are 
fixed and immovable—for example, cobras, mam- 
bas, kraits, coral snakes, and sea snakes—and the 
solenoglyphodont in which the fangs are inserted 
on the movable maxilla so that they may be rotated 
forward when the mouth is open and folded back 
against the roof of the mouth when not in use, as in 
vipers. 

Fangs are teeth modified for the injection of venom 
into the victim, and the venom glands are modified 
salivary glands connected to the grooved fangs by a 
duct. Special muscles are present in all proglyphous 
snakes to force the venom into the wound. The 
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venom itself is a complex substance containing a 
number of enzymes. Certain of these enzymes attack 
the blood, others the nervous system, and some are 
spreaders. The venom of each species is distinct and 
contains different kinds, combinations, and propor- 
tions of the deleterious enzymes. 

The rear-fanged and proteroglyphodont forms 
have relatively short fangs and require some chewing 
of the victim in order to inject the venom. Small rear- 
fanged forms are consequently harmless to human 
beings because they cannot get their mouths around 
any part of a person’s body to bring the fangs into 
contact. Vipers, however, have very long fangs, and 
they usually strike or bite with blinding speed. Their 
injection mechanism is efficient, and a bite usually 
means that a considerable amount of venom has been 
injected. 

Classification of snakes. The following list indi- 
cates the major groups of living snakes and their dis- 
tribution. 


Family Typhlopidae: circumtropical 
Leptotyphlopidae: circumtropical 
Anomalepididae: southern Central America 
and northern South America 

Aniliidae: southwestern Asia and Central 
America 

Anomochilidae: Malaysia, Sumatra, Borneo 

Boidae: circumtropical, western United States 

Bolyeriidae: Round Island dndian Ocean) 

Tropidophiidae: West Indies, Central 
America, South America 

Acrochordidae: India, Solomon Islands, 
Australia 

Colubridae: cosmopolitan, except frigid 
areas; some species are opisthoglyphodont 

Elapidae: Africa, Asia, Australia, the Americas; 
all species are proglyphodont 

Viperidae: Eurasia, Africa; all species are 
proglyphodont 
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Squash 

The common name for edible fruits of several species 
of the genus Cucurbita: C. pepo, C. moschata, C. 
maxima, and C. mixta. Those species originated in 
the Americas but are now grown in most countries 
around the world. Within squash there is a tremen- 
dous variation in size, shape, color, and usage (see 
Fig. 1). 

Summer squash. The most clearly defined group 
is summer squash, fruit of any species of Cucurbita 
eaten as a vegetable when immature. It is most com- 
monly C. pepo, and its plants are usually bushy rather 
than having long runners. In some countries fruits 





may be picked as early as the day the flowers open, 
but they are eaten at all stages up to the time when 
the rind starts to harden. Fruit color may be white, 
yellow, or light or dark green, and the green may be 
solid or striped. Shapes may be flattened disks as in 
Pattypan, cylindrical as in Zucchini and Cocozelle, 
or with necks as in the straightneck and crookneck 
types. Summer squash has mild flavor, high water 
content (about 95%), and relatively low nutritional 
value. It is not especially high in any one nutrient 
but contributes modest amounts of several vitamins 
and minerals. 

Winter squash. Winter squash is fruit of Cucur- 
bita eaten when mature and derives its name from 
its ability to be stored for several weeks or months 
without consumption. Varieties of winter squash are 
found in all four species. The Table Queen group, 


Fig. 1. Three varieties of squash. (a) Zucchini. (b) St. Pat 
Scallop squash (W. Atlee Burpee Co.). (c) Crookneck 
squash (Asgrow Seed Co., The Upjohn Co.). 


synonymous with Acorn, is C. pepo, Butternut be- 
longs to C. moschata, Green-striped Cushaw is C. 
mixta, while C. maxima has the widest range of 
types, including Buttercup, Hubbards, and Delicious 
of various colors, Banana, and Boston Marrow. Flesh 
color varies from light yellow to dark orange, and 
the edible portion ranges from thin to very thick, 
depending on variety. 

Characteristics. The percentage of soluble solids, 
which are mostly sugars, in cooked winter squash is 
commonly 8-10% and may be as high as 16%, putting 
squash in the same range as good melons. Squash is 
a good source of calories and of vitamin A, particu- 
larly those with dark orange flesh, which are high in 
carotene. Varieties tend to have their own character- 
istic flavors: the greatest difference is between the 
Table Queen group and the other winter squashes. 
Plants of winter squash are characteristically viny 
in habit with runner lengths of 12 to 20 ft G.6 to 
6 m) under good growing conditions, but bush or 
semiviny forms have been developed in some of the 
types. The bush forms of winter squash tend to be 
lower in solids and quality, especially if they set a 
heavy load of fruit. 

There is some confusion in terminology between 
winter squash and pumpkin, and the use of these 
names differs from place to place. Both names refer 
to mature fruits, and varieties called squash and oth- 
ers called pumpkins may be found in all four Cucur- 
bita species. In the United States, varieties eaten as 
a vegetable are generally called squash, while those 
used for pies, stock feed, or autumn decorations are 
usually called pumpkins. Pumpkins tend to be round 
with orange exterior color and flesh that is paler, 
more fibrous, and less sweet than squash. See PUMP- 
KIN. 

Propagation. Squash is propagated by seed and its 
flowering habit is monoecious, that is, with sepa- 
rate male and female flowers on the same plant. 
Male flowers usually appear several days before the 
first female. Pollination is done naturally by insects, 
and within a species types which look very differ- 
ent cross readily with each other. However, the fruit 
is not affected in the year crossing occurs. Natural 
crossing occurs only rarely between the different 
species. Separation of the sexes makes controlled 
crossing relatively easy, and first-generation hybrids 
are increasingly being used as varieties, especially in 
summer squash where hybrid vigor is manifested in 
earlier flowering. One result of hybridization is that 
Butternut, Table Queen, Zucchini, and others which 
were originally single varieties have become groups 
of varieties with similar fruit shape but differing fruit 
sizes, colors, times of maturity, and plant sizes. See 
BREEDING (PLANT). 

Squash seed is usually planted directly in the field 
after the soil is warm, in rows 4-6 ft (1.2-1.8 m) 
apart for bush type and 6-9 ft (1.8-2.7 m) apart 
for vining types. Plants may be spaced from 2-3 ft 
(0.6-0.9 m) apart in both cases. In cool regions, 
plants may be started in a greenhouse, provided they 
are grown in containers which permit transplanting 
without much disturbance of roots and before they 
become too large. The best size for transplanting may 


be reached in 10-14 days under good growing con- 
ditions. 

Harvesting and storage. Summer squash begins to 
bear in 50-60 days from planting. The fruit grows 
very rapidly and must be harvested at frequent inter- 
vals (3-4 times per week) if one wishes the plants to 
continue bearing for an extended period. 

Winter squash should be harvested when fully ma- 
ture for best quality. Unless planted very early, it is 
often left in the field until the first light frost occurs, 
then harvested before the exposed fruit can be dam- 
aged by subsequent frost. Storage temperatures of 
50-60°F (10-16°C) are best because chilling injury 
occurs at lower temperatures and shortens storage 
life. For all varieties except Table Queen, a curing 
period of 1-2 weeks at 70-80°F (21-27°C) may im- 
prove quality by hastening conversion of starch to 
sugar. H. M. Munger 

Diseases. There are more than 40 diseases of 
squash and about 16% of the annual crop is lost due 
to diseases. Diseases caused by the mosaic group of 
viruses are of primary importance because of their 
widespread occurrence in all areas where the crop 
is grown, both in the United States and elsewhere. 
These viruses produce such striking symptoms in in- 
fected plants, such as malformation and mottling of 
the leaves (Fig. 2) and distortion and abnormal dis- 
coloration of the fruits (Fig. 3). These viruses can 
be transmitted by seed, insects, and mechanical con- 
tact. Effective control measures are the use of virus- 
free seed, eradication of weed hosts of the viruses, 
and elimination of insect vectors. 

Seed decay is common when squash is planted in 
cool wet soil, so treatment of the seed with registered 
fungicides is essential. The fungus Fusarium solani 
causes stunting, yellowing, and wilting of squash 
plants. The most effective control measures against 
this disease are the use of noncontaminated seed 
and extended crop rotations. In the arid southwest- 
ern United States and in squash production areas 
throughout the world with similar climates, pow- 
dery mildew, Sphaerotheca fuliginea, is a severe 
production-limiting disease, This disease can be eas- 
ily recognized by the abundant production of white 
masses of spores on the leaves of infected plants 
so that they appear to have been sprinkled with a 





Fig. 2. Squash leaves, mottled by mosaic virus. 
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Fig. 3. Crookneck squash. (a, b) Fruit mottled by mosaic 
virus. (c) Healthy fruit. 


heavy white powder. In the late 1960s, effective sys- 
temic fungicides were developed that gave excellent 
control of this disease. By the mid-1970s, however, 
the fungus had begun to develop strains resistant to 
these fungicides, and growers returned to the use 
of older, more conventional, protective fungicides. 
There have been efforts by plant breeders to develop 
squash varieties and hybrids with genetic resistance 
to powdery mildew. 

When heavy, prolonged dew periods are preva- 
lent during the growing season, the downy mildew 
fungus, Pseudoperonospora cubensis, severely 
defoliates the plants. Since no effective level of re- 
sistance to this disease is available in commercial va- 
rieties, the only practical control procedure is the ap- 
plication of protective fungicides at regular intervals 
throughout the growing season. See PLANT PATHOL- 
OGY. Claude E. Thomas 

Bibliography. D. M. Bates, R. W. Robinson, and C. 
Jeffrey (eds.), Biology and Utilization of the Cucur- 
bitaceae, 1989. 


TA 
Squatiniformes 

An order of bizarre sharks, commonly known as 
angel sharks, comprising one family, one genus 
(Squatina), and 15 species that superficially resem- 





Pacific angel shark (Squatina californica). (Photo by Tony Chess/NOAA Fisheries, 
Southwest Fisheries Science Center) 


ble rays (see illustration). The following combina- 
tion of characters identify the angel sharks: body 
flattened dorsoventrally; pectoral fins winglike and 
spread forward alongside head but not attached to 
it; mouth almost terminal; nostrils terminal, with 
barbels on anterior margin; eyes and large spira- 
cles on top of head; five gill slits partly on sides of 
head and partly on ventral surface; two small spine- 
less dorsal fins on tail; anal fin absent; and lower 
lobe of caudal fin longer than upper lobe. Maximum 
lengths of three species vary from 63 to 89 cm (25 to 
35 in.), and 12 species vary from 120 to 200 cm 
(47 to 79 in.). 

Angel sharks are found primarily in cool temper- 
ate to tropical zones of the eastern and western 
Pacific, eastern and western Atlantic, and extreme 
southwestern Indian Ocean, but they do not occur 
in most of the Indian Ocean or in the central Pacific 
reaches of Oceania (that is, the archipelagoes and 
islands of the central and south Pacific). They have a 
wide depth range, from nearshore to the upper con- 
tinental slopes at 1300 m (4265 ft). They feed on a va- 
riety of small bony fishes, crustaceans, cephalopods, 
gastropods, and bivalves by using their highly pro- 
trusible jaws to snap up prey at high speed. See ELAS- 
MOBRANCHII; SELACHII. Herbert Boschung 

Bibliography. L. J. V. Compagno, Checklist of 
Chondrichthyes, pp. 503-547 in W. C. Hamlett 
(ed.), Reproductive Biology and Phylogeny of 
Chondrichthyes: Sharks, Batoids, and Chimaeras, 
Science Publishers, Enfield, NH, 2005;L. J. V. 
Compagno, Checklist of living elasmobranches, 
pp. 471-498 in W. C. Hamlett (ed.), Sharks, Skates, 
and Rays: The Biology of Elasmobranch Fishes, 
Johns Hopkins University Press, Baltimore, 1999; 
L. J. V. Compagno, FAO Species Catalogue, vol. 4: 
Sharks of the World: An Annotated and Illustrated 
Catalogue of Shark Species Known to Date, Part 
2, Carcharhiniformes, FAO Fish. Synop. no. 125, 
pp. 251-655, 1984; J. S. Nelson, Fishes of the World, 
4th ed., Wiley, New York, 2006. 
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Squeezed quantum states 


Quantum states for which certain variables can be 
measured more accurately than is normally possible. 

All matter and radiation fluctuate. Much random 
fluctuation derives from environmental influence, 
but even if all these influences are removed, there 
remains the intrinsic uncertainty prescribed by the 
laws of quantum physics. The position and momen- 
tum of a particle, or the electric and magnetic com- 
ponents of an electromagnetic field, are conjugate 
variables that cannot simultaneously possess defi- 
nite values (Heisenberg uncertainty principle). It is 
possible, however, to have the position of a particle 
more and more accurately specified at the expense 
of increasing momentum uncertainty; the same ap- 
plies to electromagnetic field amplitudes. This free- 
dom underlies the phenomenon of squeezing or the 
possibility of having squeezed quantum states. With 
squeezed states, the inherent quantum fluctuation 


may be partly circumvented by focusing on the less 
noisy variable, thus permitting more precise mea- 
surement or information transfer than is otherwise 
possible. See UNCERTAINTY PRINCIPLE. 

Vacuum and laser light. According to quantum elec- 
trodynamics, the vacuum is filled with a free elec- 
tromagnetic field in its ground state that consists of 
fluctuating field components with significant noise 
energy. If ¢ is a phase angle and a(ġ) and alọ + 
(t/2)] are two quadrature components of the field 
(for example, the electric and magnetic field ampli- 
tudes), the vacuum mean-square field fluctuation is 
given by Eq. (1), independently of the phase angle. 


i 
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Equation (1) is normalized to a photon; the corre- 
sponding equivalent noise temperature at optical fre- 
quencies is thousands of kelvins. Equation (1) also 
gives the general fluctuation of an arbitrary coherent 
state, which is the quantum state of ordinary lasers. 
Further environment-induced randomness is intro- 
duced in addition to Eq. (1) for other conventional 
light sources, including light-emitting diodes. See CO- 
HERENCE; ELECTRICAL NOISE; LASER; QUANTUM ELEC- 
TRODYNAMICS. 

Squeezed-state light. In a squeezed state, the 
quadrature fluctuation is reduced below Eq. (1) for 
some ġ, as given in Eq. (2). At that point, squeez- 
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4 
ing, that is, reduction of field fluctuation below the 
coherent state level, occurs. The fluctuation of the 
conjugate quadrature is correspondingly increased 
to preserve the uncertainty relation, Eq. (3). Ina two- 
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photon coherent state, or squeezed state in the nar- 
row sense, Eq. (3) is satisfied with equality. As a con- 
sequence, for a fixed energy E, the field quadrature 
signal-to-noise ratio is maximized by a two-photon 
coherent state. In particular, it becomes 4ECE + 1) 
compared to 4£ for a coherent state. The two-photon 
coherent states are nothing but a slight generaliza- 
tion of the well-known minimum uncertainty states 
already studied by E. Schrédinger. The designation 
squeezed state is partly derived from the fact that 
the noise circle of Eq. (1) is squeezed to an ellipse 
when Eq. (3) is satisfied with equality (see illus.). 
Such states are called two-photon coherent states 
partly because only even counts of photons are pos- 
sible for them when the mean field amplitude is re- 
moved, which is very characteristically a quantum 
feature. 

Generation and detection. Squeezed light can be 
generated by a variety of processes, especially non- 
linear optical processes. To detect squeezing, an 
improved form of optical homodyne detection is em- 
ployed called balanced homodyning. The first suc- 
cessful experimental demonstration of squeezing, in 
1985, involved a four-wave mixing process in an 


Squeezed quantum states 






D 
~ squeezed state 


~ vacuum state 


Field-amplitude fluctuation (Aat4(¢)) as a function of 
phase angle ¢. The noise circle of the vacuum state, Eq. (1) 
in the text, is squeezed to an ellipse according to Eq. (3). 


atomic beam of sodium atoms. Since then, squeez- 
ing has been observed in a variety of other systems; 
the most pronounced was seen in parametric oscilla- 
tion and amplification with about a factor of 4 below 
Eq. (1). See NONLINEAR OPTICAL DEVICES; NONLIN- 
EAR OPTICS. 

Potential applications. Squeezing was first studied 
in connection with optical communication, although 
it was evident that reduced quantum fluctuation 
might find applications in precision measurements. 
Utilization of squeezed light has been proposed 
for interferometry, gravitational-wave detection, and 
spectroscopy, among other areas. The full realiza- 
tion of these applications is still remote. See OPTICAL 
COMMUNICATIONS. 

Intensity squeezing. A phenomenon closely related 
to squeezing, which is also exhibited by some two- 
photon coherent states, is antibunching, where the 
photon number fluctuation in a sub-poissonian state 
is reduced below the coherent-state Poisson level. 
This sub-poissonian behavior is sometimes called am- 
plitude squeezing or intensity squeezing. The gener- 
ation, propagation, and detection of sub-poissonian 
light and squeezed light have similar problems; ap- 
plications are similar and complementary. Strongly 
sub-poissonian light has been observed from 
constant-current-driven semiconductor lasers. See 
DISTRIBUTION (PROBABILITY). 

Coherence and loss. Squeezed light and sub- 
poissonian light constitute the most important 
classes of nonclassical light, specifically, those lights 
whose quantum state cannot be represented as a ran- 
dom superposition of coherent states. Nonclassical 
light with space-time coherence can be generated 
by nonlinear optical processes. On the other hand, 
the quantum fluctuation characteristic is sensitive to 
the losses introduced during propagation and de- 
tection. Loss not only attenuates the signal energy 
on nonclassical light, but it also mixes in vacuum 
(coherent-state) fluctuation quantum-mechanically, 
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thus degrading the desirable quantum noise char- 
acteristic of nonclassical light. This could be a se- 
rious problem for obtaining good sources and for 
many applications. The loss problem could be alle- 
viated in some situations by using appropriate quan- 
tum amplifiers, specifically, parametric amplifiers for 
squeezed light and photon-number amplifiers for 
sub-poissonian light. These amplifiers, in particular 
the photon-number amplifier, are not yet sufficiently 
developed for such applications. See PARAMETRIC 
AMPLIFIER. 

Contractive states. In the context of a particle, the 
position and momentum take the roles of a(@) and 
al + (/2)]. A two-photon coherent state is then 
just a slightly generalized minimum-uncertainty wave 
packet. A subclass of these states called contractive 
states have been found to be of potential importance 
in gravitational-wave detection via the motion of a 
free mass. It has been shown that quantum measure- 
ments involving such states could, in principle, lead 
to arbitrarily precise monitoring of the position of a 
free mass, which was formerly thought to be impossi- 
ble. See NONRELATIVISTIC QUANTUM THEORY; QUAN- 
TUM MECHANICS. Horace P. Yuen 

Bibliography. R. Loudon and P L. Knight, Squeezed 
light, J. Mod. Opt., 34:709-759, 1987; Special issue: 
Quantum noise reduction in optical systems: Exper- 
iments, Appl. Phys. B, vol. 55, no. 3, 1992; Special 
issue: Squeezed states of the electromagnetic field, 
J: Opt. Soc. Amer. B, vol. 4, no. 10, 1987; H. P. Yuen, 
Two-photon coherent states of the radiation field, 
Phys. Rev. A, 13:2226-2243, 1976. 
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Squid 

The common name applied to cephalopods of the 
order Teuthoidea. They are marine mollusks that in- 
habit the oceans of the world. Squids (see illustra- 
tion) are characterized by having eight arms and two 
longer tentacles around the mouth; an elongated, 
tapered, usually streamlined body; an internal rod- 
or bladelike shield (gladius); and fins on the body 
(mantle). The arms have two (infrequently four or 
six) rows of suckers and occasionally clawlike hooks, 
and the tentacles have terminal clubs with suckers, 
hooks, or both. The muscular, elastic tentacles are 
contractile, not retractile into pockets like those of 
cuttlefishes (Sepioidea). See SEPIOIDEA. 





Short-finned squid (Illex borealis). (Donald Flescher; image courtesy of Marine Biological 
Laboratory Archives) 


Squids have an exceptionally well-developed brain 
and organs of the central nervous system that ap- 
proach in complexity and function those of fishes 
and even some birds and mammals. The eyes are 
large and provide acute vision. Thousands of pig- 
ment cells (chromatophores) in the skin are individu- 
ally innervated to accommodate very rapid color and 
color pattern changes important in camouflage and 
behavior. Highly innervated photophores, capable 
of producing bioluminescent light, may be present 
on the skin, arms, or tentacles, or on the internal 
organs and eyes, and function to warn predators, 
eliminate the silhouette of deep-sea forms, or en- 
able recognition of species or sexes. See BIOLUMI- 
NESCENCE; CHROMATOPHORE; NERVOUS SYSTEM (IN- 
VERTEBRATE); PHOTORECEPTION. 

Squids are active, powerful swimmers, driven by 
jet propulsion as water taken into the mantle cav- 
ity is forcefully expelled through the funnel. Prey, 
normally shrimps, fishes, or other squids, are cap- 
tured with the two tentacles and held with the arms 
while the beaks, like parrot beaks, cut off bites that 
the radula and tongue shove down the throat. Squids 
are preyed upon by toothed whales (sperm whales, 
dolphins), pinnipeds (seals, sea lions), fishes (tunas, 
mackerels, swordfish, groupers, sharks), and pelagic 
birds (petrels, albatrosses). 

Two groups (suborders) of squids are recognized: 
Myopsida, like Loligo pealei and Lolliguncula bre- 
vis from the east coast of North America and the 
Caribbean and Loligo opalescens from the west 
coast of North America; and Oegopsida, the open- 
ocean and deep-sea forms like J//ex illecebrosus of 
the western North Atlantic ocean and Dosidicus 
gigas of the eastern Pacific Ocean from off Chile to 
California. 

Squids are important in biomedical research, es- 
pecially that concerning neurophysiology and bio- 
physics, in which the giant nerve axon plays a 
major role. Squids are eaten around the world; an- 
nual catches total about 1 million metric tons. An 
excellent source of protein, squid is very low in fats 
and high in essential minerals. See CEPHALOPODA; 
COLEOIDEA; TEUTHOIDEA. Clyde F. E. Roper 
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Hanlon and J. B. Messenger, Cephalopod Behaviour, 
Cambridge University Press, 1996; J. B. Messenger, 
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SQUID 


An acronym for superconducting quantum interfer- 
ence device, which actually refers to two different 
types of device, the dc SQUID and the rf SQUID. 
The dc SQUID consists of two Josephson 
tunnel junctions connected in parallel on a 
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Fig. 1. Direct-current (dc) SQUID with enclosed magnetic 
flux ®. / = applied current; V = generated voltage. 






































superconducting loop (Fig. 1). A small applied cur- 
rent flows through the junctions as a supercurrent, 
without developing a voltage, by means of Cooper 
pairs of electrons tunneling through the barriers. 
However, when the applied current exceeds a cer- 
tain critical value, a voltage is generated. When a mag- 
netic field is applied so that a magnetic flux threads 
the loop, the critical value oscillates as the magnetic 
flux is changed (Fig. 2), with a period of one flux 
quantum, ®y = b/(2e) © 2.07 x 1071 weber, where 
h is Planck’s constant and e is the electron charge. 
The oscillations arise from the interference of the 
two waves describing the Cooper pairs at the two 
junctions, in a way that is closely analogous to the 
interference between two coherent electromagnetic 
waves. See INTERFERENCE OF WAVES; JOSEPHSON EF- 
FECT; SUPERCONDUCTIVITY. 

The rf SQUID consists of a single junction inter- 
rupting a superconducting loop. In operation, it is 
coupled to the inductor of an ZC-tank circuit ex- 
cited at its resonant frequency by a radio-frequency 
(rf) current. The rf voltage across the tank circuit 
oscillates as a function of the magnetic flux in the 
loop, again with a period of one flux quantum. Al- 
though SQUIDs were for many years operated while 
immersed in liquid helium, ceramic superconduc- 
tors with high transition temperatures make possible 
devices operating in liquid nitrogen at 77 K. 

SQUIDs have important device applications. Usu- 
ally with the addition of a superconducting input 
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Fig. 2. Maximum supercurrent /, versus ®/®, for dc SQUID, 
where ® = enclosed magnetic flux, ®y = flux quantum. 


circuit known as a flux transformer, both dc and rf 
SQUIDs are used as magnetometers to detect tiny 
changes in magnetic field. The output of the SQUID 
is amplified by electronic circuitry at room temper- 
ature and fed back to the SQUID so as to cancel 
any applied flux. This makes it possible to detect 
changes in flux as small as 10~° of one flux quantum 
with SQUIDs based on low-transition-temperature 
superconductors, corresponding to magnetic field 
changes of the order of 1 femtotesla in a 1-hertz band- 
width. Suitable modifications to the input circuit en- 
able the SQUID to measure other physical quantities, 
including voltages, displacement, or magnetic sus- 
ceptibility. SQUIDs are also used for logic and switch- 
ing elements in experimental digital circuits and 
high-speed analog-to-digital converters. See ANALOG- 
TO-DIGITAL CONVERTER; INTEGRATED CIRCUITS; MAG- 
NETOMETER; SUPERCONDUCTING DEVICES. 
John Clarke 
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1994; T. Van Duzer and C. W. Turner, Principles 
of Superconductive Devices and Circuits, 2d ed., 
1999. 
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Squirrel 

A rodent in the family Sciuridae. This family includes 
all tree squirrels and flying squirrels, as well as squir- 
rels that live on the ground such as the woodchuck, 
marmot, chipmunk, and prairie dog. There are 
51 living genera and 272 species living in a wide va- 
riety of habitats ranging through tropical rainforests, 
tundra, alpine meadows, and semiarid deserts in 
Europe, Asia, Africa, North America, and most of 
South Africa. They are absent from Australia, Mada- 
gascar, Polynesia, and the Sahara Desert. Locomo- 
tion is plantigrade (walking with the whole sole of 
the foot touching the ground). Sensitive vibrissae are 
present on the head, feet, and outsides of the legs. 
The dental formula is I 1/1, C 0/0, Pm 1-2/1, M 3/3 
for a total of 20 or 22 teeth. The incisors grow con- 
tinually throughout the animal’s life. See CHIPMUNK; 
DENTITION; MARMOT; PRAIRIE DOG; RODENTIA. 

Tree and flying squirrels. Tree and flying squirrels 
are mostly small and slender-bodied with bushy tails 
often as long as their bodies (see illustration). The 
tail is primarily used to maintain and correct the bal- 
ance of the animal in leaping from branch to branch, 
but it may also be used as a rudder when leaping, as 
a flag to communicate social signals, and as a wrap- 
around blanket when the animal sleeps. Soft pads are 
present on the soles of the feet, and sharp claws are 
present on all toes. These squirrels range in size from 
the African pygmy squirrel (Myosciurus pumilio) of 
Cameroon, Gabon, Congo, and Equatorial Guinea, 
which has a body length of about 60-70 mm (2- 
3 in.), a narrow tail of about 50-60 mm (2 in.), and 
weighs about 16 g (0.6 oz), to the large Oriental giant 
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Delmarva fox squirrel (Sciurus niger cinereus). (Photo by W. H. Julian/U.S. Fish and 
Wildlife Service) 


squirrel (Ratufa) of India, Nepal, Borneo, and the 
Malay Peninsula, which may reach a length of over 
0.9 m G ft) and weigh up to 3 kg (6.5 1b). Habi- 
tats vary considerably. The European pine squirrel 
(Sciurus vulgaris) and the North American red squir- 
rel (Tamiasciurus hudsonicus) dwell primarily in 
pine and evergreen forests, whereas the gray squirrel 
(S. carolinesis) of eastern North America is a resi- 
dent of groves of oak, maple, and hickory trees. The 
largest tree squirrel in eastern North America, the 
fox squirrel (S. niger), favors open upland groves of 
oak, hickory, beech, and longleaf pine rather than 
the deep unbroken forest. 

Tree squirrels are agile and are active during the 
day, unlike flying squirrels, which are nocturnal and 
have large eyes for seeing in dim light. They hoard 
nuts, seeds, and fruits in holes in trees or in un- 
derground storage sites. Food caches are located by 
memory and also by the keen sense of smell. Nests 
are constructed primarily of leaves and twigs and 
are located either in tree cavities or in the fork of a 
tree. Tree squirrels are active all year, although they 
may remain in their nests for extended periods dur- 
ing severe winter weather. Breeding usually occurs 
in February with litters being born in March or April. 
Newborn squirrels are hairless, blind, and helpless. 
Their eyes open in about 5 weeks. 

Other tree squirrels include the flame and pygmy 
squirrels of South America; the Indian palm (Funam- 
bulus) and common oriental (Callosciurus) squit- 
rels of Oriental Asia and the Pacific Islands; and 
the oil palm (Protoxerus), pygmy (Myosciurus), sun 
(Heliosciurus), and bush (Paraxerus) squirrels of 
Africa. 

Flying squirrels are not really fliers but gliders. 
Their “flying” equipment consists of a thin, loose 
furry membrane (patagium) extending along each 
side of the body between the fore and hind limbs. 
As the animal launches into the air from some lofty 


bough, it spreads its legs, drawing the membranes 
taut. This wide expanse gives the squirrel enough 
support to permit it to glide in a downward course 
for a considerable distance. Before landing, the squir- 
rel checks its speed by manipulating its tail and 
comes to rest head up on the trunk of a tree. The 
flying squirrel cannot sustain its flight for any long pe- 
riod of time; it lacks propelling organs like the wings 
of a bat or bird. It can, however, glide through the 
air up to a distance of 80 yards (73 m) or more. Fly- 
ing squirrels are found in forested regions of Europe, 
Asia, and North America south to Honduras. They 
are absent from South America, Africa, and Australia. 
Nests are usually constructed in tree cavities but may 
be in birdhouses. Flying squirrels are active all year 
and feed on nuts, seeds, berries, and insects. Lichens 
and fungi may be eaten during the winter. On occa- 
sion, they kill and eat sleeping birds and bird eggs. 

Ground squirrels. Ground-dwelling squirrels are 
usually larger and more heavy-bodied than tree squir- 
rels. They have powerful forelimbs and large claws 
for digging. They live in subterranean burrows or 
have their underground dens among rocks. Burrows 
usually consist of a nest chamber and several side 
rooms. Ground-dwelling squirrels are omnivorous. 
Grasses, clover, forbs, flowers, and bulbs make up the 
bulk of the diet, but insects, bird eggs, and even car- 
rion may also be eaten. They are diurnal, and some, 
such as woodchucks and marmots, hibernate. The 
14 species of marmots (Marmota) are found 
throughout most of the United States and Canada 
as well as in the Alps of Europe and eastward in the 
mountains to China and southeastern Siberia. 

The five species of prairie dogs (Cynomys) are 
stout bob-tailed rodents that were famous for their 
“towns” in the midwestern United States. These 
were actual underground cities with miles of well- 
worn tunnels and dens extending in every direction. 
Some large towns may have had millions of inhabi- 
tants. A few towns still remain, but prairie dog popu- 
lations were decimated by cattle ranchers who killed 
them with strychnine because they consumed the 
same forage as the herds of cattle. 

Chipmunks (Tamias) inhabit sunlit woodlands 
and open pastures. They favor stone walls, rocks, 
and fallen timber. The 25 species are found in Asia 
and North America. Chief foods are seeds, grains, 
nuts, and berries. Food is transported in their cheek 
pouches to underground storage sites. 

Ground squirrels (Spermophilus, Ammosper- 
mophbilus, Atlantoxerus, Xerus, and Spermopbilop- 
sis) are robust mammals that have rounded heads, 
short ears, short legs, and somewhat bushy tails that 
are short to medium in length. The 51 species are 
social animals that live in densely populated towns 
beneath the ground. They feed largely on green vege- 
tation, seeds, grasshoppers, cicadas, and many other 
insects. 

Most ground squirrels hibernate, with some en- 
tering hibernation as early as July and August and 
not coming out until February. One litter of 7 to 10 
young is produced annually. See HIBERNATION AND 
ESTIVATION. Donald W. Linzey 
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A binary star system in the Milky Way Galaxy consist- 
ing of a neutron star or a black hole accreting mat- 
ter in the form of a disk from an early-type compan- 
ion star. The most remarkable feature of the system 
is the presence of bipolar narrow outflows of mat- 
ter (jets), with a velocity of 26% the speed of light, 
whose ejection axes change with time. See BLACK 
HOLE; NEUTRON STAR. 

Discovery. SS 433 owes its name to a catalog pub- 
lished in 1977 by C. B. Stephenson and N. Sanduleak. 
They made an optical survey of the sky (the SS sur- 
vey), searching for sources with a peculiarity in their 
emission spectrum, namely a strong hydrogen emis- 
sion line. SS 433 is the 433rd object in their list of 
sources. Between 1967 and 1981, SS 433 was de- 
tected during other observing campaigns at differ- 
ent wavelengths, and appeared in many other cat- 
alogs as a radio source, an x-ray source, and as an 
optical variable star. Yet, the discovery that really 
drew the attention of scientists to this source was 
related to the characteristics of its hydrogen emis- 
sion lines. See ASTRONOMICAL ATLASES; ASTRONOM- 
ICAL SPECTROSCOPY; RADIO ASTRONOMY; VARIABLE 
STAR; X-RAY ASTRONOMY. 

Further optical observations of SS 433 in 1978 
showed, besides the very strong hydrogen emission 
line in the spectrum (the so-called Ha line, that is, 
the first atomic transition in the hydrogen Balmer 
series, at a wavelength of 656.3 nm), a few other 
lines whose wavelengths did not correspond to any 
known element. The extraordinary discovery, at first 
incomprehensible, was that the wavelengths of these 
“strange” lines also moved with time. A well-known 
effect that can cause shifts in the wavelength of a 
line is the Doppler effect. If the gas that emits a line 
is moving toward the observer, the line in the spec- 
trum appears at a shorter wavelength than the wave- 
length at which it has been emitted at rest with the 
gas (that is, blueshift of the line), while if the gas is 
moving away from the observer, the line appears at a 
larger wavelength (that is, redshift of the line). What 
was difficult to understand was the fact that these 
moving lines appeared always in pairs, and moved 
smoothly in a periodic pattern. The solution of the 
mystery came one year later. See ATOMIC STRUCTURE 
AND SPECTRA; DOPPLER EFFECT. 

Geometry of the jets. All the above mentioned un- 
usual spectral features of SS 433 can be well fit with 
a relatively simple geometrical model, the kinematic 
model (Fig. 1). The system is ejecting matter in form 
of two antiparallel narrow (that is, with an opening 
angle less than 5°) jets. These jets precess in a 21° 
half-opening angle cone with a period of about 162 
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Fig. 1. Doppler shifts of the “moving” Ha lines of (a) the approaching jet and (b) the 
receding jet of SS 433 versus the precession phase. The Doppler shift gives the ratio v/c, 
where v is the approach or recession velocity and c is the speed of light. The data are 
shown as open diamonds, and the solid lines are fits of the data using the kinematic 
model. (From S. S. Eikenberry et al., Twenty years of timing SS 433, Astrophys. J., 
561:1027-1033, 2001. © The American Astronomical Society. Reproduced by permission 


of the American Astronomical Society.) 


days. The components of the jets (probably discrete 
blobs of matter) move in a straight line away from 
the system. Since the ejection axes rotate on a cone 
(like a spinning top) and the angle between the pre- 
cession axes and the line of sight is about 78°, the 
projection of the jets onto the plane of the sky re- 
sults in a twisted trace (Fig. 2). The blobs of mat- 
ter have a constant velocity of about 0.26c, where 
c is the speed of light. For the first time, relativistic 
jets (that is, moving at a significant fraction of the 
speed of light, in this case 26%), which had already 
been discovered in distant galaxies, were observed 
relatively close-by, from a stellar object in our own 
galaxy. 

Electromagnetic radiation from jets. The physical 
processes involved in the first steps of collection, 
launch, acceleration, and collimation of matter in the 
form of a jet are still poorly understood. Most jets are 
thought to originate in the innermost regions of an 
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Fig. 2. Radio image of SS 433 obtained in 2003 with the Very Large Array. A contour plot 
of the intensity of radio emission is superimposed on a pixel image. The binary core is the 
brightest spot in the center, from which matter is outflowing in two anitparallel jets. On 
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the precessing jets. 


accretion disk, and the outflowing axis to lie in a 
direction approximately perpendicular to the disk 
plane. Jet models for SS 433 that can be tested by 
observations concern the description of the electro- 
magnetic radiation (light) emitted by the jets when 
they have already formed, say farther then 10° km 
(1 km = 0.6 mi) from the core of the system. 

Emission lines in the spectrum of SS 433 are ob- 
served not only in the optical band but also in x- 
ray spectra, revealing the presence in the jets of 
heavy elements (such as iron) and highly ionized 
atoms. Emission lines indicate that: (1) the jets of 
SS 433 comprise, besides light particles like elec- 
trons, atomic nuclei; and (2) the gas that emits lines 
is thermal (that is, the state of the matter can be de- 
scribed by a macroscopic quantity like the temper- 
ature). The process that emits the light observed in 
the radio band from the jets of SS 433 is, instead, 
nonthermal and is due to synchrotron emission: 
Charged particles wrapping around a magnetic field, 
accelerate and thus emit light. See SYNCHROTRON 
RADIATION. 

The model developed and generally accepted to 
describe the emission of radiation due to the thermal 
component of the jets is the adiabatic cooling model. 
The hypothesis of adiabatic cooling means that the 
gas ejected in the jets does not exchange energy with 
the environment. The gas has a certain density and 
temperature at the time of launch at the base of the 
jet, and both temperature and density decrease with 
the distance from the binary core only because of the 
expansion of the gas in a conical shape. The more 
dense and hot is the gas, the more it emits at shorter 
wavelengths. Close to the binary core, at distances 
less than 10°-107 km, the jets’ gas is at a temperature 
of about 10° K, and emits light in the x-ray band. Then 
the gas in the jets cools, and at distances of about 10"° 
km from the core the temperature is of the order of 
10% K and the gas emits optical radiation. (This is the 
only really model-independent constraint on temper- 


ature and distance, coming from optical spectral anal- 
ysis.) Based on the adiabatic cooling model, farther 
out the gas should be too cool to thermally emit opti- 
cal or x-ray radiation. The nonthermal radio emission 
superimposed on this thermal radiation is probably 
caused by the highly relativistic free electrons in the 
tail of the thermal distribution of the gas. 

The adiabatic cooling model breaks down beyond 
a certain distance from the binary core. In fact, spa- 
tially resolved x-ray observations of the jets of SS 433 
showed highly ionized Doppler-shifted iron emission 
lines, emitted in the jets at distances of about 10! 
km from the binary core, from gas with a tempera- 
ture of the order of 10° K. This evidence indicates 
that reheating of atoms takes place in the jet flow, 
which is still moving with relativistic velocity more 
than 100 days after launch from the binary core 
(Fig. 3). 

A possible explanation has been suggested for this 
reheating. Random variations of about 15% in the ve- 
locity of the blobs launched in the jets are inferred 
from long-term optical monitoring of the source. 
Therefore, a faster blob could catch up with a slower 
blob launched 162 days before with the same pre- 
cession phase; this would produce a shock and a re- 
heating of matter in the jet a few hundred days after 
its launch from the binary core. See SHOCK WAVE. 

Further out, at distances of 10!4-10! km (10- 
100 light-years) from the binary core, x-ray observa- 
tions show bright termination shocks, caused by the 
matter in the cold jets that hits the dense interstellar 
medium. This process stops the jets’ run, releasing 
frictional energy in form of light. At those distances 
the two jets deform, like a seashell, the surround- 
ing W50 nebula, which is thought to be a supernova 
remnant associated with the formation of the neu- 
tron star or black hole in SS 433. See INTERSTELLAR 
MATTER; NEBULA. 

Binary system. As already noted, SS 433 is an x-ray 
binary system, that is, a binary star system in which 
a neutron star or a black hole is accreting matter in 
the form of a disk from a companion star. It is also 
an eclipsing binary system with an orbital period of 
about 13 days. Astronomers agree that the compan- 
ion star is an early-type high-mass star (O-type or A- 
type stars are among the current hypotheses), with 
a mass that should exceed 5 times the mass of the 
Sun. However, the proper identification and classifi- 
cation of this star, and consequently (using the so- 
called mass function) the classification of the com- 
pact object as a neutron star or a black hole, is still 
controversial. In fact, the binary system is imbued 
with a dense envelope of gas probably coming from 
a strong stellar wind from the high-mass companion 
star. This envelope makes difficult any deep inves- 
tigation of the two stars of the system. See BINARY 
STAR; ECLIPSING VARIABLE STARS. 

Equatorial outflow. Radio observations have also 
shown outflows of matter in a direction perpendic- 
ular to the jets’ axes, in the disk plane. This equa- 
torial outflow can be associated to a windlike emis- 
sion from the system (the high-mass companion) or 
to an extension of the accretion disk. It is not clear 
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Fig. 3. X-ray observations of SS 433 performed in 2000 with the Chandra X-ray Observatory. (a) X-ray image. The binary core 
of SS 433 is indicated with a cross and is not bright due to an instrumental effect. The lobes of the x-ray jets are visible on 
either side of the core. The squares indicate the regions of the jets from which the energy spectra are extracted. (b) Energy 
spectrum of east lobe. (c) Energy spectrum of west lobe. The spectra (indicated by crosses) are consistent with a thermal 
continuum (indicated by solid color lines) and Doppler-shifted highly ionized emission lines (indicated by color dotted lines). 


yet if these outflows consist also of thermal matter, 
although x-ray observations seem to support this hy- 
pothesis. 

Based on the principle that the physical processes 
behind the formation and evolution of relativistic jets 
are the same regardless the scale of the astrophysical 
objects involved (either a supermassive black hole in 
the center of a distant galaxy or a stellar collapsed 
object in a galactic binary system), the discovery of 
SS 433 gave astronomers the first near-by laboratory 
to study in more detail one of the most powerful 
phenomena of the universe, relativistic jets of matter. 

Simone Migliari 

Bibliography. S.S. Eikenberry et al., Twenty years of 
timing SS 433, Astrophys. J., 561:1027-1033, 2001; 
B. Margon, Observations of SS 433, Annu. Rev. As- 
tron. Astrophys., 22:507-536, 1984; S. Migliari, R. 
Fender, and M. Méndez, Iron emission lines from ex- 
tended x-ray jets in SS 433: Reheating of atomic nu- 
clei, Science, 297:1673-1676, 2002. 
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Stability augmentation 


The alteration of the inherent behavior of a sys- 
tem. As an example, ships tend to exhibit sig- 
nificant rolling motions at sea. To dampen these 
rolling motions, a roll stabilization (feedback) sys- 
tem can be used. Such a system consists of a set 
of vanes (that is, small wings) extending outward 
from the hull, below the waterline. By varying the 
vane incidence angle relative to the hull, a hydro- 
dynamic lift is generated on the vane. The vanes 
are driven by a feedback system so that the rolling 
motions are opposed by creating positive lift on 
one side of the hull and negative lift on the other 
side. 

As a second example, nearly all satellites require 
some form of stability augmentation to help in keep- 
ing the antennas or sensors aligned with receiving 
equipment on Earth. The stability augmentation is 
effected by thrusters which receive their commands 
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Stability augmentation 


from a feedback system. See SPACECRAFT PROPUL- 
SION. 

As a third example, stability augmentation sys- 
tems are used on aircraft while airborne. This is 
usually achieved by a system which controls one 
or more flight-control surfaces (or engines) automat- 
ically without inputs from the pilot. The inherent 
stability and response behavior of many modern air- 
planes tends toward low damping or even instability. 
The physical reasons have to do with the configura- 
tion of the airplane and the combination of flight 
speed and altitude at which the airplane is operated. 
Several modern fighters and even some transports 
are intentionally designed with no or little inherent 
stability. There are a number of reasons for such a 
design condition. In the case of fighters, excellent 
maneuverability in combat is essential. By making a 
fighter intentionally inherently unstable, it is easy to 
design the control system so that load factors in pull- 
ups or in turns can be built up rapidly. In the case of 
transports, the motivation to design for little or no 
inherent stability is to lower the size of the tail and 
thereby achieve a reduction in drag and weight. See 
AIRPLANE; MILITARY AIRCRAFT. 

Yaw-damper operation. If, for whatever reason, an 
airplane is deficient in the damping of its response 
or is divergent instead of stable, the flying qualities 
are compromised. This can be corrected by installing 
a stability augmentation system. Yaw dampers were 
among the first such systems designed into airplanes. 
The dutch-roll motion of an airplane consists of si- 
multaneous oscillations of the bank (or roll) angle, 
the sideslip angle, and the heading angle; when this 
motion becomes poorly damped, it is difficult for a 
pilot to maintain manual control over the airplane 
when flying through turbulence. A yaw damper can 
usually solve this problem. It normally drives the rud- 
der with the help of a servo (electrohydraulic or elec- 
tromechanical). The servo receives its signal to move 
from a computer-amplifier combination. The com- 
puter has as its inputs the desired yaw rate and the 
actual yaw rate. The desired yaw rate of the airplane 
depends on the flight condition. If the desired condi- 
tion is straight-line flight, the desired yaw rate will be 
zero. If the desired flight condition is a turn, the de- 
sired yaw rate depends on the steepness of the turn 
but is not zero. The actual yaw rate of the airplane 
is sensed by a yaw-rate gyro. The desired yaw rate is 
normally derived from the automatic pilot computer. 
The error signal, which is the difference between the 
two inputs (Fig. 1), is gradually driven to zero so that 
the output rate of the yaw damper equals the input 
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rate. Generically, such systems are called feedback 
systems. See CONTROL SYSTEMS; FLIGHT CHARACTER- 
ISTICS; GYROSCOPE; SERVOMECHANISM. 

Contrast with flight control. The control exercised 
by the stability augmentation system contrasts with 
that exercised by the pilot. The pilot may be con- 
nected with the flight-control surface via a direct 
mechanical link. Alternatively, in many modern air- 
planes the pilot cockpit control movement is sensed 
by a position transducer. The output of the position 
transducer in turn is sent, via a computer-amplifier 
combination, to a hydraulic actuator, referred to 
as a servo, which drives the flight-control surface 
(Fig. 2). Command signals which come from the 
pilot or from the stability augmentation system are 
sent by wire (fly-by-wire) or by optical conduit fly- 
by-light) to the electromagnetic valve. A valve dis- 
tributes high-pressure hydraulic fluid either to the 
left or to the right of the piston so that the piston is 
forced to move. The piston in turn moves the flight- 
control surface. See FLIGHT CONTROLS. 

Types of aircraft systems. When an airplane flies 
through turbulence, an important task is keeping the 
wings level. This becomes a tedious job for a pilot 
during long flights. For that reason, even many small 
airplanes are equipped with an automatic control 
device (called a wing leveler) which keeps the wings 
level. 

Yaw dampers typically address lack of damping 
around the yaw axis (or z axis) of an airplane. 
Many airplanes also require improvement in damp- 
ing around the pitch axis (or y axis). Some even re- 
quire extra damping around the roll axis (or x axis). 
These dampers are referred to as pitch dampers 
and roll dampers, respectively. Several fighter and 
transport airplanes now have negative or low static 
longitudinal stability. Such problems can be ad- 
dressed by an angle-of-attack or load-factor feedback 
system. The reliability of these stability augmen- 
tation systems and the associated computer hard- 
ware and software has improved to the point where 
only three such systems are required in commercial 
airplanes. 

The early digital feedback systems still used fly-by- 
wire systems. Such systems are susceptible to elec- 
tromagnetic interference and therefore must be pro- 
tected (that is, hardened) against such interference. 
Because hardened systems are both costly and heavy, 
more advanced digital feedback systems are fly-by- 
light systems. 

With the introduction of fast in-flight digital com- 
puters, it has become possible to equip airplanes 
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Fig. 1. Block diagram for a yaw damper. The rudder deflects to oppose the error, and the airplane reacts with a different yaw 
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Fig. 2. Electrohydraulically powered elevator system. Hydraulic lines to the actuator are not shown. The actuator is attached 


to a horizontal stabilizer structure. 


with so-called full flight envelop protection systems. 
Such systems are designed to refuse any pilot input 
which might get the airplane into a flight condition 
from which recovery is no longer possible. Such sys- 
tems can easily be arranged to prevent a pilot from 
rolling a commercial airplane too much or to prevent 
the pilot from stalling the airplane. Such systems can 
also be arranged so that loads acting on the wing or 
tail do not approach dangerously high levels. In that 
case the system is referred to as a load-alleviation 
system. 

Flight simulations. A major design problem in all 
of these types of systems is assurance against un- 
foreseen failures or unforeseen actions by the pilot. 
Very detailed failure cause-and-effect analyses are car- 
ried out during the detailed design and development 
stages of modern flight-control systems. However, it 
is not always possible to foresee every possible type 
of failure or unintentional wrong action by the cock- 
pit crew. For that reason extensive so-called iron-bird 
simulations, with and without pilots in the loop, are 
carried out long before the first flight. During these 
simulations, all possible worst-case scenarios are, in 
effect, test-flown on the simulator, and the reaction 
of the system to these events in monitored. If neces- 
sary, design changes are introduced and tested. See 
AIRCRAFT TESTING; SIMULATION. Jan Roskam 

Bibliography. J. H. Blakelock, Automatic Control 
of Aircraft and Missiles, 2d ed., 1991; D. McRuer, 
I. Ashkenas, and D. Graham, Aircraft Dynamics 
and Automatic Control, 1974, paper 1990; R. C. 
Nelson, Flight Stability and Automatic Control, 2d 
ed., 1997; J. Roskam, Airplane Flight Dynamics and 
Automatic Control, parts I and II, 1995. 
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Stabilizer (aircraft) 


The horizontal or vertical aerodynamic wing surfaces 
that provide aircraft stability and longitudinal bal- 
ance in flight. Horizontal and vertical stabilizers (fins) 
are similar to the aircraft wing in structural design 
and function of providing lift at angle of attack to 


the wind. However, stabilizers are not required to 
supply lift to overcome aircraft weight during flight, 
and when the wing-fuselage center of pressure is be- 
hind the aircraft center of gravity, the aerodynamic 
load on the horizontal stabilizer may be downward. 

Stabilizer arrangements. Stabilizers may be swept 
back like wings for improved high Mach-number 
characteristics. The horizontal stabilizer may be 
found in a conventional or tail-last arrangement or 
in a canard or tail-first arrangement, or it may be dis- 
pensed with to give a tailless arrangement. Vertical 
stabilizers, however, are invariably present at the rear 
of the aircraft in single or multiple units. Aircraft de- 
signed for very high speeds may have auxiliary verti- 
cal stabilizers or ventral fins below the wing or fuse- 
lage to avoid the large losses in directional stability 
that occur with conventional arrangements at posi- 
tive angles of attack at Mach numbers above about 
2. Various stabilizer arrangements are shown in the 
illustration. 

Stabilizing function. The stabilizing function of 
the horizontal and vertical stabilizers may be 
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Stabilizer (chemistry) 


represented by the equation below,where 0C,,,/da 


06m _ o Sra q 
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is the dimensionless rate of change of stabilizing 
or weathervane moment with angle of attack of 
sideslip, dC;/da is the dimensionless lift-curve slope 
of the isolated stabilizer based on stabilizer dimen- 
sions, and S’/S and /'// are the ratios of stabilizer area 
to wing area, and of distance from the stabilizer cen- 
ter of pressure to the aircraft center of gravity to a 
characteristic length, respectively. The quantities €, 
and q'/q are flow conditions at the stabilizer and are 
the downwash or sidewash and dynamic pressure, 
respectively, referred to the free stream. 

Control function. On aircraft provided with eleva- 
tor and rudder control surfaces, the stabilizers may 
be either fixed to the fuselage or adjustable in inci- 
dence at a slow rate (about 1°/s) for trim. In either 
case, the stabilizers provide the support for the ele- 
vator and rudder, carrying their loads to the fuselage. 
Supersonic aircraft and guided missiles generally dis- 
pense with hinged trailing-edge elevator and rudder 
control surfaces, using the stabilizers for control as 
well as stabilization. In this case the stabilizers are ac- 
tuated by the primary flight control system at rates up 
to about 50°/s. See FLIGHT CHARACTERISTICS; FLIGHT 
CONTROLS. Malcolm J. Abzug 

Bibliography. A. W. Babister, Aircraft Dynamic Sta- 
bility and Response, 1980; B. Etkin and L. D. Reid, 
Dynamics of Flight-Stability and Control, 3d ed., 
1995; R. C. Nelson, Flight Stability and Automatic 
Control, 2d ed., 1997. 
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Stabilizer (chemistry) 


Any substance that tends to maintain the physical 
and chemical properties of a material. Degradation, 
that is, irreversible changes in chemical composition 
or structure, is responsible for the premature failure 
of materials. Stabilizers are used to extend the useful 
life of materials as well as to maintain their critical 
properties above the design specifications. Oxygen 
and water are the principal degradants, but ultravio- 
let radiation also can have a significant effect (pho- 
todegradation). 

There are three general approaches to stabiliza- 
tion: surface modification, modification of chemical 
composition, and use of additives. The relative im- 
portance of these approaches varies with the mate- 
rial required to be protected and with the mode of 
degradation. 

Surface modification. Wood and metals are pro- 
tected against destructive weathering and corrosion 
by covering with paints or other surface finishes. 
Primers and rust-inhibiting paints are specialized for- 
mulations designed to provide corrosion resistance 
for metals. Aluminum is stabilized against oxidative 
degradation by a natural process in which an oxide 
film forms over the surface which then restricts 
oxygen penetration into subsurface layers. See COR- 
ROSION; PAINT. 


Laminated structures are also used to minimize 
degradation. For example, moisture-resistant poly- 
mers have been laminated over metals to provide 
corrosion resistance. In other instances, naturally re- 
sistant polymer films have been laminated over the 
surface of more sensitive polymers to stabilize the 
latter against photodegradation. Polymers have also 
been stabilized by crosslinking at the surface to re- 
strict penetration of reactants into the bulk of the 
material. 

The protection of food products by plastic wrap- 
pings is an extension of the approach to stabilization 
by surface modification. Plastic wrappings limit the 
amount of oxygen that reaches the food, thus inhibit- 
ing those oxidative reactions that lead to spoilage. 
See FOOD MANUFACTURING. 

Modification of composition. Stainless steels are sta- 
bilized steels. Modifying the structure of steel is done 
by alloying with chromium or nickel corrosive chem- 
ical reactions. See STAINLESS STEEL. 

Both natural and synthetic polymers degrade dur- 
ing processing and long-term exposure. Different 
polymers exhibit considerable variation in the level 
of stability to hostile environments, their stability 
being directly related to the dissociation energies of 
chemical bonds within the structure. When strong 
bonds are present, such as the C—F bonds in fluori- 
nated polymers, a greater level of stability is observed 
in contrast to that of polymers in which the weaker 
C—H bonds predominate. This accounts for the su- 
perior thermal stability of tetrafluoroethylene over 
that of hydrocarbon polymers, which exhibit signifi- 
cant differences in the strength of their C—H bonds. 
The weakest C—H bonds in polypropylene occur at 
the tertiary carbon atoms where methyl groups are 
attached. Since one such bond is present in each re- 
peating unit, polypropylene is much less stable to ox- 
idative degradation than is linear polyethylene. The 
stability of hydrocarbon polymers could be modified 
by controlling the extent of branching. However, an 
improvement in stability by this approach will be 
accompanied by changes in other potentially impor- 
tant properties, for example, modulus. 

There is evidence that some synthetic polymers 
have structural irregularities that could serve as sites 
for degradation to begin. Reduction of such irreg- 
ularities during synthesis or replacement with less 
labile groups could also be used as an approach to 
stabilization. See POLYMER. 

Additives. A wide variety of additives has been de- 
veloped to stabilize polymers against degradation. 
Stabilizers are available that inhibit thermal oxida- 
tion, burning, photodegradation, and ozone deterio- 
ration of elastomers. Research in the chemistry of the 
low-temperature oxidation of natural rubber has re- 
vealed that hydrocarbon polymers oxidize by a free- 
radical chain mechanism. In pure, low-molecular- 
weight hydrocarbons, an added initiator is required 
to produce the first radicals. In contrast, initiation of 
polymer oxidation occurs in the complex molecules 
of elastomers through impurities already present, 
for example, hydroperoxides. See CHAIN REACTION 
(CHEMISTRY); FREE RADICAL. 


Stabilization of hydrocarbon polymers can be 
accomplished with preventative or chain-breaking 
antioxidants. Preventative antioxidants stabilize by 
reducing the number of radicals formed in the initi- 
ation stage. Where hydroperoxides are responsible 
for initiation, the induced decomposition of these re- 
active intermediates into nonradical products pro- 
vides effective stabilization. Suppressing the catalytic 
effects of metallic impurities that increase the rate 
of radical formation can also provide stabilization. 
Chain-breaking antioxidants interrupt the oxidative 
chain by providing labile hydrogens to compete with 
the polymer in reaction with the propagating rad- 
icals. The by-product of this reaction is a radical 
which is not capable of continuing the oxidative 
chain. 

Stabilization of polymers against photooxidation, 
the principal component of outdoor weathering, is 
accomplished by addition of light screens, ultraviolet 
absorbers, or radical scavengers. Carbon black is the 
most effective of all light screens, functioning by pre- 
venting the penetration of ultraviolet radiation into 
the polymer. Correctly applied, this pigment can ex- 
tend the outdoor life of hydrocarbon polymers from 
1-2 years to over 50 years. Ultraviolet absorbers, 
which absorb and harmlessly dissipate damaging ra- 
diation, are not as effective as carbon black. How- 
ever, another class of additives, known as hindered- 
amine light stabilizers, has become available. These 
stabilizers are believed to function by scavenging 
destructive radicals, and they may approximate the 
protection that is provided by carbon black. See AN- 
TIOXIDANT; CARBON BLACK; INHIBITOR (CHEMISTRY); 
PHOTODEGRADATION. W. Lincoln Hawkins 
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2002; Y. A. Shlyapnikov, S. G. Kiryushkin, and A. P. 
Mar’in, Antioxidative Stabilization Of Polymers, 
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Stain (microbiology) 


Any colored, organic compound, usually called dye, 
used to stain tissues, cells, cell components, or cell 
contents. The dye may be natural or synthetic. The 
object stained is called the substrate. The small size 
and transparency of microorganisms make them dif- 
ficult to see even with the aid of a high-power mi- 
croscope. Staining facilitates the observation of a 
substrate by introducing differences in optical den- 
sity or in light absorption between the substrate 
and its surround or between different parts of the 
same substrate. In electron microscopy, and some- 
times in light microscopy (as in the silver impreg- 
nation technique of staining flagella or capsules), 
staining is accomplished by depositing on the sub- 
strate ultraphotoscopic particles of a metal such as 


Stain (microbiology) 


chromium or gold (the so-called shadowing process); 
or staining is done by treating the substrate with 
solutions of metallic compounds such as uranyl ac- 
etate or phosphotungstic acid. This type of stain- 
ing is not dealt with in this article. See ELECTRON 
MICROSCOPE. 

Stain classification. Stains may be classified accord- 
ing to their molecular structure. For example, there 
are the triphenylmethane dyes, such as the fuchsins 
and the methyl violets; the oxazine dyes, such as 
Nile blue; and the thiazine dyes, such as thionine 
and methylene blue. 

The stains may also be classified according to their 
chemical behavior into acid, basic, neutral, and indif- 
ferent. This classification is of more practical value 
to the biologist. An acid dye, such as Congo red and 
eosine, is usually the sodium or potassium salt of a 
dye acid; a basic dye, such as methylene blue and 
Nile blue, is usually the chloride or sulfate of a dye 
base; a neutral dye, such as the eosinate of methylene 
blue, is a complex salt of a dye acid with a dye base; 
and an indifferent dye, such as Sudan III, is one with 
little chemical activity. At ordinary pH values, cells 
stain with basic dyes; at low pH values, they stain 
with acid dyes. 

Staining procedures. The staining of microorgan- 
isms usually begins by making a smear. When it is 
desired to preserve intercellular relations, the smear 
may be replaced by a microculture. The smear or its 
substitute is usually fixed by heat and treated with a 
solution of some suitable biological stain as in sim- 
ple stains or, in more elaborate procedures, it may be 
chemically fixed, treated with mordants, stained, par- 
tially decolorized, and counterstained. In certain pro- 
cedures parts of a substrate are chemically removed 
prior to staining. 

Smear preparation. The material containing the mi- 
croorganisms is smeared or spread on the surface of 
a glass slide. Liquid material may be spread by means 
of a freshly flamed and cooled loop of platinum or 
some suitable alloy. Solid material may be smeared 
directly, or it may be used to make a suspension in 
water or some liquid medium, a loopful of which is 
spread on the slide. The film produced is allowed 
to dry in the air and is fixed by passing the slide, 
film side up, three or four times over the flame of a 
Bunsen burner. Some cytological procedures require 
chemical fixation. After cooling, the film is stained 
and examined. 

Simple stains. These are procedures in which the 
smear, or the substrate, is stained with a dye solu- 
tion for a given length of time, washed with water, 
air-dried, and examined. The most widely used solu- 
tions for simple staining are (1) Loffler’s methylene 
blue, a solution of 0.3 g of methylene blue in 30 ml of 
95% ethanol, mixed with 100 ml of 0.01% potassium 
hydroxide; (2) Ziehl-Neelsen’s carbol fuchsin, a solu- 
tion of 0.3 g basic fuchsin in 10 ml of 95% ethanol, 
mixed with a solution of 5 g phenol in 95 ml of dis- 
tilled water; and (3) Hucker’s crystal violet, prepared 
by mixing a solution of 0.2 g of crystal violet in 20 ml 
of 95% ethanol with a solution of 0.8 g ammonium 
oxalate in 80 ml of distilled water. 
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Complex staining procedures. Among complex 
staining procedures are the flagella and capsule 
stains. Both include the use of a special mordant to 
increase the density of their substrates and their affin- 
ity for the dye. This also increases the thickness of the 
flagella to a range that is resolved with the light micro- 
scope. There are many procedures for demonstrating 
both structures. In addition, most flagella stains rec- 
ommend precautions to prevent destruction of these 
delicate structures while making the smear. 

Nuclear stains. Most nuclear stains are based on the 
staining properties of nucleic acids. In many organ- 
isms, particularly among the bacteria, the cytoplasm 
is rich in ribonucleic acid. As a result, the entire cell, 
in young and mature cultures, tends to stain like a 
nucleus. Consequently, most nuclear stains include 
a step in which the ribonucleic acid is removed, or 
the bacteria are grown in media deficient in nitro- 
gen supply to hinder the synthesis of ribonucleic 
acid. 

Ribonucleic acid may be removed by treatment 
with the enzyme, ribonuclease, or a strong mineral 
acid. This is the basis of the widely used Feulgen re- 
action and of the HCl-Giemsa stain. In both methods 
the fixed smear is placed in normal HCl, usually at 
60°C for about 10 min. In the Feulgen procedure this 
is followed by placing the smear for several hours in 
a solution of basic fuchsin that has been decolorized 
with sodium or potassium metabisulfite. The nuclei 
appear purple to violet in a colorless cell. 

In the HCl-Giemsa procedure, the smear is stained 
with dilute Giemsa’s stain. Stock solutions of this 
stain contain chiefly the eosinates of methylene 
azure and of methylene blue dissolved in a mix- 
ture of methyl alcohol and glycerol. It is a valuable 
cytological stain because of its metachromatism; it 
stains chromatin purple to red and the cytoplasm 
blue. 

Differential stains. These are staining procedures 
that bring out color differences between a substrate 
and its background or between different parts of the 
same substrate. This may be accomplished by sim- 
ple staining with a metachromatic dye; Nile blue, for 
example, stains the neutral fat droplets red and the 
rest of the cell blue. Or it may require controlled 
decolorization or differentiation with acids, alkalies, 
neutral solvents, or some other agents. Examples of 
the latter procedures are Gram’s stain and acid-fast 
stain, and the spore stain. 

Spore stains. These are based on the fact that the 
spore does not take up dyes readily, but once stained 
it resists decolorization. The differentiating agent 
used may be a dilute solution of an organic acid, an 
acid dye, or another basic dye. 

Macchiavello stain. Another differential stain is that 
of Macchiavello for demonstrating rickettsiae. The 
fixed tissue smear is stained with basic fuchsin, 
differentiated with a 0.5% solution of citric acid, 
and counterstained with a 1% solution of methy- 
lene blue. Rickettsiae stain red, and tissue cells stain 
blue. 

Negative stain. Another differential stain is the neg- 
ative stain. In this procedure the cells appear col- 


orless against a colored background. The cells are 
suspended in a droplet of the negative stain, such as 
India ink, water-soluble nigrosine, or Congo red in 
1% aqueous solutions, on a glass slide. The droplet is 
spread out into a thin film and allowed to dry in the 
air. When pathogenic organisms are to be examined, 
the preparation is dipped in a mixture of 1 ml con- 
centrated hydrochloric acid GHCD and 100 ml of 95% 
ethanol. The acid alcohol changes Congo red prepa- 
rations to blue, giving better contrast. Georges Knaysi 

Fluorochrome stain. The fluorochrome staining pro- 
cedure is used for the early and rapid detection of 
disease-producing Mycobacterium tuberculosis and 
M. leprae, together with other species of Mycobacte- 
ria and Nocardia asteroides. The method has been 
used worldwide since the mid-1970s. Widely recog- 
nized as the truant auramine-rhodamine (A—R) acid- 
fast staining procedure, it not only permits the detec- 
tion of tubercle bacilli more quickly in clinical spec- 
imens, such as sputa, urines, and tissues, but also 
gives a higher yield of positive findings than the con- 
ventional Ziehl-Neelson acid-fast procedure, which 
employs carbol fuchsin as the primary dye. The tru- 
ant auramine-rhodamine procedure has been espe- 
cially popular and relatively inexpensive because of 
the improvement in the development of fluorescent 
microscopes. The auramine and rhodamine dyes ap- 
pear to have structures better suited for penetrating 
and stacking along the receptor macromolecules of 
the mycobacterial peptidolipid or peptidoglycolipid 
than does carbol fuchsin. The intracellular concen- 
tration possible with auramine and rhodamine may 
be greater than with carbol fuchsin, which may re- 
sult in the superior staining qualities. The interpreta- 
tion is that the cell wall lipids of mycobacterial cells 
in their native state containing mycolic acid react 
with the two fluorescing dye molecules of auramine 
and rhodamine and therefore absorb the stain. The 
dyes may well confer the property of acid fastness 
because they also function as a permeability barrier 
and impede the penetration of the bleaching agent 
or decolorizer. J. P. Truant 

Bibliography. M. E. Boon and J. S. Drijver, Rou- 
tine Cytological Staining Techniques: Background 
and Practice, 1986; G. Clark, Staining Procedures, 
áth ed., 1980; P R. Lewis and D. P. Knight, Stain- 
ing Methods for Sectioned Material, 1977; P. Mur- 
ray (ed.), Manual of Clinical Microbiology, 7th ed., 
1999. 
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The generic name commonly used for that entire 
group of iron-base alloys which exhibit phenome- 
nal resistance to rusting and corrosion because of 
chromium content. The metallic element chromium 
(Cr) has been used in small amounts to strengthen 
steel since the famed Eads Bridge spanned the Mis- 
sissippi at St. Louis, Missouri, in 1872; but only in 
the present century was it discovered that contents 
with chromium exceeding 10%, with carbon (C) held 
suitably low, make iron effectively rustproof. Onset 
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Fig. 1. Passivity phenomenon in stainless steel. 1200°C = 
2200°F. 


of the property is striking (Fig. 1), and the chromium 
can then be increased to about 27% with even fur- 
ther chemical advantages before the metal becomes 
structurally useless. See ALLOY; STEEL. 

Other alloy elements, notably nickel (Ni) and 
molybdenum (Mo), can also be added to the basic 
stainless composition to produce both variety and 
improvement of properties. Over 100 different stain- 
less steels are produced commercially, about half as 
standardized grades. Some are more properly classed 
as stainless irons since they do not harden as steel; 
others are true steels to which corrosion resistance 
becomes an added feature. Still others that are nei- 
ther properly steels nor irons introduce totally new 
classes of materials, from both mechanical and chem- 
ical standpoints. 

In fact, the great modern developments of heat- 
resisting superalloys and refractory metals stand as 
extreme extensions of the two heat-resisting fami- 
lies of stainless steels. That same resistance to rust- 
ing at ordinary temperatures shows as resistance to 
oxidation at elevated temperatures (Fig. 1), produc- 
ing a family of oxidation-resistant high-temperature 
alloys; and with this basic feature contributed by 
the chromium content, further additions of nickel 
and other elements contribute remarkable proper- 
ties of workability and strength, which extend not 
only to high temperatures but to cryogenic temper- 
atures as well. The nickel-base heat-resisting alloys, 
the most prominent in that field, project from stain- 
less steel in the sense that nickel increasingly re- 
places the iron (Fe). Similarly, the important titanium 
(Ti) and cobalt (Co) alloy systems, such refractory 
metals as niobium (Nb) and tungsten (W), and also 
chromium and molybdenum, represent the far ex- 
tremes for use of these alloy elements in stainless 
steel. 

The shaded band in Fig. 1 covers typical values for 
corrosion penetration of iron alloys with increasing 


chromium content, the actual data referring to hot 
nitric acid. The width of the band is due to differ- 
ences in temperature and concentration. Attack al- 
most ceases near 12% chromium, the alloy type 410; 
and it becomes scarcely measurable as chromium in- 
creases to type 446. The double-dash boundary in 
Fig. 1 is a plot of actual data for hot-gas scaling at 
2190°F (1200°C). The sudden drop occurs in nearly 
the same chromium range as for aqueous corrosion, 
type 446 again being the most resistant alloy. The 
reversal at the far left is also similar for aqueous cor- 
rosion. 

In service, the stainless steels cover three major 
fields: corrosion resistance at ordinary temperatures, 
scaling resistance at high temperatures, and mechan- 
ical strength over the entire temperature range of 
metallurgical engineering. 

Types. Metallurgically, the alloys fall within 
four groups: ferritic, martinsitic, austenitic, and 
precipitation-hardening. 

Ferritic. When chromium is added to iron to de- 
velop corrosion resistance without steel-type hard- 
ening, ferritic alloys are obtained. See IRON ALLOYS. 

The typical alloys in the table are standards listed 
by the American Iron and Steel Institute (AISD. The 
first is the least expensive, and a little aluminum 
(AD is usually added to help the chromium suppress 
steel-type hardening and ensure the ferritic condi- 
tion, since this avoids welding problems from stray 
martensite. The second is the most popular commer- 
cial ferritic grade, widely used for automotive trim; 
and some fractional martensitic hardening is delib- 
erately courted for additional strength. Type 446 is 
primarily used for oxidation resistance, for above 
this chromium content the engineering value of the 
chromium-iron alloys rapidly vanishes. 

Martensitic. When chromium is added to steels with 
carefully chosen carbon contents, the martensitic al- 
loys result (see table). Again, these cover a range 
from the least expensive and most widely used type 























Typical compositions of standard stainless steels 
in percent 
AISI no. Cr C 

Ferritic 

405 13 0.08 

430 A 0.12 

446 26 0.20 
Martensitic 

410 12 0412 

420 13 10.25: 

440-C I 1.00 

AISI no. Cr Ni Other elements 

Austenitic 

302 18 8 C=0.08 

310 25 20 c= 020 

316 16 1s Mo = 2.75 

347 18 a Nb = 0.50 
Precipitation-hardening 

630 (17-4 PH) ug 4 Cu = 3.2, Nb = 0.25 

633 (AM-350) 16.5 4.3 Mo=2.75,N=0.10 

(174-4 Cu Mo) I7 14 Cu = 3.0, Mo = 2.5 
NDS 050m S025 
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410, through the more selected type 420, ideal for 
cutlery, and on to the highly specialized type 440-C, 
whose great hardness makes it suitable for ball bear- 
ings and surgical instruments. 

Austenitic. Nickel becomes the next most impor- 
tant alloying element. Its commonest use is in the 
renowned “18-8” containing 18% Cr and 8% Ni. Un- 
like the atomic structure of ferrite, which has the 
form of a neatly body-centered cube, or the marten- 
site with its atomic disarrangement responsible for 
strength and hardness, the austenite of this third 
class of stainless steel has a face-centered cubic form 
made stable by the nickel. Completely new mechan- 
ical properties appear, outstanding for workability, 
notch toughness, high-temperature strength, and a 
frequently superior corrosion resistance. These are 
generally the most prized and popular of all stainless 
steels and also the most expensive. Only a few of the 
score of standards appear in the table. The parenthet- 
ical C for type 302 refers to a strange sensitivity to 
carbon content, such that 18-8 today is more com- 
monly type 304 with carbon held below 0.08, and 
even type 304 with carbon held below 0.03. Type 
310 lies well toward the nickel-base alloys and is ac- 
cordingly notable for its heat resistance. The molyb- 
denum in type 316 confers a special resistance to 
pitting corrosion, while the niobium in type 347 is 
a highly specialized addition correcting the carbon 
problem in welding 18-8. 

While these alloys are called steels, no martensite 
develops on cooling; neither can the alloys be hard- 
ened by the usual quenching and tempering. Con- 
trarily, they are put into their softest state by pro- 
ducing austenite at high temperature, with subse- 
quent quenching to retain it. Strengthening is devel- 
oped instead by cold working, and by adjusting the 
18-8 composition slightly downward toward 17-7. 
The austenite then tends to decompose inside the 
cold steel; the martensite crowds the slip planes 
loosened by cold working, locking them against fur- 
ther movement and thereby increasing the strength 
(Fig. 2). Hence, ferritic and martensitic steels are lit- 
tle strengthened by cold working, while austenitic 
steels are greatly strengthened by it, notably type 
301. 
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Fig. 2. Graph of the effect of cold-work on tensile strength 
of stainless steels. 1000 psi = 6.9 MPa. 


Precipitation hardening. Exploiting this approach to 
strengthening, and following the example of dura- 
lumin in a neighboring metallurgical field, metallur- 
gists have developed the fourth group of stainless 
steels known as precipitation hardening. A score 
of alloys became available within two decades of 
their commercial acceptance, and trade names rather 
than generic designations are still commonly used. 
The first listed in the table is subclassed marten- 
sitic because the metal behaves primarily as marten- 
site which further experiences internal precipita- 
tion; the second is typical of what are called semi- 
austenitic alloys for the reason that they rather 
closely follow the instability pattern of 17-7; and 
the third belongs to the austenitic subgroup, with 
hardening essentially due to precipitation within 
austenite. The special properties are developed 
through carefully controlled heat treatments that al- 
ways begin with a “solution anneal” at some high 
temperature and are followed by internal precipi- 
tation at some lower temperature. The latter may 
have multiple stages, ranging down to subzero treat- 
ments, and effects of cold work are often also ex- 
ploited, particularly for the semiaustenitic alloys. 
These precipitation-hardening stainless steels are 
the hallmark since the third quarter of the twenti- 
eth century, bringing mechanical properties to lev- 
els unmatched by any other metal or alloy. Tensile 
strengths are actually exceeding the phenomenal fig- 
ure of 300,000 psi (2.1 gigapascals). 

Fabrication. As with steels in general, the so-called 
wrought stainless steels come from the melting fur- 
naces in the form of either ingot or continuously cast 
slabs. Ingots require a roughing or primary hot work- 
ing, which the other form commonly bypasses. All 
then go through fabricating and finishing operations 
such as welding, hot and cold forming, rolling, ma- 
chining, spinning, and polishing. No stainless steel 
is excluded from any of the common industrial pro- 
cesses because of its special properties; yet all stain- 
less steels require attention to certain modifications 
of technique. 

Hot working. Hot working is influenced by the fact 
that many of the stainless steels are heat-resisting 
alloys. They are stronger at elevated temperatures 
than ordinary steel. Therefore they require greater 
roll and forge pressure, and perhaps less reductions 
per pass or per blow. The austenitic steels are partic- 
ularly heat-resistant. 

Welding. Welding is influenced by another aspect of 
high-temperature resistance of these metals—the re- 
sistance to scaling. Oxidation during service at high 
temperatures does not become catastrophic with 
stainless steel because the steel immediately forms 
a hard and protective scale. But this in turn means 
that welding must be conducted under conditions 
which protect the metal from such reactions with the 
environment. This can be done with specially pre- 
pared coatings on electrodes, under cover of fluxes, 
or in vacuum; the first two techniques are partic- 
ularly prominent. Inert-gas shielding also character- 
izes widely used processes, among which at least a 
score are now numbered. As for weld cracking, care 


must be taken to prevent hydrogen absorption in 
the martensitic grades and martensite in the ferritic 
grades, whereas a small proportion of ferrite is al- 
most a necessity in the austenitic grades. Metallurgi- 
cal “phase balance” is an important aspect of welding 
the stainless steels because of these complications 
from a two-phase structure. Thus a minor austen- 
ite fraction in ferritic stainless can cause martensitic 
cracking, while a minor ferrite fraction in austenite 
can prevent hot cracking. However, the most dan- 
gerous aspect of welding austenitic stainless steel 
is the potential “sensitization” affecting subsequent 
corrosion, to be discussed later. See WELDING AND 
CUTTING OF MATERIALS. 

Machining and forming. These processes adapt to all 
grades, with these major precautions: First, the stain- 
less steels are generally stronger and tougher than 
carbon steel, such that more power and rigidity 
are needed in tooling. Second, the powerful work- 
hardening effect (Fig. 2) gives the austenitic grades 
the property of being instantaneously strengthened 
upon the first touch of the tool or pass of the roll. Ma- 
chine tools must therefore bite surely and securely, 
with care taken not to “ride” the piece. Difficult form- 
ing operations warrant careful attention to variations 
in grade that are available, also in heat treatment, for 
accomplishing end purposes without unnecessary 
work problems. Spinning, for example, has a type 
305 modification of 18-8 which greatly favors the 
operation; while at the opposite extreme is type 301 
or 17-7, for those who wish to take advantage of the 
strengthening due to cold working. See MACHINABIL- 
ITY OF METALS. 

Finishing. These operations produce their best ef- 
fects with stainless steels. No metal takes a more 
beautiful polish, and none holds it so long or so well. 
Stainlessness is not just skin-deep, but body-through. 
And, of course, coatings are rendered entirely unnec- 
essary. At this point the higher initial cost of these al- 
loys rapidly falls behind the long-range cost of other 
metals, often making them cheaper to buy on the 
market. 

Service. Service of the stainless steels covers al- 
most every facet of modern metallurgical engineer- 
ing. They began in the chemical industry for rea- 
sons of their corrosion resistance, then graduated 
into high-temperature service for reasons of resis- 
tance to both surfacial sealing and internal creep. 
They have finally proliferated in every field of 
mechanical operation, ranging from high strength 
and hardness in combination with good ductil- 
ity, at elevated as well as cryogenic temperatures, 
through cutlery and spring and nonmagnetic materi- 
als, to the purely ornamental applications of modern 
architecture. 

However, when stainless steel does break down, 
the action is usually dramatic, and nothing exceeds 
the phenomenology of corrosion in the chemical in- 
dustry. The situation can be understood by recog- 
nizing that the passivity of the stainless steels—that 
which prevents or restrains their reaction with envi- 
ronmental chemicals—is not the nonactivity or no- 
bleness which characterizes the noble metals of the 


platinum group. It comes as a surprise to the nonsci- 
entists that the chromium content makes stainless 
steel even more disposed to rust than plain iron. But 
it is the very tendency which immediately fixes the 
oxygen or rust so tightly upon the surface that a pro- 
tective coating results. The same quality applies to 
titanium and aluminum and their alloys. If something 
disturbs the quite special conditions needed for this 
passivity, corrosion takes place. The damage then 
usually takes one of the three major forms illustrated 
in Fig. 3. 

Sensitization. Some phenomenon, not yet fully un- 
derstood, causes chromium carbide to precipitate in 
the grain boundaries of the austenitic grades when 
heated even for short periods of time in the range 
800-1500°F (427-816°C); these grain boundaries 
thereupon become so susceptible to chemical at- 
tack that a piece of formerly solid steel can fall apart 
in the hand like granulated sugar. The most work- 
able explanation is that the carbon has depleted the 
chromium content in forming the carbide, thus leav- 
ing the metal locally nonstainless. Because welding 
necessarily produces a heat zone alongside the bead 
which somewhere must cover this critical range, 
welded steels often are susceptible to sensitization 
when later subjected to chemical service. The prin- 
cipal cures thus far are as follows: (1) The carbon is 
lowered to values where carbide cannot form; this 
has been accomplished with the introduction of type 
304L. (2) Other and even more powerful carbide 
formers—notably titanium (Ti) and niobium (Nb)— 
have been added to the alloys to tie up carbon at 
high temperatures before chromium carbide forms. 
Nevertheless, the problem persists as the principal 
concern in the chemical service of austenitic stain- 
less steel. 

Stress-corrosion cracking. Known for a hundred or 
more years in the brass industry as season cracking, 
the same phenomenon seems to have been discov- 
ered in stainless steel. In marked contrast to the in- 
tergranular nature of sensitization, stress-corrosion 
cracking cuts right through the body of the grain; 
and although the austenitic steels are by far the most 
susceptible, with the martensitic and ferritic follow- 
ing in that order, it appears that any metal or alloy 
can probably be made to fail in this manner if sub- 
jected to the right chemical under appropriate con- 
ditions of stress. Present corrections or preventions 
lie principally in directions of reducing stress levels, 
both internal and external, and avoiding exposure 
to chemicals known to be particularly dangerous. A 
close approach to immunity is indicated through the 
use of highly purifying melting processes in steel- 
making. 

Pit and crevice corrosion. These are forms specifically 
following from oxygen “starvation” in local areas. 
When the steel can no longer rust, the passive con- 
dition converts to an active one; and to worsen the 
situation, all of the remaining surface then takes on 
the role of cathode in a galvanic cell which concen- 
trates all the anodic or dissolving activity right at the 
tiny point of breakdown. If this is just a local spot, 
the attack takes the form of pitting, but if two flat 
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Fig. 3. Three great corrosion effects on stainless steel. 
(a) Sensitization. (b) Stress-corrosion cracking. (c) Pit 
corrosion. 


surfaces come close together as in bolt-and-washer 
assemblies, the thin film of entrapped liquid can run 
out of oxygen and cause a general breakdown called 
crevice corrosion. See CORROSION. 

Production. Stainless steel has undergone revolu- 
tionary changes in its melting technology. There are 
two major problems in the melting of stainless steel 
which distinguish it markedly from melting ordinary 
steel: chromium must be brought to high levels, and 
carbon levels must be held low. With minor excep- 
tions in the high-carbon martensitic grades, stain- 
less cannot be produced in ordinary fuel-fired fur- 
naces such as the open hearth. The technology arose 
solely because of the closer controls possible with 
electrically heated furnaces, specifically the arc fur- 
nace where tonnage production is concerned. Thus 
stainless melting arose historically out of ordinary 
steelmaking by graduating into specialty furnaces. 
Following World War II the technological demands 
upon stainless steel pressed the technology into still 
greater specialization, such that steelmaking in this 
field displayed the most brilliant technical achieve- 
ments in the history of metallurgy. 

Ordinary steelmaking is concerned only with re- 
moval of carbon from the liquid metal, and this is 
easily accomplished by the addition of iron ore—the 
carbon seizes the oxygen and escapes as a gas. But 
chromium also reacts strongly with oxygen, thereby 
removing itself from the metal to join the slag. There- 
fore, this branch of steelmaking had to develop a two- 
stage process: melt down ordinary steel and drive off 
the carbon, and add the chromium after the oxidizing 
of carbon was completed. This in turn brought two 
severe handicaps: scrap stainless steel—the cheapest 
chromium carrier—could not be used in the first- 
stage charge because the chromium would be lost; 
and an expensive chromium-rich alloy with low car- 
bon content would have to be added in the second 
stage—so-called low-carbon ferrochrome. See STEEL 
MANUFACTURE. 

In the 1930s Alexander Feild developed the rust- 
less process in Baltimore, based upon the discovery 
that the competition between chromium and car- 
bon for oxygen shifts strongly in favor of carbon at 
very high temperatures. By using chromium oxide 
itself for the hearth lining and then carefully press- 
ing the arc heating some hundreds of degrees above 
normal operating temperatures, Feild found that the 
first-stage charge could be loaded not only with scrap 
but also with high-carbon ferrochrome, which is 
much cheaper than the low-carbon grade. Minimal 
chromium losses to the slag were then “kicked back” 
with a shower of ferrosilicon, which easily took the 
oxygen away from the chromium and left silicates in 
the slag, while returning metallic chromium to the 
bath. 

In the 1940s an abundance of liquid oxygen be- 
came available from developments in war industries, 
and E. J. Chelius developed the idea of the “oxygen 
lance.” A long, insulated steel pipe was plunged be- 
neath the surface of the molten steel, churning the 
metal with pure oxygen. The difference between 
iron oxide and gaseous oxygen, with respect to 


oxidizing a steel bath, is that the latter is a gas, and 
so is carbon oxide. The bubbling of the one aids 
the escape of the other, while to chromium oxide 
it makes no difference. Furthermore, the exceed- 
ingly high temperatures developed locally around 
the lance brought the advantage of the Feild effect in 
removing carbon while conserving chromium. The 
1950s then witnessed a virtual conversion of the en- 
tire industry to the oxygen lance, spreading into the 
open hearth as well as the furnaces used for mak- 
ing stainless steel. At last all stainless could be made 
with high scrap additions in the charge, right up to 
100%. As for carbon contamination and the use of 
high-carbon ferrochrome, the higher the first-stage 
carbon the better. It served as a fuel, and a conve- 
nient source of heat during lancing. 

Scarcely had these achievements come into 
widespread use when successful development of a 
porous refractory plug permitted blowing a cleans- 
ing gas through the bottom of a ladle. At first this 
technique involved only the use of an inert gas, such 
as argon, for removing damaging quantities of hydro- 
gen often dissolved in steels of all kinds. But shortly 
the method evolved to the use of oxygen as a sub- 
stitute for lancing. However, the violence of the re- 
action, from entry so deep in the liquid, made it 
necessary to dilute the oxygen with an inert gas. 
Again argon was chosen, and the argon-oxygen pro- 
cess was born. New achievements were electrifying: 
(1) Almost the entire electric furnace processing was 
dispensed with, the furnace being needed only to 
melt down the charge, with all further work done in 
the ladle. The expensive furnace time was cut by fac- 
tors of three and four, or more. (2) The vastly greater 
facilities for quality control in a ladle, as compared to 
a furnace, shortly permitted the further bridging of 
this technology with the long-developed and highly 
sophisticated vacuum techniques of physics and lab- 
oratory furnaces. The argon-oxygen process then de- 
veloped variations using vacuum chambers and re- 
duced pressures instead of positive pressures, and in 
some cases substituting nitrogen or other gases for 
argon. 

Concurrently two major developments entered 
the field, this time from directions of those highly 
specialized heat-resistant metals and alloys known as 
the superalloys and the refractory metals. The super- 
alloys generally comprise systems based upon nickel, 
cobalt, or titanium, as well as highly complex stain- 
less steels, while the term “refractory metals” typi- 
cally refers to wolfram (tungsten), tantalum, molyb- 
denum, and niobium (columbium). Frequently the 
superalloys have quality demands so high as to re- 
quire remelting a cast form in order to purify it a 
second time. One process arose naturally out of the 
vacuum technology just mentioned, and the more 
critical grades of stainless steel are now often sub- 
jected to vacuum-arc remelting. The original melt is 
cast in the form of a huge electrode which, as with 
a consumable welding electrode, melts off at the tip 
and falls into a water-cooled mold in a vacuum cham- 
ber. The metal is thus purified, and its solidification 
structure is very favorable to good mechanical prop- 
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erties. The second process also arose out of a welding 
concept, namely the use ofa slag cover to protect the 
metal from environmental reactions while molten. 
First conceived by R. E. Hopkins in America several 
decades ago, the process was developed in the So- 
viet Union, then captured worldwide interest in the 
1960s. Again the metal in electrode form is remelted 
electrically, but under a carefully prepared slag blan- 
ket instead of a vacuum. Solidification is again well 
controlled, and the metal is highly purified. See HIGH- 
TEMPERATURE MATERIALS. 

Technological advances continued at a rapid rate, 
with such sophisticated developments as melting 
with electron beams, plasma arcs, and lasers—even 
levitation melting, where crucibles or containers are 
no longer necessary. The metal just suspends itself in 
mid-air! While levitation melting remains in curiosity 
stages and lasers still serve principally for punching 
holes and welding, the plasma-arc technique did en- 
ter commercial usage in welding, with beginnings 
in steelmaking. The electron-beam furnace has led 
to remarkable accomplishments in producing com- 
mercial quantities of certain ferritic stainless steels 
whose properties were previously believed impossi- 
ble. 

Most stainless is cast as ingots, although contin- 
uous casting is rapidly becoming popular. Virtually 
all wrought grades have their casting counterparts 
in the foundry industry, separately standardized by 
the Alloy Casting Institute (ACD. The metallurgy of 
the cast alloys is essentially the same as that for the 
wrought, with the principal exception that certain 
liberties can be taken in alloy composition because 
the castings are not hot-worked. In designing casting 
alloys, one can forget about mechanical problems 
of forming the part and concentrate on the chemi- 
cal challenges of corrosion resistance and heat resis- 
tance. See METAL CASTING. Carl A. Zapffe 

Bibliography. American Society for Metals, Metals 
Handbook, Properties and Selection: Irons and 
High Performance Alloys, 10th ed., vol. 1, 1990; J. R. 
Davis (ed.), Stainless Steels, 1994; R. A. Lula, Stain- 
less Steel, 1985; E B. Pickering (ed.), The Metallur- 
gical Evolution of Stainless Steels, 1979; L. L. Shreir 
(ed.), Corrosion, vols. 1 and 2, 3d ed., 1994. 


E) 
Stalactites and stalagmites 


Stalactites, stalagmites, dripstone, and flowstone are 
travertine deposits in limestone caverns (see illus.), 
formed by the evaporation of waters bearing calcium 
carbonate. Stalactites grow down from the roofs 
of caves and tend to be long and thin, with hol- 
low cores. The water moves down the core and pre- 
cipitates at the bottom, slowly extending the length 
while keeping the core open for more water to move 
down. Stalactites are banded concentrically to the 
center. 

Stalagmites grow from the floor up and are com- 
monly found beneath stalactites; they are formed 
from the evaporation of the same drip of water 
that forms the stalactite. Stalagmites are thicker and 
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Stalactite and stalagmite deposits. Along the central 
opening are deposits made by water as it dripped from the 
crack in the limestone. Outside layers were deposited by 
water flowing over the surface. Water may drip to the floor 
and create a stalagmite. (After E. B. Branson et al., 
Introduction to Geology, 3d ed., McGraw-Hill, 1952) 


shorter than stalactites and have no central hollow 
core. The banding of stalagmites is parallel to the 
surface at the time of deposition of each band. 

Dripstone and flowstone are travertine accumu- 
lations of much the same origin as stalactites and 
stalagmites (evaporation from cave waters) but are 
irregular. Their irregularity is due to deposition on 
irregular sloping walls or surfaces in the cave rather 
than drips from the roof. See CALCITE, CAVE; LIME- 
STONE; TRAVERTINE. Raymond Siever 

Bibliography. D. Gillieson, Caves: Processes, Devel- 
opment, Management, 1997; C. A. Hill and P. Forti, 
Cave Minerals of the World, 2d ed., 1997. 


a EN 
Stall-warning indicator 


A device that determines the critical angle of at- 
tack for a given aircraft, at which point the lift 
stops increasing and the aircraft will no longer sus- 
tain itself in steady-state condition (level flight or 
climb/descent). The indicator usually operates from 
vane sensors, airflow pressure sensors, tabs on the 
leading edge of the wings, and computing devices 
which include accelerometers, airspeed detectors, 
and vertical gyros. The indicator may be a pointer 
over a dial, lights, or an audible signal. In some cases, 
the computer may actuate a control-column shaker 
to alert the pilot. See AIRCRAFT INSTRUMENTATION; 
AIRPLANE; AUTOPILOT. James W. Angus 


C 
Standard (physical measurement) 


An accepted reference sample which is used for es- 
tablishing a unit for the measurement of physical 
quantities. A physical quantity is specified by a nu- 
merical factor and a unit; for example, a mass might 
be expressed as 8 g, a length as 6 cm, and a time in- 
terval as 2 min. Here the gram is a mass unit defined 
in terms of the international kilogram, which serves 
as the primary standard of mass. The centimeter is 
defined in terms of the international meter, which is 
the primary standard of length and is defined as the 
length of path traveled by light in a vacuum during 
a time interval of 1/299 792 458 of a second. In sim- 
ilar fashion, the minute is a time interval defined as 
60 s, where the second is the international standard 
of time and is defined as the duration of 9 192 631 
770 periods of the radiation corresponding to the 
transition between the two hyperfine energy levels 
of the ground state of the cesium-133 atom. 

The National Institute of Standards and Technol- 
ogy in the United States and comparable labora- 
tories in other countries are responsible for main- 
taining accurate secondary standards for various 
physical quantities. See ELECTRICAL UNITS AND STAN- 
DARDS; LIGHT; METRIC SYSTEM; PHYSICAL MEASURE- 
MENT; TIME. Dudley Williams 


aT 
Standard model 


The theory that explains the three major interac- 
tions of elementary particle physics—the strong in- 
teraction responsible for nuclear forces, the weak in- 
teraction responsible for radioactive decay, and the 
electromagnetic interaction—in terms of a common 
physical picture. The model for this picture is quan- 
tum electrodynamics, the fundamental theory un- 
derlying electromagnetism. In that theory, electrons, 
viewed as structureless elementary constituents of 
matter, interact with photons, structureless elemen- 
tary particles of light. The standard model extends 
quantum electrodynamics to explain all three inter- 
actions of subnuclear physics in terms of similar basic 
constituents. 

Quantum electrodynamics (QED). In the 1940s, 
Richard P. Feynman, Julian S. Schwinger, and Shin- 
Ichiro Tomonaga constructed quantum electrody- 
namics, the quantum theory of the interactions of 
electrons with electromagnetic fields. They repre- 
sented the electron as a pointlike particle, possess- 
ing no size but only mass and intrinsic angular 
momentum, or spin. Similarly, they represented elec- 
tromagnetic waves as composed of pointlike quanta, 
photons, possessing spin but no mass. They assigned 
these particles the simplest equations of motion con- 
sistent with Einstein’s special relativity, and postu- 
lated also the simplest relativistic coupling which al- 
lowed electrons to emit and absorb photons one at a 
time. This showed that these ingredients reproduce 
the observed properties of electromagnetism and its 
interaction with electrons. The experimental vali- 
dation of this theory has only improved with time. 


Computations in quantum electrodynamics of the in- 
trinsic magnetic field of an electron agree with exper- 
iment to the level of parts per trillion. See ELECTRON; 
LIGHT; PHOTON; QUANTUM ELECTRODYNAMICS. 

Spin. It may seem strange that a particle of zero 
size can have intrinsic spin. This is one of the un- 
expected and profound consequences of combining 
relativity theory and quantum mechanics. In quan- 
tum mechanics, angular momentum associated with 
motion takes only the discrete values which are mul- 
tiples of h, Planck’s constant divided by 27. Rela- 
tivistic particles can also have intrinsic angular mo- 
mentum, and this may come in either units or half- 
units of ñ. In quantum theory, a relativistic particle 
is viewed as a quantum of the quantized oscillations 
of fields extending throughout space. For each pos- 
sible value of the spin, there is a unique simplest 
form for the equations of propagation of the field. 
The very simplest, the Klein-Gordon equation, is as- 
sociated with a spinless (spin-0) particle. For a parti- 
cle with intrinsic angular momentum of one unit of 
h (spin 1), the required propagation equations are 
precisely Maxwell’s equations for electromagnetic 
waves. For a particle with a spin of half a unit of 
h (spin 1/,), the characteristic equation is called the 
Dirac equation. Particles whose spin is an integer 
multiple of A are called bosons; particles with an 
extra half-unit of spin are called fermions. It is a con- 
sequence of relativistic quantum mechanics that mul- 
tiple bosons can occupy the same location. When 
present in great numbers, these particles can form 
macroscopic fields, such as the electric field. On the 
other hand, fermions in the same spin state are for- 
bidden from occupying the same location in space. 
This property accounts for the Pauli exclusion prin- 
ciple, which leads to the rigidity of atoms and of 
atomic nuclei. It is thus natural to build up matter 
from elementary fermions, and to represent fields of 
force by elementary bosons. In quantum electrody- 
namics, the electrons have spin '/, and the photons 
have spin 1. See BOSE-EINSTEIN STATISTICS; EXCLU- 
SION PRINCIPLE; FERMI-DIRAC STATISTICS; MAXWELL’S 
EQUATIONS. 

The spin also affects the ability of a particle to 
obtain mass. For spin-0 particles, there is a zero-mass 
Klein-Gordon equation, but one may add mass simply 
by adding one extra term to this equation. For spin 
1, the simplest Dirac equation involves two mass- 
less states; a particle which spins in the left-handed 
sense and an antiparticle which spins in the right- 
handed sense. The electron of quantum electrody- 
namics is described by putting together two of these 
pairs of states, to make up the left- and right-handed 
quantum states of the electron and the left- and right- 
handed states of the antielectron or positron. Only 
when the two halves are present is it possible to put 
a mass into the equation of motion. As an outcome, 
the electron and positron are predicted to have 
equal mass and opposite electric charge. (These rela- 
tions are confirmed experimentally to an accuracy of 
40 parts per billion.) The equation of motion for 
spin-1 particles does not allow a mass to be added at 
all, except under a special circumstance, explained 
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later, in which the spin-1 particle couples to a spin- 
O particle or field. This barrier can explain why the 
photon has zero mass. (Experimentally, the photon 
is observed to have mass less than 107? times the 
electron mass.) See RELATIVISTIC QUANTUM THEORY; 
SPIN (QUANTUM MECHANICS); WAVE EQUATION. 

Quarks and leptons. The subnuclear world con- 
tains many more constituents than electrons and 
photons. First of all, two heavier spin-!/, particles 
have been observed which have the same electro- 
magnetic charge as the electron (e) but larger mass. 
These particles, the u and t, are unstable; they decay 
to the electron with the simultaneous emission of 
very light, uncharged spin-!/2 particles called neutri- 
nos. Experiments have demonstrated that there is 
one neutrino species for each of the three charged 
particles e, 4, and t. The three charged and three 
neutral fermions and their antiparticles are collec- 
tively called leptons. See LEPTON; NEUTRINO. 

Next, experiments have observed a very large 
number of particles which experience the strong 
interactions. These particles, collectively called 
hadrons, include the proton and neutron, the z and 
K mesons, and many short-lived heavier species. It 
has been established that the hadrons are compos- 
ite systems built from more elementary fermions, 
called quarks. The known list of hadrons can be built 
up from six species of quarks: u (up), d (down), s$ 
(strange), c (charm), b (bottom), and t (top). (The 
t quark was discovered in 1995.) For example, the 
proton is a bound state of three quarks (uud); the 
x* meson is a bound state of a quark and an an- 
tiquark (u-d). To give the correct electric charge 
assignments for the hadrons, the quarks must be 
assigned fractional electric charges, 7/,; or —1/3. In 
addition, each species of quark must have three pos- 
sible states of an invisible quantum number, called 
color. Hadrons built of three quarks, called baryons, 
must contain one quark of each color. With this rule, 
the bound states of three quarks consistent with the 
Pauli exclusion principle give precisely the observed 
set of baryons. See BARYON; COLOR (QUANTUM ME- 
CHANICS); HADRON; MESON; QUARKS. 

Experiments at very high energy have confirmed 
the quark picture of hadrons. The pattern of scat- 
tering of high-energy electrons and neutrinos from 
protons confirms that quarks have fractional electric 
charge and spin !/,. When electrons and positrons 
collide at high energy and annihilate, the resulting 
burst of energy produces hadrons at a rate three 
times larger than the simple prediction from elec- 
trodynamics, indicating the presence of the three 
color states. The hadrons form two back-to-back jets 
of particles, following the directions that would be 
preferred by spin-!/. quarks and antiquarks. 

In the standard model, all matter is built from the 
quarks and the leptons. These 12 fermions organize 
themselves into three families, each containing the 
same set of electric charges. Table 1 shows this or- 
ganization and gives the mass of each particle. The 
masses of the u, d, and s quarks are quite uncer- 
tain; because most of the rest energy of a proton, for 
example, comes from the strong-interaction binding 
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TABLE 1. Elementary matter particles of the standard model* 














neutrinos have very small masses. 





Leptons 
Quarks 
Charge = 0 
Charge = 2 Charge = =4 (neutrinos) Charge = —1 
u 3 d 5 Ve e 0.5 
C 1200 S 100 Vy H 105.7 
t 174,000 b 4200 Vy T 1777.0 


“The number next to each charged particle shows its observed mass, in MeV. (The scale may be referred to the proton mass: Mp = 938 MeV.) The 








rather than from the intrinsic masses of the quarks, 
these values must be inferred indirectly. Neverthe- 
less, there is a clear hierarchical pattern to the spec- 
trum of masses. The origin of this pattern is still a 
mystery. 

Symmetry groups and quantum states. To discuss 
further the relations between the various quarks and 
leptons, it is necessary to review how symmetry op- 
erations are treated in quantum mechanics. Rotations 
in space provide a convenient example. A classical 
arrow can point in any direction; a classical rotation 
moves the arrow from one definite direction to an- 
other. In quantum mechanics, the Heisenberg uncer- 
tainty principle forbids one from specifying the ori- 
entation of an arrow. Instead, one starts with a fixed 
number of quantum states corresponding roughly to 
different orientations. The number of these states 
depends on the angular momentum. For example, a 
system with angular momentum 1, in units of i, has 
three possible quantum states; a system with spin '/, 
has two. If the system starts in one of these states and 
is rotated, the result is a weighted sum of the possible 
states. By rotating far enough, it is possible to convert 
the original state completely into one of the others. 
Conversely, a rotation in space can be thought of 
as built up out of three basic operations: small rota- 
tions about the x, y, and z axes (Fig. 1). A continuous 
family of symmetry operations built up in this way is 
called a Lie group. The basic operations are called the 
generators of the group. The mathematical notion of 
a Lie group was actually invented in the nineteenth 
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Fig. 1. Basic rotations in three-dimensional space. 


century. After the discovery of quantum mechanics, 
Hermann Weyl and Eugene P. Wigner found that this 
description of symmetries is the one that fits natu- 
rally into quantum theory. See GROUP THEORY; LIE 
GROUP; SYMMETRY LAWS (PHYSICS). 

To understand the symmetry structure of the stan- 
dard model, imagine setting all of the quark and lep- 
ton masses equal to zero. Then, as far as the infor- 
mation in Table 1 is concerned, the three families of 
quarks and leptons would be exactly equivalent. In 
this limit, there are four types of symmetry opera- 
tions that could be applied to the states in the table. 
The most obvious, called a flavor rotation, transforms 
or rotates the states of one family into those of an- 
other. Another possible operation transforms the lep- 
tons into the quarks of one particular color. A third, 
called a color rotation, transforms the quarks of one 
color state to those of another. A fourth, called an 
isospin rotation, transforms the two quarks of each 
family into one another, and, similarly, the two lep- 
tons. These last two operations are especially impor- 
tant for the structure of the standard model. These 
operations have a mathematical structure equivalent 
to that of the group of rotations in a space with, 
respectively, three and two complex-valued coordi- 
nates. Mathematicians refer to these rotation groups 
as SU(3) and SU(@2). The two groups are generated, 
respectively, from eight and three basic operations, 
in the sense discussed at the end of the previous 
paragraph. 

Yang-Mills theory. In quantum electrodynamics, 
photons do not interact with one another, except 
indirectly through their coupling to matter. It can be 
shown that relativity and quantum mechanics put se- 
vere constraints on the possible interactions of spin- 
1 bosons. The unique consistent prescription was 
discovered by Chen Ning Yang and Robert L. Mills 
in 1954. To construct a Yang-Mills theory, one be- 
gins with a Lie group and assigns a spin-1 particle 
to each generator of the group. Matter particles are 
assigned to quantum states which are rotated into 
one another by the group operations. Then the spin- 
1 bosons are coupled to matter in such a way that 
the emission or absorption of such a boson converts 
one matter state into another according to the group 
transformation law. This prescription also causes the 
bosons to be coupled to one another: For rotations 
in space, as can be seen from Fig. 1, a rotation about 
x causes a rotation about y to be transformed into a 
rotation about z. Then the spin-1 boson associated 


with x rotations can absorb a spin-1 boson associ- 
ated with y and transform it into the spin-1 boson 
associated with z. The probability for boson absorp- 
tion or emission is governed by a number g called 
the coupling constant, which generalizes the elec- 
tric charge of electrodynamics. In fact, electrody- 
namics can be considered as a particularly simple 
case of Yang-Mills theory in which the Lie group is 
the group of rotations of a circle, which has only 
one generator; mathematicians call this group UC). 
In the 1970s, Gerardus ’t Hooft and Martinus J. G. 
Veltman, Andrei A. Slavnov, John C. Taylor, Benjamin 
Lee, and others showed that the Yang-Mills prescrip- 
tion gives a well-defined quantum theory of spin- 
1 bosons, and that prescriptions which lack some 
elements of this structure lead to inconsistencies. 
The spin-1 bosons which play the central role in 
Yang-Mills theory are called gauge bosons. See GAUGE 
THEORY. 

The standard model can now be defined as the the- 
ory of elementary particles in which matter is con- 
structed from the spin-'/ fermions in Table 1 and the 
interactions of these particles are generated from a 
Yang-Mills theory based on the product of three in- 
dependent groups of symmetry operations, SUG) x 
SU(2) x UC). The SU(@) group is taken to be the 
group of color rotations, as defined at the end of the 
previous section. The SU(2) group is taken to be the 
group of isospin rotations, but acting only on the 
left-handed quark and lepton states. The U(1) factor 
refers to one extra gauge boson similar to the photon 
of quantum electrodynamics. Using the mathemati- 
cal structure described by Yang and Mills, this pre- 
scription leads to a compact set of equations describ- 
ing massless fermions and massless spin-1 bosons. 
Table 2 lists, on the left, the three component sym- 
metry groups of the standard model and a measure 
of their intrinsic coupling strengths; on the right, 
the three corresponding interactions of subnuclear 
physics, with their relative strengths and the spin-1 
gauge bosons that mediate each. 

Quantum chromodynamics (QCD). At this stage, it 
may seem that the standard model is an elegant phys- 
ical theory, but it is not apparent that the model has 
anything to do with nature. The strong and weak in- 
teractions are both short-range; in quantum theory, 
this implies that they are mediated by massive, not 
massless, bosons. And a symmetry which acts only 
on left-handed and not right-handed fermions pre- 
cludes combining these states to form the observed 
massive quarks and leptons. To show that this math- 
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ematical theory does describe nature requires new 
physical mechanisms. 

The strong interactions will be considered first. In 
the standard model, these interactions are described 
by an SUG) Yang-Mills theory acting on color, called 
quantum chromodynamics. The eight gauge bosons 
are called gluons. The most important qualitative fea- 
ture of the strong interactions is that the observed 
physical particles, the hadrons, are bound states of 
quarks, while the individual quarks are not observed. 
Remarkably, this feature can be derived from quan- 
tum chromodynamics when its quantum-mechanical 
aspects are fully taken into account. 

In quantum theory, the size of charges or coupling 
constants is not fixed but can vary as a function of the 
energy scale. In quantum electrodynamics, the elec- 
tric charge e is fixed at macroscopic distances but 
becomes larger at very small distances. This is the 
result of an effect called vacuum polarization. Since 
particle-antiparticle pairs can be created by quantum 
processes, such pairs are constantly being created 
and destroyed, everywhere in space, by quantum- 
mechanical fluctuations. If there is an electric field 
present, the electrons and positrons created by these 
fluctuations will move to partially screen out the 
field. Then the field observed at larger distances will 
be smaller. An electron-positron pair in such a fluc- 
tuation can have any separation up to the electron 
Compton wavelength, 4 x 107!? m. If two charged 
objects are separated by a distance smaller than this, 
they will be inside the fluctuating cloud of particles 
and antiparticles and will feel a stronger electric in- 
teraction. The strength of the electromagnetic force 
can be expressed by the dimensionless ratio called 
the fine structure constant, œ~ = e?/ħic, where c is 
the speed of light. At macroscopic distances, œ = 
1/137, but in experiments in which high energies 
are used to bring electrons and positrons to within 
distances less than 107!8 m, the value œ = 1/129 is 
observed. 

In Yang-Mills theories in which the gauge bosons 
interact directly, there is a compensatory quantum 
effect in the opposite direction. Through this effect, 
the interaction of a gauge electric field with a quan- 
tum fluctuation can reinforce its strength. This effect, 
called asymptotic freedom because it implies that the 
coupling constant is smaller at short distances, was 
discovered in 1973 by David Politzer, David J. Gross, 
and Frank Wilczek. In quantum chromodynamics, 
this effect overwhelms the vacuum polarization ef- 
fect and makes the coupling grow dramatically as 





TABLE 2. Elementary interactions of standard model 





Symmetry group Coupling strength (œ)* 
SU(3) 1/8.4 
SU(2) 1/29.6 
U(1) 1/98.4 


Bosons Interaction Strength 
Gluons Strong i! 
Photon Electromagnetict Ome 
Wt,wW>, 2° Weak?+ 105° 





g =g ie: 





*Coupling strengths are given at a high energy, equal to the rest energy of the Z boson, 91,000 MeV, and are written as pure numbers by defining 


tWeak and electromagnetic interactions are combinations of the basic SU(2) and U(1) interactions, as explained in text. 
+ The small strength of the weak interactions is due to the large masses of the W and Z bosons. 
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objects with color are separated. States with widely 
separated quarks or gluons acquire very high energy, 
so that it is always energetically favorable to create ad- 
ditional quark-antiquark pairs to neutralize the color 
charges. Then these configurations decay rapidly to 
quark and antiquark bound states. The separation at 
which the quantum chromodynamics coupling be- 
comes strong, which sets the size of typical hadrons, 
is about 10- m. This corresponds to a binding en- 
ergy of roughly mpc’, where mp is the proton mass. 
The massive bosons which mediate the nuclear force 
are then quark-antiquark bound states, mainly x and 
p mesons, which acquire their mass from their inter- 
nal quantum chromodynamics energy. 

Asymptotic freedom implies that experiments at 
energies well above the proton rest energy should 
be able to see quarks and gluons. At the end of 
the discussion of quarks, it was noted that the 
hadrons resulting from the annihilation of electrons 
and positrons at high energy form pairs of collimated 
jets that follow the directions expected for a quark- 
antiquark pair. The detailed properties of a jet indi- 
cate that it originates as a single quark or antiquark 
and acquires its complexity from the production of 
additional pairs along the gauge electric field direc- 
tion. In 1979, experiments at the German accelerator 
PETRA observed events with three jets of hadrons, 
with the distribution of the third jet in energy and 
angle very similar to that expected for radiation of 
photons from the produced quark and antiquark. 
These additional jets originated as radiated gluons. 
See GLUONS; QUANTUM CHROMODYNAMICS. 

Higgs mechanism. The massive particles which 
mediate the weak interactions appear experimen- 
tally to be spin-1 bosons with the simple couplings 
of Yang-Mills gauge bosons. There are three of these 
massive particles, called Wt, W`, and Z°. Given 
the fundamental difficulties noted above in writing 
a mass term for spin-1 particle, it is not clear how 
these states could arise from a Yang-Mills theory. 
There is one way known in which Yang-Mills bosons 
can acquire mass, discovered in the 1960s by Pe- 
ter W. Higgs and a number of others. Though in a 
Yang-Mills theory the basic equations must be sym- 
metric, it is possible that nature realizes these equa- 
tions in an asymmetrical way. For example, with 
respect to rotations in a three-dimensional space 
(Fig. 1), if we lived in a medium which randomly 
chose a preferred direction in this space, say, the z 
direction, only the rotation about z would be rec- 
ognizable as a symmetry. The other two rotation 
symmetries are said to be spontaneously broken. 
Such spontaneous symmetry breaking occurs in mag- 
netic materials when they choose an axis of magne- 
tization, and in crystals when they choose axes of 
alignment. 

If the symmetries that are spontaneously broken 
are those of a Yang-Mills theory, the corresponding 
gauge bosons become massive. For example, if some 
field took up an orientation that preferred the z axis 
in Fig. 1, the gauge bosons corresponding to x and y 
rotations would become massive, while the one as- 
sociated with z rotations would remain massless. It 


is possible to turn this picture into a candidate for a 
unified model of weak and electromagnetic interac- 
tions, in which the two massive bosons are the W+ 
and W~, and the massless spin-1 boson is interpreted 
as the photon. Experiment chooses a different the- 
ory built up around the same idea, constructed in the 
1960s by Sheldon L. Glashow, Steven Weinberg, and 
Abdus Salam (GWS). This theory makes use of the 
SU(2) and UC) pieces of the standard model Yang- 
Mills symmetry. This symmetry group has four gener- 
ators in all. Three of these are spontaneously broken 
and one remains unbroken. This gives the massive 
Wt, W-, and Z°, plus the massless photon. The field 
that is responsible for the symmetry breaking is still 
a complete mystery. In the simplest model, the sym- 
metry breaking can be due to one spin-0 field, called 
the Higgs field; the quanta of this field are called 
Higgs bosons. In this model, the Higgs field also is 
responsible for coupling the left- and right-handed 
components of quarks and leptons and thus for gen- 
erating the fermion masses. See HIGGS BOSON; SYM- 
METRY BREAKING. 

Electroweak interactions. The Glashow-Weinberg- 
Salam theory describes electroweak interactions. 
This theory contains and relates three different types 
of interactions: electromagnetism, mediated by the 
photon; charge-changing processes, mediated by the 
W bosons; and the neutral-current interaction, medi- 
ated by the Z boson. The latter two processes involve 
couplings that are stronger than those of electromag- 
netism, but they appear weak in nuclear physics be- 
cause the W and Z have large masses, 80,4000 and 
91,188 MeV, respectively, in units in which the pro- 
ton mass is 938 MeV. 

The charge-changing processes include the ra- 
dioactive beta-particle decay of nuclei, which corre- 
sponds in this picture to a d quark inside a neutron 
changing to a u quark and a quantum of the W~ field, 
which materializes as an electron and an electron an- 
tineutrino. The fundamental coupling involves only 
left-handed species, and this fact leads to predictions 
for the energy and angular distributions of the decay 
products which are in good accord with experiment. 
Similar predictions can be made for the products of 
more exotic reactions such as jz and t decay and the 
scattering of high-energy neutrinos, and again exper- 
iments confirm the simple picture in which only the 
left-handed fermion states couple to the W bosons. 
See RADIOACTIVITY. 

The neutral-current interaction, a short-ranged 
scattering interaction between fermions which sup- 
plements the electromagnetic interaction, is a char- 
acteristic feature of the Glashow-Weinberg-Salam 
theory. The charges by which the various fermions 
couple to this interaction, or to the Z boson, are given 
by the electric charges combined with an extra factor 
for left-handed species. The neutral-current interac- 
tion was first observed in the 1970s in experiments 
on the scattering of high-energy neutrinos. Accelera- 
tors at the Stanford Linear Accelerator Center (SLAC) 
in California and at CERN, in Geneva, Switzerland, 
have produced individual Z bosons by the annihila- 
tion of electrons and positrons and have observed 
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Fig. 2. Measurements of the rate for electron-positron 
annihilation into a Z° boson as a function of the collision 
energy, from the data of the OPAL experiment at CERN. 


the decay of the Z to all of the fermion species 
of the standard model (except for the very heavy 
t quark). The results confirm the Glashow-Weinberg- 
Salam charge assignments in detail. One piece of this 
confirmation begins from the fact that in quantum 
mechanics an unstable state with lifetime t has an 
uncertainty in energy given by h/t. Then a reaction 
which produces the unstable state proceeds over a 
range of energies, with the rate varying in a charac- 
teristic shape, called a resonance. Figure 2 shows 
the rate for electron-positron annihilation to form 
a Z boson, as measured by the OPAL experiment 
at CERN. The width of the resonance measures the 
lifetime of the Z. The agreement shown between 
theory and experiment actually tests all three of the 
interactions of the standard model—the weak inter- 
actions, because the decay rate of the Z depends 
on the coupling to each fermion species; the strong 
interactions, because the decay rate to quarks is en- 
hanced by the color factor 3 and another 4% correc- 
tion due to gluon effects; and the electromagnetic 
interactions, because the radiation of photons from 
the incident electrons and positrons produces the 
asymmetrical shape of the resonance curve. In this 
and other comparisons, the Z decay rate agrees with 
the prediction of the standard model to an accuracy 
of 0.1%. See ELECTROWEAK INTERACTION; INTERME- 
DIATE VECTOR BOSON; NEUTRAL CURRENTS; WEAK NU- 
CLEAR INTERACTIONS. 

Open questions. The standard model brings to- 
gether diverse aspects of the behavior of subnu- 
clear particles in a common framework. We can truly 
claim to understand the nature of the strong, weak, 
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and electromagnetic interactions. The explanation, 
though, sets up a number of new questions. The first 
among these is the nature of the agent that is respon- 
sible for spontaneously breaking SU(2) x UQ) sym- 
metry. Is it the Higgs field described above or a more 
complex set of interactions? Whatever this agent is, 
it is responsible for the masses of the heavy spin-1 
bosons W and Z, and for all of the quark and lep- 
ton masses. A related question is the origin of the 
pattern of fermion masses shown in Table 1. The 
origin of the Yang-Mills symmetry group SUG) x 
SU(2) x UQ) is also a mystery. In 1973, Howard 
Georgi and Glashow and Jogesh C. Pati and Salam 
proposed that this group orginates as the set of un- 
broken symmetries that remain when a larger sym- 
metry group, called the grand unification symmetry, 
is spontaneously broken at very short distances. The 
choice of SU(5) for the grand symmetry group ex- 
plains the pattern of electric charge assignments of 
the quarks and leptons. New measurements of the 
standard model coupling constants have turned out 
to be inconsistent with the simplest form of the grand 
unification hypothesis, but they fit the expectation 
if the assumption is added of an enhanced space- 
time symmetry called supersymmetry, which inter- 
connects bosons and fermions. See GRAND UNIFICA- 
TION THEORIES; SUPERSYMMETRY. 

At least a part of the answers to these ques- 
tions should be found at the next generation of 
particle accelerators. If there is a Higgs boson, its 
mass should not be far above the masses of the W 
and Z. If grand unification is correct, new particles 
predicted by supersymmetry, or another principle 
which extends the standard model, should also ap- 
pear. With luck, these experiments should lead to 
an even simpler and more all-encompassing mathe- 
matical description of nature. See PARTICLE ACCELER- 
ATOR. Michael E. Peskin 
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A genus of bacteria containing at least 28 species that 
are collectively referred to as staphylococci. Their 
usual habitat is animal skin and mucosal surfaces. Al- 
though the genus is known for the ability of some 
species to cause infectious diseases, many species 
rarely cause infections. Pathogenic staphylococci are 
usually opportunists and cause illness only in com- 
promised hosts. Staphylococcus epidermidis is a 
major component of the human skin flora. Staphy- 
lococcus aureus, the most pathogenic species, is 
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usually identified by its ability to produce coagulases 
(proteins that affect fibrinogen of the blood-clotting 
cascade). Since most other species of staphylococci 
do not produce coagulase, it is useful to divide 
staphylococci into coagulase-positive and coagulase- 
negative species. Coagulase-negative staphylococci 
are not highly virulent but are an important cause 
of infections in certain high-risk groups. Although 
Staphylococcus infections were once readily treat- 
able with antibiotics, some strains have acquired 
genes making them resistant to multiple antimicro- 
bial agents. See BACTERIA. 

Cell structure, physiology, and genetics. Staphylo- 
coccus cells are spherical with a diameter of 0.5- 
1.5 micrometers. Clumps of staphylococci resemble 
bunches of grapes when viewed with a microscope, 
owing to cell division in multiple planes. The staphy- 
lococci have a gram-positive (retain violet color of 
Gram stain after destaining with alcohol) cell com- 
position, with a unique peptidoglycan structure that 
is highly cross-linked with bridges of amino acids. 
See STAIN (MICROBIOLOGY). 

Most species are facultative anaerobes (grow in 
the presence or absence of oxygen). Within a sin- 
gle species, there is a high degree of strain variation 
in nutritional requirements. Staphylococci are quite 
resistant to desiccation and high-osmotic conditions 
and can grow in the presence of 10% sodium chlo- 
ride. These properties facilitate their survival in the 
environment, growth in food, and communicability. 

In addition to genetic information on the chro- 
mosome, pathogenic staphylococci often contain 
accessory elements such as plasmids, bacterio- 
phages, pathogenicity islands (DNA clusters contain- 
ing genes associated with pathogenesis), and trans- 
posons. These elements harbor genes that encode 
toxins or resistance to antimicrobial agents and may 
be transferred to other strains. Genes involved in vir- 
ulence, especially those coding for exotoxins and 
surface-binding proteins, are coordinately or simul- 
taneously regulated by loci on the chromosome such 
as the accessory gene regulator, agr, and the S. au- 
reus regulator, sar. See BACTERIAL GENETICS; BACTE- 
RIOPHAGE; PLASMID; TRANSPOSONS. 

Resistance to antimicrobial agents. Staphylococci 
have a remarkable ability to acquire resistance to 
antimicrobial agents. Presently, most S. aureus iso- 
lates are resistant to penicillin and many other an- 
tibiotics. The most common resistance mechanism is 
mediated by production of beta-lactamase enzymes, 
which inactivate antibiotics by preventing their inhi- 
bition of bacterial cell wall synthesis. Methicillin is 
the drug of choice for strains that are resistant to mul- 
tiple antimicrobial agents. Unfortunately, methicillin- 
resistant S. aureus is being isolated with increasing 
frequency, especially from patients with nosocomial 
(hospital-acquired) infections. For most of these iso- 
lates, vancomycin is the drug used as a last resort. 
In the past few years, a few isolates with partial re- 
sistance to vancomycin have been found, causing 
concern that this organism may become untreatable 
unless new drugs become available. See DRUG RESIS- 
TANCE. 


Strain typing. Infections caused by staphylococci, 
especially S. aureus, may be communicable, and 
it is important to trace the source of organism in 
outbreaks. Several typing methods have been used 
for epidemiologic studies. The classical phenotypic 
method, phage typing, was employed successfully 
for decades and is based on the fact that S. au- 
reus may carry bacteriophages that can infect some 
less-related isolates. Based on susceptibility profiles, 
strains are grouped into five types. Because of tech- 
nical difficulty and the emergence of nontypeable 
isolates, phage typing is being replaced by molecu- 
lar techniques. One technique, pulsed-field gel elec- 
trophoresis, analyzes chromosomal DNA fragments 
generated by the restriction enzyme Smal. Since only 
a few sites are recognized by Smal in the S. aureus 
chromosome, a limited but variable number of frag- 
ments are generated. These fragments can be sepa- 
rated by pulsed-gel electrophoresis, and the profile 
generated represents a “fingerprint” for the strain in 
question. See GENETIC MAPPING. 

Coagulase-positive (S. aureus) infections. Staphy- 
lococcus aureus Causes approximately 8.9 million 
cases of human disease annually in the United States. 
Many people carry S. aureus and can develop en- 
dogenous infections if they become immunocom- 
promised or if the organism enters the body through 
a break in the skin or mucous membrane. Infections 
acquired from exogenous sources are also common. 
Exogenous sources can be other people, fomites 
(inanimate objects, such as dishes, clothing, towels), 
the environment, or occasionally animals. Staphylo- 
coccus aureus strains are variable in expression of 
multiple-potential virulence factors; therefore, an en- 
counter with this organism can lead to three types 
of illness: pyogenic infections, toxigenic infections, 
and food-borne intoxications. 

Pyogenic infections. Staphylococcus aureus infec- 
tions are common, and most develop into a pyogenic 
(pus-forming) lesion caused by acute inflammation. 
Inflammation helps eliminate the bacteria but also 
damages tissue at the site of infection. Typical pyo- 
genic lesions are abscesses with purulent centers 
containing leukocytes, fluid, and bacteria. The ab- 
scess is enclosed by fibrin and cells. Many S. aureus 
products trigger inflammation, but the most impor- 
tant are cell wall components and several staphylo- 
coccal exotoxins (proteins that are secreted and not 
associated with the cell). Exotoxins with hemolytic 
or cytotoxic activity destroy tissue by damaging host 
cell membranes. Some also act on leukocyte mem- 
branes and prolong survival of S. aureus in the ab- 
scess. 

Pyogenic infections can occur anywhere in the 
body. Skin infections are common and may be 
communicable. Styes and boils are uncomplicated 
infections of hair follicles in the eyelid or sweat or 
sebaceous glands in the skin. More extensive skin 
infections are termed carbuncles or furuncles. In- 
fections may also develop internally if staphylococci 
enter through a wound or escape from an abscess. 
Invasion may occur through a direct extension of 
an abscess or via the bloodstream. Blood infections 


(septicemia) can disseminate the organism through- 
out the body and abscesses can form internally. In- 
fections in certain parts of the body, such as in the 
bone (osteomyelitis), respond poorly to antimicro- 
bial therapy. 

Toxigenic infections. Certain strains of S$. aureus pro- 
duce exotoxins that mediate two illnesses, toxic 
shock syndrome and staphylococcal scalded skin 
syndrome. In both diseases, exotoxins are produced 
during an infection, diffuse from the site of infection, 
and are carried by the blood (toxemia) to other sites 
of the body, causing symptoms to develop at sites 
distant from the infection. 

Toxic shock syndrome is an acute life-threatening 
illness mediated by staphylococcal superantigen exo- 
toxins. Toxic shock syndrome patients develop fever, 
hypotension, and multiorgan involvement and sub- 
sequent desquamation (peeling of the skin). The oc- 
currence of staphylococcal toxic shock syndrome in 
the United States is approximately 0.3 to 0.8 case per 
100,000 people. The S$. aureus superantigen family 
contains nine antigenic variants of staphylococcal en- 
terotoxins and toxic shock syndrome toxin-1 (TSST- 
1). Superantigens are unique molecules that bind si- 
multaneously to major histocompatibility complex 
MHO) Class II molecules on antigen-presenting cells 
and to receptors on T cells. Unlike conventional 
antigens, superantigens are neither processed by 
antigen-presenting cells nor expressed within the 
MHC Class II peptide groove. Instead, superanti- 
gens bind to peripheral surfaces of MHC Class II 
molecules and to the T-cell receptor at semicon- 
served elements of the 6-chain variable region (Vf). 
Binding in this manner stimulates a high percentage 
of T cells. The activated immune cells express ex- 
cessive quantities of cytokines, which generate fever 
and adverse effects on the cardiovascular system, re- 
sulting in shock. Menstrual toxic shock syndrome is 
caused by TSST-1 produced by S. aureus colonizing 
the vagina or cervix. This illness is linked to tam- 
pon use during menstruation, with absorbency being 
a risk factor. Nonmenstrual toxic shock syndrome 
occurs when S. aureus causes an infection else- 
where in the body and produces TSST-1 or entero- 
toxins. See CELLULAR IMMUNOLOGY; TOXIC SHOCK 
SYNDROME. 

Staphylococcal scalded skin syndrome, also 
known as Ritter’s disease, refers to several staphylo- 
coccal toxigenic infections. It is characterized by der- 
matologic abnormalities caused by two related exo- 
toxins, the type A and B exfoliative (epidermolytic) 
toxins. Exfoliative toxins are superantigens, but they 
are not typically associated with toxic shock syn- 
drome. Instead, they are proteases that cause the 
separation of granular cells within the epidermis and 
skin peeling in a single plane at the stratum corneum. 
Bullous impetigo is a localized form of staphylococ- 
cal scalded skin syndrome in which the toxin acts 
only at the site of infection to form a liquid-filled le- 
sion termed a bulla. Systemic staphylococcal scalded 
skin syndrome occurs when the exfoliative toxin 
disseminates via toxemia. One form of systemic ill- 
ness, generalized exfoliative disease, is characterized 
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by skin peeling in large sheets over most parts of 
the body. In an alternate form, staphylococcal scar- 
lantiniform eruption, patients develop a generalized 
roughened sandpaper appearance of the skin with- 
out bulla formation or generalized exfoliation. Since 
nearly all people develop antibodies early in life, sys- 
temic staphylococcal scalded skin syndrome is rare 
after the age of 5. 

Standard therapy for toxigenic infections is to 
identify and control the infection to minimize the 
amount of toxin produced. Toxic shock syndrome 
and severe staphylococcal scalded skin syndrome are 
typically treated with intravenous administration of 
beta-lactamase-resistant, cell-wall-active agents such 
as methicillin. In toxic shock syndrome, it is also im- 
portant to provide supportive treatment for shock 
and counteract serum chemistry abnormalities. In- 
travenous immunoglobulin has proven successful in 
reducing severity of toxic shock syndrome; however, 
there is currently no vaccine for toxic shock syn- 
drome or staphylococcal scalded skin syndrome. 

Food-borne intoxications. At least 1 million cases of 
staphylococcal food poisoning occur annually in the 
United States. Staphylococcal food poisoning is not 
an infection, but an intoxication that results from in- 
gestion of staphylococcal enterotoxins in food. The 
enterotoxins are produced when food contaminated 
with S. aureus is improperly stored under conditions 
that allow the bacteria to grow. Although contami- 
nation can originate from animals or the environ- 
ment, food preparers with poor hygiene are the usual 
source. 

The staphylococcal enterotoxins are stable and 
survive moderate heating. If ingested, they act in 
the gastrointestinal tract to stimulate the vagus nerve 
and signal the emetic center in the medulla. Vomit- 
ing, diarrhea, and abdominal cramps are the usual 
symptoms of staphylococcal food poisoning. Symp- 
toms begin 1-6 hours following ingestion and usually 
resolve within 24 hours. Most cases do not require 
treatment. Antimicrobial agents have no effect. Se- 
vere cases accompanied by dehydration should be 
treated with intravenous fluids and antiemetic agents 
to control vomiting. Fatalities are rare and usually 
occur only in the very young or elderly. Because of 
toxin stability, the illness is not routinely preventable 
by heating foods. Pasteurization is effective in elim- 
inating organisms in dairy products, although post- 
pasteurization contamination by human sources is 
possible. See FOOD POISONING. 

Effective methods for preventing staphylococcal 
food poisoning are aimed at eliminating contami- 
nation through common hygiene practices, such as 
wearing gloves, and proper food storage to minimize 
toxin production. 

Coagulase-negative diseases. Because they fre- 
quently colonize human skin and mucosal surfaces, 
Staphylococcus species other than S. aureus are im- 
portant opportunistic pathogens for humans. Most 
of these have low virulence and do not produce co- 
agulase. Although several coagulase-negative staphy- 
lococci occasionally cause infections, S. epidermidis 
and S. saprophyticus are frequently associated with 
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certain human diseases. Staphylococcus epider- 
midis, the most virulent, coagulase-negative strep- 
tococcus, is a major cause of infected prosthetic 
devices, shunts, and catheters. The more frequent 
use of these devices has led to an increased oc- 
currence of infections. Strains that cause these in- 
fections typically produce a variety of extracellular 
substances that facilitate their adherence to, and sub- 
sequent accumulation on, artificial materials. Staphy- 
lococcus sapropbyticus causes at least 10% of uri- 
nary tract infections in nonhospitalized women aged 
16-25 years. The source of this pathogen is unclear, 
but recent sexual intercourse is the leading risk fac- 
tor. Staphylococcus saprophyticus is not highly viru- 
lent otherwise. Its surface components that bind to 
the urinary epithelium are important for disease. See 
MEDICAL BACTERIOLOGY. 

Animal diseases. Coagulase-positive staphylococci 
are the most important Staphylococcus pathogens 
for animals. Certain diseases of pets and farm ani- 
mals are very prominent. Staphylococcus aureus is 
the leading cause of infectious mastitis in dairy an- 
imals. This udder infection causes nearly $2 billion 
in lost revenue to the United States dairy industry 
per year. Staphylococcus intermedius causes skin 
pyoderma in dogs and cats. Staphylococcus aureus 
and other coagulase-positive and coagulase-negative 
Streptococcus species are also important pathogens 
for goats, sheep, horses, pigs, poultry, and dolphins. 

Greg Bohach 
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A self-luminous body that during its life generates (or 
will generate) energy and support by thermonuclear 
fusion. 

Sun as reference. The fundamental reference for 
properties of stars is the Sun. The Sun’s distance of 
1.496 x 10° km (9.30 x 10” mi) from the Earth de- 
fines the astronomical unit (AU) and gives a solar 
radius (R) of 6.96 x 10° km (4.32 x 10° mi). From 
the energy received on Earth (1367 W m~?), the 
solar luminosity (L) is 3.85 x 107° W. Now, L = 47 R? 
(surface area) x oT Cuminosity per unit area ac- 


cording to the Stefan-Boltzmann law, where o is 
the Stefan-Boltzmann constant); therefore, the effec- 
tive blackbody “surface” temperature (T) is 5780 K 
(9950°F). (The “surface” is where the solar gases 
become opaque thanks to absorption by the nega- 
tive hydrogen ion.) The Earth’s orbital parameters 
and Kepler’s third law give a solar mass of 1.99 x 
10°° kg, the solar volume leading to an average 
density of 1.4 g cm~> (1.4 times that of water). See 
ASTRONOMICAL UNIT; HEAT RADIATION; SUN. 

The Sun’s temperature and density rise inwardly 
as a result of gravitational compression, and at the 
still-gaseous solar center reach values of 15.7 x 10°K 
(28.3 x 10°°F) and 150 gcm~>. Application of atomic 
theory to the solar spectrum and analysis of the 
solar wind and oscillations show the Sun to be 92% 
hydrogen (by number of atoms), 8% helium, and 
about 0.015% everything else (oxygen dominating, 
then carbon, neon, and nitrogen). The solar chemi- 
cal composition provides a standard for other stars 
as well as an observational benchmark against which 
to compare theories of the chemical evolution of the 
Milky Way Galaxy. 

Above about 8 x 10° K (14 x 10°°F), within the 
inner quarter of the radius and about half the mass, 
hydrogen (the proton, 'H) fuses to helium primarily 
via the proton-proton chain: 


1H + !H — °H + positron (positive electron) 
+ neutrino 
positron + electron —> two gamma rays 


7H + 'H > 3He + gamma ray 
3He + >He > *He + 2('H) 


The resulting release of energy supports the 
Sun against gravitational contraction. The radiation 
works its way out, degrading in energy with decreas- 
ing temperature, until it is released largely at opti- 
cal wavelengths. The neutrinos, nearly massless par- 
ticles, exit the Sun at close to the speed of light. 
Though nearly everything is transparent to them, 
enough are captured by terrestrial neutrino tele- 
scopes to confirm the theory of the Sun’s energy pro- 
duction. There was initially enough hydrogen to run 
the chain for 10 billion years. With the Sun 4.6 billion 
years old (from radioactive dating of meteorites), it 
has over 5 billion years of core hydrogen fusion left. 
See NEUTRINO; POSITRON; PROTON-PROTON CHAIN; 
SOLAR NEUTRINOS. 

Names. Over 6000 stars can be seen with the un- 
aided eye. The brightest carry proper names from 
ancient times, most of Arabic origin that reflect the 
position of the star within its constellation. A more 
general system (from Johannes Bayer in the early 
seventeenth century) names stars within constella- 
tions by Greek letters roughly in accord with ap- 
parent brightness, followed by the Latin genitive of 
the constellation name (for example, Vega, in Lyra, is 
also Alpha Lyrae). More generally yet, brighter stars 
carry numbers, derived from the work of the as- 
tronomer John Flamsteed (1646-1719), in easterly 
order within a constellation (Vega also 3 Lyrae). 
All naked-eye stars also have HR (Harvard Revised) 








TABLE 1. First-magnitude stars* 
Apparent visual 
Proper name Greek-letter name magnitude (V) 

Canopus a Carinae —0.72 
Rigil Kentaurus a Centauri —0.29 
Arcturus œ Boötis —0.04 
Vega a Lyrae 0.03 
Capella a Aurigae 0.08 
Rigel B Orionis 0.12 
Procyon a Canis Minoris 0.34 
Achernar æ Eridani 0.46 
Hadar ß Centauri 0.61 
Betelgeuse œ Orionis 0.70 
Acrux a Crucis 0.76 
Altair a Aquilae 0.77 
Aldebaran a Tauri 0.85 
Antares a Scorpii 0.96 
Spica æ Virginis 1.04 
Pollux B Geminorum 1.14 
Fomalhaut a Piscis Austrini 1.16 
Deneb a Cygni 1.25 
Mimosa B Crucis 1.25 
Regulus a Leonis 1.35 
Adhara e Canis Majoris 1.50 


Sirius a Canis Majoris —1.46 


Distance, Absolute visual 
light-years magnitude (M,) Spectral class 
8.6 1.43 A1V 
313 —5.63 FO Il 
4.4 4.06 G2V+K1V 
37 —0.30 K1.5 Ill 
25 0.58 AOV 
42 —0.48 G8 III + GO III 
UUS —6.8 B8 lab 
1 2.62 F5 IV—V 
145 —2.76 B3 V 
335 —4.44 B1 III 
425 —4.9 M2 lab 
320 —4.2 B0.5 IV + B1 V 
17 2.21 A7 V 
65 —0.65 K5 III 
600 —5.9 M1.5 Ib 
260 —3.49 B1 Ill-IV + B2 V 
34 1.07 KO III 
25 1.73 A3 V 
2600 —8 A2 la 
350 —3.9 B0.5 III 
78 —0.53 B7 V 
430 —4.2 B2 Il 








*As seen by eye, doubles combined into one. Doubles are indicated by two entries in the spectral class. Low-luminosity companions are not included. 








numbers assigned in order east of the vernal 
equinox. A variety of catalogs list millions of tele- 
scopic stars. See ASTRONOMICAL CATALOGS; CON- 
STELLATION. 

Magnitudes and colors. About 130 B.C., Hip- 
parchus assigned naked-eye stars to six brightness 
groups or “apparent magnitudes” (m), with first mag- 
nitude the brightest (Table 1). This scheme is now 
quantified as a logarithmic system such that five mag- 
nitudes correspond to a factor of 100 in brightness, 
rendering first magnitude 2.512...times brighter 
than second, and so on; calibration extends the 
brightest stars into negative numbers. The faintest 
stars now observed are around magnitude 30, 10!” 
times fainter than magnitude zero. 

Stars assume subtle colors from red to blue-white, 
reflecting different spectral energy distributions that 
result from temperatures ranging from under 2000 K 
(3100°F) to over 100,000 K (180,000°F). The mag- 
nitude of a star therefore depends on the detector’s 
color sensitivity. A blue-sensitive detector “sees” blue 
stars brighter than does the eye, which is most sensi- 


tive to yellow light, and will assign them lower magni- 
tudes. Numerous magnitude systems range from the 
ultraviolet into the infrared, though the apparent vi- 
sual magnitude (m, = V) is still standard. The differ- 
ences among the systems allow measures of stellar 
color and temperature. The total brightness at the 
Earth across the spectrum is given by the apparent 
bolometric magnitude (77,)), which is dependent on 
the visual magnitude and temperature. See COLOR 
INDEX; MAGNITUDE (ASTRONOMY). 

Distances. The fundamental means of finding stel- 
lar distances is parallax. As the Earth moves in orbit 
around the Sun, a nearby star will appear to shift its 
location against the background. The parallax (p in 
arcseconds) is defined as one half the total shift, and 
is the angle subtended by the Earth’s orbital radius as 
seen from the star. Distance in parsecs (pc) is 1/pc, 
where 1 pc = 206,265 AU = 3.26 light-years. (A light- 
year is the distance that a ray of light will travel in 
a year.) The nearest star, a telescopic companion to 
Alpha Centauri, is 1.34 pc = 4.39 light-years away 
(Table 2). Parallaxes of hundreds of thousands of 

















TABLE 2. Stars within 10 light-years 
Distance, Spectral Apparent visual Absolute visual 
Star light-years class magnitude (V) magnitude (My) 
Proxima Centauri 4.21 M5.5 V 11.05 15.49 
Alpha Centauri A 4.40 G2V —0.01 4.34 
Alpha Centauri B 4.40 K1V 1.33 5.68 
Barnard’s Star 5.94 M4 V 9.54 13.24 
Wolf 359 7.66 M6 V 13.54 16.68 
BD+36°2147 8.32 M2 V 7.49 10.47 
Sirius A 8.61 A1 V —1.47 1.43 
Sirius B 8.61 DAŤ 8.44 11.33 
L726-8A 8.64 M5.5 V 12.57 15.45 
L726-8B 8.64 M5.5 V 12152 15.40 
Ross 154 9.69 M3.5 V 10.95 13.58 
tHydrogen-rich white dwarf. 
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Fig. 1. The Milky Way in Sagittarius, composed of millions of stars in the disk of the 
Galaxy. (© Anglo-Australian Observatory; photograph by David Malin) 


stars found with the Hipparcos satellite allow fair 
accuracy to distances of over 1000 light-years. Par- 
allax measures then allow the calibration of other 
distance methods (such as spectroscopic distance, 
main-sequence cluster fitting, and use of Cepheid 
variables) that extend much farther. See LIGHT-YEAR; 
PARALLAX (ASTRONOMY); PARSEC. 

Distribution and motions. All the unaided-eye stars 
and 200 billion more are collected into the Milky Way 
Galaxy (or simply, the Galaxy), 98% concentrated 
into a thin disk over 100,000 light-years across. From 
the Sun, inside the disk and 27,000 light-years from 
the Galaxy’s center in Sagittarius, the disk appears 
as the Milky Way. The appearance of the Milky Way 
varies considerably. It is brightest and widest toward 
the galactic center, and faintest in the opposite di- 
rection toward the anitcenter in Taurus and Auriga 
(Fig. 1). Surrounding the disk is a vast but sparsely 
populated halo. 

Since stars orbit the Galaxy’s center, they are seen 
to move relative to the Sun. Angular proper motions 
across the line of sight range from 10 arcseconds per 
year to below a thousandth of an arcsecond. Proper 
motions (jy, in arcseconds per year) depend on ve- 
locities across the line of sight (w, in kilometers per 
second) and distances (D, in parsecs) according to 
Eq. (1), which allows v, to be found. Velocities along 
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the line of sight (v,) are measured from Doppler shifts 
of stellar spectrum lines. The combination yields 
space velocities (v,) relative to the Sun according 
to Eq. (2). Statistical analysis of these motions shows 


v= to D 


the Sun to be moving through the local stars at a 
speed of 15-20 km/s (9-12 mi/s), roughly toward 
Vega. From radial velocities of sources outside the 
Galaxy, it is found that the Sun moves in a roughly 
circular orbit at 220 km/s (137 mi/s), which when 
combined with the space motions of other stars al- 
lows their galactic orbits to be determined. Stars in 
the disk have closely circular orbits; those in the halo 
have elliptical ones. See DOPPLER EFFECT; MILKY WAY 
GALAXY. 

Absolute magnitudes. The apparent visual magni- 
tude of a star depends on its intrinsic visual lumi- 
nosity and on the inverse square of the distance. 
Knowledge of the distance allows the determination 
of the true visual luminosity, expressed as the abso- 
lute visual magnitude, M,. This quantity is defined 
as the apparent visual magnitude that the star would 
have at a distance of 10 pc, and is found from the 
magnitude equation (3). Absolute visual magnitudes 


M=m-+5-—5logd (pe) QB) 


from class O to mid L range from M, = —10 to +23 
(a factor of 10!). The Sun’s absolute visual magni- 
tude is in the middle of this range, +4.83. When 
the invisible infrared radiation from faint cool stars 
and the equally invisible ultraviolet from the ultralu- 
minous hot stars are taken into account, the range 
in absolute bolometric luminosities (expressed in 
magnitudes as M,) is 30 times lower. The quantity 
My. can be calibrated in watts; for the Sun, My.) = 
+4.75. 

Spectral classes. Stars exhibit a variety of 
absorption-line spectra. The absorptions, narrow 
cuts in the spectra, are produced by atoms, ions, and 
molecules in the stars’ thin, semitransparent outer 
layers, or atmospheres. Some stars show powerful 
hydrogen absorptions, others weak hydrogen and 
metals, yet others molecules but no hydrogen. Over 
a century ago, Edward C. Pickering lettered them 
according to the strengths of their hydrogen lines. 
After he and his assistants dropped some letters and 
rearranged others for greater continuity, they arrived 
at the standard spectral sequence, OBAFGKM, which 
was later decimalized (the Sun is in class G2). Classes 
Land T were added in 1999 (Fig. 2; Table 3). Since 
the sequence correlates with color, it must also cor- 
relate with temperature, which ranges from near 
50,000 K (90,000°F) for hot class O, through about 
6000 K (10,000°F) for solar-type class G, to under 
2000 K (3100°F) for class L, and below 1000 K 
(1300°F) for class T (all of which are brown dwarf 
substars). 

The majority of stars in the galactic disk have 
chemical compositions like that of the Sun. Differ- 
ences in spectra result primarily from changes in 
molecular and ionic composition and in the effi- 
ciencies of absorption, all of which correlate with 
temperature. Molecules exist only at low tempera- 
tures. At higher temperatures they break up, and 
neutral atoms become ionized, so the absorptions 
of the neutral state eventually weaken with increas- 
ing temperature. When the temperature becomes 


high enough, single ionization is replaced by dou- 
ble, and so on. Hydrogen reaches maximum strength 
in class A. Hotter than class A, hydrogen becomes 
ionized and the neutral hydrogen lines weaken 
in classes B and O. Even the absorptions of in- 
dividual ions are affected, as absorbing electrons 
are pumped toward higher-energy states at higher 
temperatures. 

Most stars have more oxygen than carbon. In 
some cool stars, however, the ratio is reversed, car- 
bon molecules replacing those of metallic oxides. 
Stars whose temperatures track class M are assigned 
class N, warmer ones class R; both are now called 
carbon stars and lumped into class C. In between, 
where oxygen and carbon are in equal abundance, 
leftover oxygen attaches preferentially to zirconium 
(whose abundance is also raised above normal); the 
zirconium oxide stars are called class S. See ASTRO- 
NOMICAL SPECTROSCOPY; CARBON STAR; SPECTRAL 
TYPE. 

Hertzsprung-Russell diagram. Shortly after the in- 
vention of the spectral sequence, H. N. Russell and E. 
Hertzsprung showed that luminosity correlates with 
spectral class. A graph of absolute visual (or bolomet- 
ric) magnitude (luminosity increasing upward) plot- 
ted against spectral class (or temperature, decreasing 
toward the right) is called the Hertzsprung-Russell 
(HR) diagram (Fig. 3). The majority of stars lie in a 
band in which luminosity climbs up and to the left 
with temperature (with the Sun in the middle). But 
another band begins near the location of the Sun 
and proceeds up and to the right (to lower temper- 
ature), luminosities increasing to thousands solar as 
temperature drops to class M. To be bright and cool, 
such stars must have large radiating areas and radii. 
To distinguish between the bands, the larger stars 
are called giants, while those of the main band are 
termed dwarfs (or main-sequence stars). Most car- 
bon stars are giants, and stars of classes R, N, and 
S are mixed in with those of class M. Yet brighter 
stars to the cool side of the main sequence, with 
luminosities approaching 10° solar, are called super- 
giants. In between the giants and the dwarfs lie a few 
subgiants. At the top, superior to the supergiants, are 
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Fig. 2. Spectral sequence ranging from hot class O to cool class M. The hydrogen lines 
increase in strength from O to A and then weaken, disappearing in class M, where 
molecules take over. (From N. Ginestet et al., Atlas de Spectres Stellaires, Observatoire 
Midi-Pyrénées and Observatoire de Strasbourg, 1992; Y. Yamashita et al., An Atlas of 
Representative Stellar Spectra, University of Tokyo Press, 1978; J. B. Kaler, Extreme 
Stars, Cambridge University, Press, 2001) 
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the very rare hypergiants. See DWARF STAR; GIANT 
STAR; SUBGIANT STAR; SUPERGIANT STAR. 

White dwarf stars. In the lower left corner of the dia- 
gram, beneath the main sequence, are stars so dim 
that they must be very small. Since the first ones 
found were hot and white, they became known as 
white dwarfs in spite of their actual temperatures or 
colors. White dwarfs cannot be classified on the stan- 
dard system. Most have almost pure hydrogen atmo- 
spheres caused by the downward settling of heavier 
helium atoms. Others have no hydrogen at all and 
display only helium lines. White dwarfs must there- 
fore be positioned on the HR diagram according to 





TABLE 3. Spectral classes 








Class Characteristic absorption lines Color? Temperature, Ke 
© H, He*, He Blue 31,000-49,000 
B H, He Blue-white 9800-31 ,000 
A H White 7500-9800 
E Metals, H Yellow-white 6000-7500 
G Cat, metals Yellow 5300-6000 
K Cawica Orange 3900-5300 
M TiO, other molecules, Ca Orange-red 2200-3900 
Le Metal hydrides, alkalai metals Red <2200 
aie Methane, water, ammonia bands Infrared 1000 
R® Carbon Orange 4000-5800 
N° Carbon molecules Red 2000-4000 
Se ZrO and other molecules Orange-red 2000-4000 





The color refers to energy distribution; visual colors are subtle. 
Main sequence. 

CA mixture of red dwarf stars and brown dwarfs. 

dBrown dwarfs. 





eR and N are combined into carbon stars, class C. Classes R, N, and S all consist of giants. 
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Fig. 3. Hertzsprung-Russell (HR) diagram, showing the 
positions of the major kinds of stars. 


their temperatures (rather than their spectral 
classes). See WHITE DWARF STAR. 

Subdwarfs. Stars in the galactic halo are deficient 
in heavy elements. Typical iron contents are a hun- 
dredth that of the Sun, and in the extreme a hundred- 
thousandth. Low metal content makes halo dwarfs 
bluer than those of the standard main sequence, shift- 
ing them to the left and seemingly downward on the 
HR diagram, where they are known as subdwarfs. 
See STELLAR POPULATION. 

Stellar sizes. Since, for any spherical star, luminos- 
ity L = 40 R?oT4, the luminosity in solar units is 
given by Eq. (4). Temperature (7) from spectral class 


Tj Lnn) = CR Oy Tod (4) 


or spectral analysis plus the luminosity from abso- 
lute bolometric magnitude Mpo gives the stellar ra- 
dius. Dwarf radii increase from about 10% that of 
the Sun in class M to nearly 15 solar in class O. Giant 
radii range past 10 times solar to over 1 AU, super- 
giants to nearly 10 AU. White dwarfs are about the 
size of Earth. The radii of stars are confirmed by 
direct measurement of angular size through inter- 
ferometry and analysis of eclipsing double (binary) 
stars. See ECLIPSING VARIABLE STARS. 

Spectral lines are sensitive to stellar density, so 
stars can be classified by size and luminosity on 
the basis of their spectra alone. In the standard 
Morgan-Keenan-Kellman (MKK) system, roman nu- 
merals I through V stand for supergiant, bright giant, 


giant, subgiant, and dwarf (VI is sometimes used for 
subdwarf). Once luminosity and absolute visual 
magnitude are known from a star’s spectrum, its 
spectroscopic distance can be calculated from the 
magnitude equation (which must contain a correc- 
tion for dimming by interstellar dust). 

Stellar frequency distribution. The unaided-eye sky is 
filled with class A and B dwarfs, and with giants and 
supergiants that range from class O to M. Per unit vol- 
ume of space, however, dim M and L dwarfs (none 
visible to the unaided eye and not counting dimmer 
brown dwarfs) dominate, constituting over 80% of all 
stars (Table 2). Of the standard sequence (discount- 
ing Land T), some 10% of stars are G dwarfs like the 
Sun and only 1% are class A stars. Rarest, making up 
only 0.0001%, are the O stars, which are so bright 
that they can be seen for great distances. There are 
about as many giants as A dwarfs, and supergiants 
are as rare as O stars. See HERTZSPRUNG-RUSSELL 
DIAGRAM. 

Double and multiple stars. Most stars are members 
of some sort of community, from doubles through 
multiples (double-doubles, and so forth) to clus- 
ters, which themselves contain doubles. Separations 
between components of double stars range from 
thousands of astronomical units (with orbital pe- 
riods of a million years) to stars that touch each 
other (and orbit in hours). Orbits of the more 
widely spaced binaries (periods of months to cen- 
turies) can be determined directly (or through in- 
terferometry). Limited orbital parameters of the 
more closely spaced ones are found by means of 
Doppler shifts caused by high orbital velocities. 
Some of the latter eclipse each other, the eclipses 
establishing orbital tilts, true orbital speeds, and stel- 
lar radii. The sum of the masses of a binary is found 
from the period and orbital dimension through 
Kepler’s third law; the ratio of the masses is found 
by locating the center of mass of the system; and 
the combination of these two leads to determination 
of the individual masses. See BINARY STAR; KEPLER’S 
LAWS; ORBITAL MOTION. 

Masses and main-sequence properties. Obser- 
vations of hundreds of binary stars show that mass 
(M) increases upward along the main sequence from 
about 7.5% solar at cool class M to over 20 solar 
in the cooler end of class O. Extrapolation by the- 
ory, as well as through observations of clusters, sug- 
gests masses of 120 solar at the extreme hot end 
(class O3) of the main sequence. Fainter than the 
Sun, luminosity is proportional to M?; from the Sun 
and brighter, luminosity goes as M‘ and then back 
to MP. This mass-luminosity relation is the result of 
higher internal temperatures and pressures in more 
massive stars caused by gravitational compression. 
Given the Sun’s energy source and that luminos- 
ity varies smoothly with mass, the main sequence 
must be a hydrogen-fusing mass sequence. Giants, 
supergiants, and white dwarfs must therefore have 
some other energy source. See MASS-LUMINOSITY 
RELATION. 

Carbon cycle. As internal temperature climbs above 
the 8 x 10° K limit, hydrogen fuses to helium via 





(a) 








(b) 


Fig. 4. Images of the brown dwarf Gliese 229B, a companion to a much brighter M dwarf. (a) Discovery image, taken at 
Palomar Observatory in California (T. Nakajima and S. Kulkarni). (b) Hubble Space Telescope image (S. Durrance and 


T. Golimowski; NASA). 


the proton-proton cycle at an ever-increasing rate. 
Above about 15 x 10° K (27 x 10°F), so does fusion 
by the carbon cycle, in which carbon acts as a nuclear 
catalyst: 


12C + 1H —> PN + gamma ray 


But !N is radioactive and decays, so the cycle pro- 
ceeds: 


BN — C + positron + neutrino 
positron + electron — two gamma rays 
BC + 4H —> N + gamma ray 
MN + !H > O + gamma ray 


Again, O is unstable, so the cycle concludes: 


BOUN + positron + neutrino 
positron + electron — two gamma rays 
DN + 1H > ?C + ‘He 


Four protons are therefore converted to an atom of 
helium. Together with side chains that create heav- 
ier oxygen and fluorine, the carbon cycle overtakes 
the proton-proton chain around class F5 (close to 
1.3 solar masses), greatly shortening the star’s life. 
See CARBON-NITROGEN-OXYGEN CYCLES. 

Stellar convection. The onset of carbon-cycle domi- 
nance coincides with a change in stellar structure. 
The Sun has a radiatively stable core surrounded 
by an envelope whose outer parts are in a state of 
convection that helps produce a magnetic field and 
magnetic sunspots. Hotter dwarfs have shallower 
convection layers, and above class F envelope con- 
vection disappears, the cores becoming convective. 
The convective layers of cooler dwarfs deepen, until 
below about 0.3 solar mass (class M3), convection 
takes over completely and the stars are thoroughly 
mixed. See CONVECTION (HEAT). 


Brown dwarfs. Below a lower mass limit of 7.5% solar, 
internal temperatures and densities are not great 
enough to allow the proton-proton chain to oper- 
ate. Such stars, called brown dwarfs, glow dimly and 
redly from gravitational contraction and from the fu- 
sion of natural deuterium ÊH) into helium (Fig. 4). 
Long sought, brown dwarfs are now being found 
in abundance with infrared technologies. These and 
the stars at the bottom of the main sequence together 
inhabit class L. All class T stars are brown dwarfs. 
The lower mass limit to brown dwarfs is unknown; 
they may even overlap the masses of planets. See 
BROWN DWARF; EXTRASOLAR PLANETS; INFRARED AS- 
TRONOMY. 

Stellar rotation. The Sun rotates with an equatorial 
velocity of 2.0 km/s (1.25 mi/s) and a period of 
25 days. Rotation broadens stellar spectrum lines, 
allowing the observation of projected velocities 
which, since axial inclinations are not generally 
known, are lower limits to the true velocities. The 
assumption of random inclinations allows rotation 
speeds of stars in different spectral classes to be 
found. Main-sequence stars also exhibit magneti- 
cally induced sunspots, whose rotation in and out 
of sight allows accurate knowledge of rotation peri- 
ods. Cooler dwarfs rotate slowly because the mag- 
netic fields they generate couple with stellar winds 
and slow them down. Above the “rotation break” 
in class E rotation speeds increase, reaching max- 
imum values of 300 km/s (190 mi/s) or more in 
class B. Class B stars are commonly surrounded by 
rotation-related disks, the “Be” stars exhibiting emis- 
sion lines in their spectra. Because of their large sizes, 
giants and supergiants rotate slowly. See STELLAR 
ROTATION. 

Clusters. Doubles and multiples are highly struc- 
tured. Clusters are not, the member stars orbit- 
ing a common center of mass. Open clusters are 


Star 


351 


352 


Star 





Fig. 5. The Jewel Box open cluster (Kappa Crucis), which 
lies next to the Southern Cross. (© Anglo-Australian 
Observatory; photograph by David Malin) 


fairly small collections in which a few hundred or 
a thousand stars are scattered across a few tens of 
light-years. Examples are the Pleiades and Hyades in 
Taurus, which are among thousands of such clus- 
ters occupying the Galaxy’s disk (and thus the Milky 
Way) [Fig. 5]. About 150 globular clusters occupy 
the Galaxy’s halo, the poorest about as good as a 
rich open cluster, the best containing over a million 
stars within a volume 100 light-years across (Fig. 6). 
The main-sequence stars of globular clusters are sub- 
dwarfs, and like other halo stars are deficient in 
metals. 

The HR diagrams of clusters are radically differ- 
ent from the HR diagrams of the general stellar field. 
Those of open clusters differ among themselves in 
having various portions of the upper main sequence 
removed. Some main sequences go all the way from 
class M to O, while others extend only from M 
through G. The effect is related to the cluster’s 
age, since high-mass stars die first. Globular clusters, 
which lack an upper main sequence and are there- 
fore all old (up to about 12 x 10° years), contain a 
distinctive “horizontal branch” composed of modest 
giants. 

All clusters disintegrate with time as their stars 
escape through a form of evaporation aided by tides 
raised by the Galaxy. Old open clusters (more than 
10° years old) are rare, while the compact globulars 
can survive for the age of the Galaxy. 

If the HR diagram ofa distant cluster is constructed 
using apparent magnitude rather than absolute, then 
comparison of the cluster’s main sequence with that 
found from local stars using parallax allows the de- 
termination of the difference between absolute and 
apparent magnitude for the cluster, and therefore of 


the cluster’s distance (after correction for interstellar 
absorption). Knowing the cluster’s distance, in turn, 
makes it possible to learn the absolute magnitudes 
of the kinds of stars that are not within parallax dis- 
tance. By locating the center of the distribution of 
globular clusters, most of which lie outside the ob- 
scuring dust of the galactic disk, it is possible to de- 
termine the distance to the center of the Milky Way 
Galaxy. See STAR CLUSTERS. 

Variable stars. Dwarfs are generally stable. Giants 
and supergiants, however, can have structures that 
allow them to pulsate. Cepheids (named after the star 
Delta Cephei) are F and G supergiants and bright gi- 
ants that occupy a somewhat vertical instability strip 
in the middle of the HR diagram. They vary regu- 
larly by about a magnitude over periods ranging from 
1 to 50 days. The pulsation is driven by a deep layer 
of gas in which helium is becoming ionized. This 
layer alternately traps and releases heat from the out- 
going radiation. Larger and more luminous Cepheids 
take longer to pulsate. Once this period-luminosity 
relationship is calibrated through parallax and main- 
sequence cluster fitting, the period of a Cepheid 
gives its absolute magnitude, which in turn gives its 
distance. Cepheids are vital in finding distances to 
other galaxies (Fig. 7). See CEPHEIDS; HUBBLE CON- 
STANT. 

As the instability strip crosses the class F and A gi- 
ants found in globular clusters, it produces the low- 
metal RR Lyrae stars, which have periods of a day or 
less. Since they all have absolute magnitudes around 
0.7, they too are valuable distance indicators. Farther 
down the HR diagram, the instability strip crosses 
the class A and F main sequence. Here and in the 
giants just above it are the subtly varying Delta Scuti 





Fig. 6. Globular cluster 47 Tucanae, one of the grandest in 
the Milky Way Galaxy, containing over 1 million stars. 

(© Anglo-Australian Observatory; photograph by David 
Malin) 





Fig. 7. A Cepheid 51 million light-years away in the spiral galaxy M100. (a-c) Hubble Space Telescope images of the star (in 
center of each image) on three different nights, showing changes in brightness. (d) Location of the star in one of the galaxy’s 
spiral arms. (Wendy L. Freedman, Observatories of the Carnegie Institution of Washington; NASA) 


stars. Where the instability strip hits the hydrogen- 
rich white dwarfs, at temperatures between 10,500 
and 13,000 K (18,400 and 22,900°F), it produces 
the ZZ Ceti stars, which pulsate nonradially with 
periods of minutes. (Hydrogen-poor white dwarfs 
have their own instability zone around 22,000 K or 
39,100°F) 

Miras (after the star Mira, Omicron Ceti), or long- 
period variables, are luminous M, C, or S giants that 
can vary visually by more than 10 magnitudes over 
periods that range from 50 to 1000 days, the pulsa- 
tions again driven by deep ionization layers. Much 
of the visual variation is caused by small changes in 
temperature that shift the starlight from the visual to 
the infrared and that also alter the formation of ob- 
scuring titanium oxide molecules. The infrared (and 
bolometric) variation amounts to only two magni- 
tudes or so. Other giants, and especially supergiants, 
are semiregular or even irregular variables with no 
periodicities at all. Still other stars, such as the class 
B subgiant Beta Cephei stars, are multiply periodic. 
See MIRA. 

Duplicity produces its own set of intrinsic varia- 
tions. If the members of a binary system are close 
enough together, one of them can transfer mass to 
the other, and instabilities in the transfer process 


cause the binary to flicker. If one companion is 
a white dwarf, infalling compressed hydrogen can 
erupt in a thermonuclear runaway, producing a sud- 
den nova that can reach absolute magnitude —10. 
If the white dwarf gains enough matter such that it 
exceeds its allowed limit (the Chandrasekhar limit) 
of 1.4 solar masses, it can even explode as a super- 
nova (discussed below). See CATACLYSMIC VARIABLE; 
NOVA; VARIABLE STAR. 

Off the HR diagram. Various kinds of stars are not 
placeable on the classical HR diagram. The most 
common examples are the central stars of planetary 
nebulae, which are complex shells and rings of 
ionized gas that surround hot blue stars that range in 
temperature from 25,000 to over 200,000 K (45,000 
to over 360,000°F) [Fig. 8]. While they can have lumi- 
nosities as high as 10,000 solar, these central stars ap- 
pear relatively dim to the eye, as they produce most 
of their radiation in the high-energy ultraviolet por- 
tion of the electromagnetic spectrum. This radiation 
ionizes the surrounding nebulae and causes them to 
glow. On the theoretician’s extended HR diagram, 
in which luminosity is plotted directly against tem- 
perature, the planetary nebula central stars clearly 
connect the red giants with the white dwarfs. See 
PLANETARY NEBULA. 
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Fig. 8. Hubble Space Telescope image of the Ring Nebula 
in Lyra, a planetary rebula made of glowing ionized gas that 
surrounds a hot blue star. (Hubble Heritage Team; 
Associated Universities for Research in Astronomy; Space 
Telescope Science Institute; NASA) 


Even hotter neutron stars (10° K, 1.8 x 10°F) are 
found associated with the exploded remains of su- 
pernovae (supernova remnants). Only 25 or so kilo- 
meters (15 mi) across, the visible ones spin rapidly, 
and are highly magnetic, with fields 10'* times the 
strength of Earth’s field. Radiation beamed from 
tilted, wobbling magnetic axes can strike the Earth 
to produce seeming pulses of radiation (Fig. 9). Over 
time, the spin rates slow and these pulsars disappear. 
The remains of exploded stars, quiet pulsars (and ac- 
tive ones whose radiation does not hit Earth) litter 
the Galaxy. See NEUTRON STAR; PULSAR. 





Evolution. The different kinds of stars can be linked 
through theories of stellar evolution. Stars are born 
by the gravitational collapse of dense knots within 
cold dusty molecular clouds found only in the 
Galaxy’s disk. When the new stars are hot enough 
inside to initiate fusion, their contraction halts and 
they settle onto the main sequence. The higher the 
stellar mass, the greater the internal compression and 
temperature, and the more luminous the star. But the 
higher the internal temperature, the greater the rate 
at which hydrogen is fused, and the shorter the star’s 
life. The fuel supply is proportional to mass, M, and 
the rate of fusion is proportional to the luminosity, 
L. Lifetime ¢ is proportional to (fuel supply)/(rate of 
use) and thus to M/L. But L is on average proportional 
to M?$, so tis proportional to 1/M°3. For example, a 
100-solar-mass star will exhaust its core hydrogen in 
only 2.5 million years. See MOLECULAR CLOUD; PRO- 
TOSTAR. 

Main-sequence lifetime is so long for stars under 
0.8 solar mass that none has ever evolved in the his- 
tory of the Galaxy. When a star between 0.8 and 
about 10 solar masses uses all its core hydrogen, its 
outer layers expand and cool to class K (Fig. 10). 
The star then brightens as it becomes a giant, lower 
masses brightening by larger factors and first becom- 
ing subgiants. The ascent to gianthood is terminated 
when the central temperature is high enough (108 K 
or 1.8 x 10°°F) to initiate the fusion of core helium 
via the triple-alpha process, wherein three helium 
nuclei (alpha particles) combine to make one car- 
bon atom. This process temporarily stabilizes the star 
with a helium-burning core and a hydrogen-burning 
shell. (Fusion with another He nucleus makes oxy- 
gen.) 

When the helium is fused to carbon and oxygen, 
the core contracts, helium fusion spreads outward 





Fig. 9. Hubble Space Telescope image of the inner parts of the Crab Nebula supernova remnant. A spinning neutron star 
(pulsar) is the left of the two stars near the center of the image. Part b is blowup of boxed area in a. (J. Hester and P. Scowen; 


NASA) 


into a shell, and the star again climbs the giant 
branch, the hydrogen- and helium-fusing shells se- 
quentially turning on and off. Such asymptotic giant 
branch (AGB) stars become larger and brighter than 
before, passing into class M, where they eventually 
become unstable enough to pulsate as Miras. Within 
a certain mass range, convection in giants can bring 
carbon from helium fusion (as well as other new ele- 
ments) to the surface, and the star can become a class 
Sand then a carbon star. Powerful winds strip the star 
nearly to its fusion zone, which is protected from the 
outside by a low-mass hydrogen envelope. As the 
inner region becomes exposed, the star heats and 
eventually illuminates the surrounding outflowing 
matter to produce a planetary nebula, which dissi- 
pates into the interstellar medium, leaving a carbon- 
oxygen white dwarf behind. 

When high-mass stars, those above about 10 solar 
masses, use up their core hydrogen, they too migrate 
to the right on the HR diagram, becoming not so 
much brighter but larger, cooling at their surfaces 
and turning into supergiants. Below about 40 solar 
masses they become class M red supergiants, losing 
huge amounts of matter through immense winds. 
Some stabilize there under the action of helium fu- 
sion; others loop back to become blue supergiants. 
Above 60 solar masses, so much mass is lost that 
the supergiants do not make it much past class B, 
stalling there as helium fusion begins. Supergiants 
are so massive that, whatever their outward dispo- 
sition, when the internal helium is fused to carbon 
and oxygen, the core shrinks and heats to the point 
that carbon fusion can begin. The mixture converts 
to oxygen, neon, and magnesium, which then burns 
to a mixture of silicon and sulfur, and finally to iron, 
each fusion stage moving outward to surround the 
core in successive shells. 

Since iron cannot fuse and produce energy, the 
core suddenly collapses and somehow “bounces” 
to explode away all the outer layers. From the out- 
side, the observer sees the explosion as a type II 
supernova that can reach absolute magnitude — 18. 
Temperatures within the exploding layers are high 
enough to create all the chemical elements, includ- 
ing iron, nickel, and uranium. The debris—the su- 
pernova remnant—expands for centuries into space 
(Fig. 9), its shock wave helping to compress the in- 
terstellar matter and create new stars. The remain- 
ing core is so dense that it converts to a neutron star 
or, if the mass is great enough, even to a black hole. 
See BLACK HOLE; CRAB NEBULA; NEBULA. 

Other routes to stellar explosion derive from 
double stars. Mass transfer from a low-mass main- 
sequence star to a white dwarf companion can cause 
the white dwarf to exceed the Chandrasekhar limit 
of 1.4 solar masses (beyond which it cannot support 
itself by way of electron degeneracy). The resultant 
collapse of the white dwarf produces a hydrogen- 
less type Ia supernova that can reach M, = —19 with 
such reliability that such supernovae are used to mea- 
sure distances to distant galaxies. Stellar evolution 
feeds huge quantities of enriched matter back into 
the star-generating clouds of interstellar space, caus- 
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Fig. 10. Evolution of stars on the HR diagram. Solar masses of stars are the indicated 
numbers along the main sequence. Lower main-sequence stars have never had time to 
evolve. Intermediate-mass stars evolve from the main sequence to become giants that 
stabilize during helium fusion in a “clump.” (Subsequent evolution, when helium fusion 
is done, is not shown here.) High-mass stars evolve to the right as supergiants and then 
explode. (After J. B. Kaler, Astronomy!, Addison-Welsey-Longman, 1994, from work of 


I. Iben, Jr.) 


ing the metal content of the Galaxy to increase and 
providing the raw materials from which the Earth 
is made. See NUCLEOSYNTHESIS; STELLAR EVOLUTION; 
SUPERNOVA. James B. Kaler 
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Star clouds 


Large groupings of stars with dimensions of 1000 to 
several thousand light-years. Star clouds are not sta- 
ble clusterings of stars or even groups of a common 
origin, such as stellar associations. Instead, they are 
large areas where the stellar density is higher than 
average in a galaxy. They are nevertheless physical 
entities in that they result from large-scale star for- 
mation events or series of events that are to some 
extent limited in size by the characteristics of the 
galactic environment. 

Large star clouds are found commonly in the spi- 
ral arms of galaxies such as the Milky Way Galaxy 
and the Andromeda Galaxy (M31), but are not found 
in very old galaxies such as elliptical galaxies M32 
or NGC 3379. Conspicuous star clouds in the Milky 
Way Galaxy are found in the constellations Carina, 
Cygnus, Sagittarius, and Scutum. They are particu- 
larly spectacular when photographed with a wide- 
field telescope, when tens of thousands of stars 
crowd the photo. The densest star clouds, such as 
those in the constellation Sagittarius, are also spec- 
tacular when photographed with the Hubble Space 
Telescope (see illustration). In some galaxies, such 
as the giant spiral galaxy M101, they are immense, 
ranging up to 6000 light-years (5.7 x 10!° km or 
3.6 x 101° mi) in diameter. See ANDROMEDA GALAXY; 
MILKY WAY GALAXY. 

Some astronomers believe that the evidence sug- 
gests that star groups are hierarchical, with open 
clusters the smallest units; they are grouped together 





Hubble Space Telescope image of the center of the 
Sagittarius star cloud. (Space Telescope Science Institute; 
NASA) 


in OB associations, and the associations are grouped 
in star clouds, which in turn are grouped in galax- 
ies. This is sometimes interpreted to indicate that 
star formation proceeds from the large to the small 
scale, with the interstellar gas first breaking up into 
large clouds that gravitationally separate into pro- 
tostar clouds, and then episodes of star formation 
occur within these clouds to form OB associations, 
the nuclei of which then can become stable star 
clusters. See GALAXY, EXTERNAL; INTERSTELLAR MAT- 
TER; MOLECULAR CLOUD; STAR; STAR CLUSTERS; STEL- 
LAR EVOLUTION. Paul Hodge 
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galactic Star Clusters, [AU Symp. 207, pp. 410-420, 
Astronomical Society of the Pacific, San Francisco, 
2002. 
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Star clusters 


Groups of stars held together by mutual gravitational 
attraction. There are two basic morphological types: 
open clusters and globular clusters. Typical densi- 
ties of field stars near the Sun are 0.1 star per cubic 
parsec. (One parsec equals 1.9 x 10! mi or 3.1 x 
10% km.) Open clusters, with dozens to thousands 
of stars, have central densities of 0.3 to 10 stars per 
cubic parsec, and are often elongated or amorphous 
in shape (Fig. 1). Globular clusters, with a thousand 
to several million stars, and central densities of a few 
hundred to over 100,000 stars per cubic parsec, are 
generally spherical (Fig. 2). Associations are even 
looser assemblages, with a few hundred stars and 
central densities lower than the field. They are often 
recognized because of unusually large numbers of 
special types of stars. OB associations are dominated 
by hot, luminous, young O and B stars. T associa- 
tions are dominated by young T Tauri variable stars. 
See SPECTRAL TYPE; T TAURI STAR. 

Star clusters are important because, within each 
cluster, member stars probably are at the same dis- 
tance from the Sun, and have the same age and the 
same initial chemical composition. Stars with larger 
masses begin life hotter and brighter than lower-mass 
stars and end their lives earlier. Most stars within clus- 
ters lie along a band in the luminosity-temperature 
plane (where temperature is measured by spectral 
type or color). In older clusters, this main-sequence 
band terminates at fainter and cooler levels. Ages 
derived from such observations provide a chronol- 
ogy of which clusters formed first. Differences ob- 
served among stars away from the main sequence, 


such as in luminosity and temperature or in chemi- 
cal composition, reflect changes to the stars during 
the ends of their lives and provide laboratories for 
the study of stellar evolution. Apparent luminosities 
of main-sequence stars may be converted into intrin- 
sic luminosities by distance estimates obtained via 
geometric methods, including stellar parallaxes or 
stellar motions, or by photometric methods. Such 
distance determinations map out the locations of 
clusters in the Milky Way Galaxy. See HERTZSPRUNG- 
RUSSELL DIAGRAM; MILKY WAY GALAXY; PARALLAX 
(ASTRONOMY); STELLAR EVOLUTION. 

Open clusters. Open clusters, once called galactic 
clusters, are found mostly along the band of Milky 
Way, and are rarely found more than 1000 parsecs 
above or below the plane. A dozen are visible to 
the naked eye, such as Ursa Major, the Hyades (the 
horns of Taurus), the Pleiades (also in Taurus), and 
Praesepe (“the Beehive,” in Cancer). Over a 1100 are 
cataloged, and the Milky Way probably contains tens 
of thousands, most hidden by interstellar obscura- 
tion. See HYADES; PLEIADES. 

Distances and dimensions. The nearest, the Ursa Major 
cluster, lies about 25 parsecs away. The Hyades and 
Pleiades cluster lie about 45 and 125 parsecs away, 
respectively. Open clusters have been found at dis- 
tances as large as 15,000 parsecs (Berkeley 29). Open 
clusters have diameters of a few to about 20 parsecs. 

Spectral characteristics. The brightest main-sequence 
stars in the youngest clusters are luminous, violet O 
stars. In slightly older clusters, such as the Pleiades, 
the brightest main-sequence stars are blue B stars. 
In the oldest clusters, the brightest main-sequence 
stars are yellow G stars. Post-main-sequence stars in- 
clude luminous red giants. About a dozen clusters 
contain Cepheid variables. Measuring distances to 
such clusters provide the intrinsic luminosities of 
the Cepheids, and their detection and measurement 
in distant galaxies provides excellent distance esti- 
mates to those galaxies. This is a key component 
in measuring the rate of the universal expansion. 
The Pleiades and other clusters of the same age 
or older often contain white dwarf stellar rem- 
nants. See CEPHEIDS; COSMOLOGY; HUBBLE CON- 
STANT; WHITE DWARF STAR. 

Embedded clusters. The youngest clusters may be hid- 
den within dark dust clouds opaque to optical light. 
Infrared methods can reveal such young clusters, in- 
cluding T Tauri variables, Herbig-Haro objects, and 
bipolar outflows associated with them. See HERBIG- 
HARO OBJECTS; INFRARED ASTRONOMY; PROTOSTAR. 

Motions. Proper motions, distances, and radial ve- 
locities provide a snapshot of the motion of a clus- 
ter. Open-cluster motions are not very dissimilar from 
that of the Sun and appear to confine the clusters to 
the plane of the Milky Way. 

Ages and dissolution. Comparison of main-sequence 
data with stellar evolution models yields age esti- 
mates. Embedded clusters have ages of less than 
10° years. The Pleiades is about 3 x 107 years 
old, and the oldest open clusters, NGC 6791 and 
Berkeley 17, may reach 10!° years. Few old clus- 
ters are known, partly because of the dissolution 


Fig. 1. Double cluster in Perseus. (Yerkes Observatory) 


of clusters with time. The lower-mass stars can be 
said to float to the outer boundaries of a cluster and 
may be stripped away by passing stars, clusters, or 
giant molecular clouds. The oldest clusters are far- 
ther from the galactic plane, probably since they are 
more likely to survive in the lower densities there. 
See MOLECULAR CLOUD. 

Chemistry. Most open-cluster stars have chemical 
compositions similar to that of the Sun, beginning 
life with 88% hydrogen, 10% helium, and 2% in so- 
called heavy elements such as oxygen, carbon, and 
iron. Open-cluster heavy-element abundances range 
from a factor of about 3 below that of the Sun to 
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Fig. 2. Great globular cluster in Hercules, Messier 13, photographed with 200-in. (500-cm) 
telescope. (California Institute of Technology, Palomar Observatory) 
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about 2 times above. Open clusters in the outer part 
of the plane of the Milky Way Galaxy tend to have 
lower heavy-element abundances. 

Stellar associations. These are very low density 
stellar ensembles of young stars, first named by 
V. A. Ambartsumian in 1947. About 80 associa- 
tions are cataloged, 8 of which contain Cepheid 
variables. Associations are no longer gravitationally 
bound, probably because of expulsion by strong stel- 
lar winds and supernovae of large amounts (over 
50%) of gas and dust out of which the stars formed. 
Open clusters often appear as subconcentrations 
within some associations. The double cluster h and 
x Persei (Fig. 1) lies within the Perseus OB1 associ- 
ation. See SUPERNOVA. 

Globular clusters. Generally more distant, more 
massive, older, and more deficient in heavy elements 
than open clusters, globular clusters are dispersed all 
over the sky, but with a strong concentration in the 
direction of the galactic center (Sagittarius). In 1917, 
H. Shapley estimated distances to globulars to infer 
the distance to the galactic center. A total of 153 glob- 
ular clusters are known, some of which are visible to 
the unaided eye, such as M13 in Hercules and espe- 
cially mw Centauri. One to three dozen more clusters 
may be awaiting discovery behind interstellar obscu- 
ration. A significant fraction of globular clusters may 
have been formed in dwarf galaxies that have been 
absorbed by the Milky Way Galaxy. The Sagittarius 
dwarf galaxy is in the process of being absorbed, 
and several globular clusters are or have been part 
of it. The globular cluster M54 may be the Sagittarius 
dwarf galaxy’s nucleus. See GALAXY, EXTERNAL. 

Distances and dimensions. The nearest globular (M4 in 
Scorpius) lies about 2000 parsecs away, while the 
most distant ones in the Milky Way Galaxy, AM-1 and 
Palomar 3, lie over 100,000 parsecs from the Sun. 
The distances are estimated by photometric means 
since they are too distant for geometric methods. RR 
Lyrae variable stars, often found in globulars, are a fre- 
quently used “standard candle” to estimate their dis- 
tances. Globular-cluster diameters range from about 
10 to about 100 parsecs. The typical globular cluster 
contains about 100,000 stars, but the range is from 
1000 (AM-4) to over 3 million (w Centauri). 

Spectral characteristics. The brightest main-sequence 
stars in globulars are yellow G stars, signifying that 
all such clusters in the Galaxy are old. Red giants are 
the brightest stars. Some clusters contain RR Lyrae 
variables, and some contain a few blue stragglers, 
stars hotter and more luminous than the brightest 
and hottest normal main-sequence stars. Some glob- 
ular clusters contain a type of cepheid variable, but 
they differ from those found in open clusters and are 
not frequently used as distance indicators. See BLUE 
STRAGGLER STAR; VARIABLE STAR. 

Structure. Some globular clusters have low central 
concentrations; other have very high values, approa- 
ching 100,000 stars per cubic parsec. Some show 
signs of having experienced what is known as core 
collapse. As the stars at the outer edges evaporate 
from the cluster, the central regions contract, den- 
sities become extremely high, and binary star sys- 


tems form. About 20% of the globular clusters have 
experienced core collapse. The high frequencies of 
blue stragglers in some cluster cores, and the pres- 
ence of millisecond pulsars (17 known in nine clus- 
ters) may be related to this dynamical evolution of 
the cluster. Heavy stellar remnants such as neutron 
stars are probably also present nearer the center. 
Bright x-ray sources may result from binary star trans- 
fer of mass onto a neutron star. Globular clusters 
are unusually rich in such sources, and all are near 
the cluster centers. See ASTROPHYSICS, HIGH- 
ENERGY; BINARY STAR; NEUTRON STAR; PULSAR; X-RAY 
ASTRONOMY. 

Chemistry. Heavy-element abundances in globular 
clusters cover a wide range, from 50% above solar to 
300 times below solar. There seem to be two chemi- 
cally distinct classes of globular clusters. The metal- 
deficient class is typically 30 times poorer in heavy 
elements than the Sun, while the metal-rich class is 
typically about 3 times more deficient. Those associ- 
ated with the Sagittarius dwarf galaxy have different 
element-to-iron abundance patterns than most other 
globular clusters. 

Motions. The metal-deficient globulars have very 
high velocities relative to the Sun, and so do not be- 
long to the disk of the Milky Way; they are found in a 
spherical distribution around the galactic center. The 
more metal-rich globulars have motions characteris- 
tic of disk stars and clusters, and are found within a 
few kiloparsecs of the plane. The metal-rich clusters 
are all within 10,000 parsecs of the galactic center. 

Ages. All globular clusters in the Milky Way are very 
old, typically about 1.2 x 10!° years. There may be 
a modest age spread of 3 x 10° years. The globular 
clusters are a primary method of estimating the lower 
limit to the age of the galaxy and of the universe. This 
may be compared to the age derived by the Wilkin- 
son Microwave Anisotropy Probe (WMAP) mission 
and the expansion rate derived from studies of extra- 
galactic Cepheids. See COSMIC BACKGROUND RADIA- 
TION; WILKINSON MICROWAVE ANISOTROPY PROBE. 

Clusters in other galaxies. Giant associations are 
known in spiral galaxies (such as M31 and M33) and 
irregular galaxies (such as Magellanic Clouds). Large 
open clusters are also known in spiral and irregular 
galaxies throughout the Local Group. Globular clus- 
ters are known in most galaxies, including ellipti- 
cals, throughout the Local Group and in galaxies out 
to disances of 10° parsecs. The galaxy M87 has the 
largest number (6000 counted, 16,000 estimated). 
Generally all extragalactic globular clusters are old, 
like those in the Milky Way, although there is a class 
of blue populous clusters that are young, roundish, 
and fairly massive (10,000 stars) in some star-forming 
galaxies. See ANDROMEDA GALAXY; LOCAL GROUP; 
MAGELLANIC CLOUDS; STAR. Bruce W. Carney 
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Star tracker 


A device that automatically measures the angular sep- 
aration of stellar observations with respect to a refer- 
ence platform. It is also referred to as an as-trotracker. 

By utilizing the star tracker in conjunction with 
a precise time reference (chronometer) and a dead- 
reckoning device consisting of gyroscopes and ac- 
celerometers (inertial navigator), a digital computer 
can correct many of the inertial navigator errors so 
that precise, autonomous (free from any radio po- 
sition aids), terrestrial navigation can be achieved. 
The major errors corrected by the star tracker are in- 
troduced by the inertial navigator’s gyroscopes that 
result in attitude deviations. In this configuration, 
called a stellar inertial system, very high precision 
aircraft autonomous navigation is achieved. Since 
the availability of radio position aids poses no prob- 
lem for commercial aircraft, stellar inertial naviga- 
tion technology is applied only on military vehicles. 
These navigation devices are used when radio posi- 
tion aids, such as Loran and the Global Positioning 
System (GPS), may be unavailable. See CHRONOME- 
TER; ELECTRONIC WARFARE; INERTIAL GUIDANCE SYS- 
TEM. 

Star trackers are also utilized extensively for 
both military and nonmilitary applications on space 
probes, space-based interceptors, and satellites. In 
these applications the precise attitude capabilities 
of these devices provide the fiducial precision ref- 
erence for pointing of the vehicle and Earth or 
planet sensors. On space missions, star trackers are 
the only sensor presently available that can provide 
arc-second attitude accuracy. See SPACE NAVIGATION 
AND GUIDANCE. 

Stellar visibility. Terrestrial star trackers must be 
capable of making star measurements at sea level and 
in daylight. Cloud cover represents an impediment 
to terrestrial stellar observations. The probability of 
seeing a star at any given time on the Earth at sea level 
is 50%. At altitude greater than 45,000 ft (13.7 km), 
which is above the clouds, stellar observations are al- 
ways available. This guaranteed availability of stellar 
observations at high altitudes has led to these devices 
being used on high-altitude aircraft (such as recon- 
naissance vehicles), ballistic missiles, satellites, and 
space probes. 

An example of a terrestrial star tracker application 
is the stellar inertial navigation system of the U.S. Air 
Force’s B-2 stealth bomber. When the B-2 is flying at 
altitudes above 45,000 ft, the inertial navigation sys- 
tem is continually updated by stellar observations. 
When it is flying a terrain-following, low-altitude 
mission, the star tracker is capable of making stel- 
lar observations 50% of the time and using these ob- 
servations to correct the inertial navigation system’s 
gyroscopic errors. Thus, the system is not dependent 
on any radio navigation aids which may be rendered 
unavailable in a wartime scenario; in particular, it is 
not susceptible to radio jamming of the Global Po- 
sitioning System (GPS) or natural electromagnetic 
disturbances causing GPS outages. See MILITARY 
AIRCRAFT; SATELLITE NAVIGATION SYSTEMS. 


Mounting. A gimbaled stellar inertial system is 
mounted on the stable element of an inertial navi- 
gator. The tracking system measures the telescope 
azimuth rotation angle and elevation angle to the ac- 
quired star with respect to a stable inertial reference. 
The tracker is programmed to observe different, 
widely angular separated stars in order to achieve 
accurate operation. The deviation of the measured 
stellar observations from their ideal stellar positions 
is utilized to enhance the performance of the inertial 
navigator. 

The preponderance of modern systems do not 
contain gimbals. These systems are completely 
strapped down (that is, with no moving parts) and 
observe stars at random as the stars come into the 
rigid, fixed star tracker’s field of view. Modern star 
trackers can operate in two modes. The first dost in 
space) uses sophisticated star pattern matching al- 
gorithms to determine initial position of the vehicle. 
The second uses its knowledge of the telescope’s 
location and direction to know which stars should 
be in the field of view and where their images are lo- 
cated on the star tracker’s optical detector (Fig. 1). In 
order to make a three-axis attitude correction, there 
should be a least two stellar observations ideally sep- 
arated by 90°. Smaller angular separations lead to a 
dilution of attitude correction precision. The multi- 
ple measurements need not be performed simultane- 
ously since the star tracker corrects the attitude of 
each axis based on the stellar observations available. 
Thus, these strapdown startracker systems consist 
of either multiple telescopes with moderate fields 
of view or a single telescope with a very wide field 
of view for proper operation. See CELESTIAL NAVIGA- 
TION. 

Processor functions. Digital computers, sometimes 
referred to as processors, perform many functions in 
a stellar inertial system. The computer has stored in 
its memory the ideal locations of the stars in a catalog 
called the ephemeris. Depending on the ephemeris 
used for the star tables, it usually stores the color- 
corrected star magnitude, declination, and right as- 
cension. Since the declination and right ascension 
change slightly during a year, these parameters are 
periodically altered. In some systems the proper mo- 
tions of the stars are stored in the star catalog in 
order to minimize the catalog updates. The catalog 
deliberately culls out stars that are located close to 
each other, such as binary stars, in order to elim- 
inate stellar observation ambiguity. Using accurate 
time supplied to the computer from a chronometer, 
the processor indexes the ephemeris with the appar- 
ent position of the platform so that it can determine 
which stellar observations will be processed. See AL- 
MANAC; DIGITAL COMPUTER; EPHEMERIS. 

For a gimbaled star tracker, pointing and tracking 
commands are issued to precisely direct the tele- 
scope at a star. Since the strapdown star tracker has 
no gimbals to point the telescopes, the computer 
indicates the ideal locations of the stars that should 
be in the field of view of the fixed telescopes, along 
with their color-corrected brightness. The color cor- 
rection in the ephemeris modifies the brightness 
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Fig. 1. Strapdown satellite star tracker. (a) Mounting of 
system on the satellite. (b) Star tracker in housing. 
(Northrop Grumman Corp.) 
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according to the responsivity of the telescope and its 
optical detector. Because of the lack of a pointing ca- 
pability, the strapdown star-tracker ephemeris must 
contain as many as 10,000 or more stars, whereas the 
star tracker in the gimbaled, or pointing, system may 
contain only a table of less than 70 of the brightest 
stars. Thus, the strapdown star-tracker ephemeris is 
far more extensive than the gimbaled one, but the 
complexity and cost are more than offset by the re- 
duction in cost, parts, size, and weight as well as 
enhanced reliability gained by the elimination of the 
gimbaled system’s moving parts and gimbal shaft an- 
gular encoder readouts. 

The computer processes the stellar data and 
chronometer data to arrive at the measured attitude. 
It then processes any known misalignments and com- 
pares the corrected measurements with the inertial 
estimate of attitude to provide enhanced attitude per- 
formance. Care is taken so that no star tracking is 
performed on stars in the vicinity of the Sun, Moon, 


or zodiacal light. This is done to prevent their high ir- 
radiance from corrupting the detection of the much 
weaker stellar irradiance signal. See ZODIACAL LIGHT. 

Detectors. The preponderance of star-tracker pho- 
todetectors used to measure the stellar irradiance 
are solid-state, silicon semiconductor photosensors. 
These devices have their peak responsivity in the 
near-infrared region (0.6-1.0 micrometer). Imaging 
focal-plane arrays can have several million photo- 
sensors on a single semiconductor silicon chip ar- 
ranged in a rectangular grid matrix (Fig. 2). Each 
of the individual photosensors on the focal-plane 
array is called a pixel. The devices are similar to 
the photosensors that have found wide utilization 
in home video-camera recorders. Two types of focal- 
plane arrays used for star trackers are charge-coupled 
devices (CCD) and charge-injection devices (CID). 
These rectangular devices are less than 0.125 in. 
(3 mm) thick and are approximately 0.25 in. ( mm) 
per side for a 65,000-pixel sensor to 1 in. (25 mm) 
per side for a megapixel sensor. The advantages of 
the imaging photosensor can be illustrated by ap- 
plying it to a standard 4° field-of-view star-tracker 
telescope. For a star tracker with a star magnitude 6 
capability (some strapdown star trackers have the ca- 
pability of measuring stars as dim as magnitude 10), 
on the average, there will be over three stars on the 
photosensor image plane. Thus, these imaging focal- 
plane arrays can process multiple stars with each 
stellar observation. The silicon focal-plane arrays are 
also more reliable than earlier photodetectors based 
on photomultipliers and image dissector electron 
tubes, and occupy only a small fraction of their vol- 
ume. See CHARGE-COUPLED DEVICES; IMAGE TUBE (AS- 
TRONOMY); MAGNITUDE (ASTRONOMY); PHOTOMUL- 
TIPLIER; TELEVISION CAMERA. 

Accuracy. Star-tracker accuracy ranges from sub- 
arcsecond to 30 arcseconds. For terrestrial astro- 
inertial navigation, which must track stars in day- 
light at sea level, the predominant noise source is the 
sky background. The sky background noise is usually 
mitigated by large telescope apertures and by mini- 
mizing the field of view. In the aircraft applications 
of astro-inertial navigation systems, there are num- 
ber of other error sources besides the star tracker 
accuracy, such as accelerometer noise, that degrade 
the system’s performance. For these applications a 
star-tracker accuracy of 1 arcsecond is compatible 
with providing a latitude and longitude navigation 
accuracy of less than 600 ft (180 m) for mission 
flight times of up to 12 hours, although 1 arcsecond 
of the Earth’s circumfereance is only about 100 ft 
(0 m). On the opposite end of the spectrum are 
the space trackers used for satellites and deep-space 
probes. These space-probe trackers do not contend 
with sky background noise when the stellar obser- 
vation is sufficiently displaced from looking through 
a planet’s atmosphere. Also, no star tracking is per- 
formed with the telescope looking in the vicinity of 
the Sun or Moon. This is done by selecting the lo- 
cation of the tracker with respect to its orbit or by 
providing multiple trackers to assure that the dark- 
ness of space provides the background illumination 






Fig. 2. Silicon focal-plane arrays. (Northrop Grumman Corp.) 


of the telescope that is tracking stars. The accuracy of 
these devices ranges from 30 arcseconds for commu- 
nications satellites to sub-arcsecond trackers that are 
used on reconnaissance satellites and space probes 
which track scientific targets. The high precision is 
required on reconnaissance satellites operating in 
high Earth orbits (altitudes of thousands of miles) 
to steer Earth-observation sensors and to report the 
location of terrestrial regions of interest. In the re- 
connaissance and mapping satellite applications, the 
angular accuracy of the satellite’s star tracker repre- 
sents a pointing location error for objects located 
on the surface of the Earth. For a 1l-arcsecond star 
tracker the terrestrial location reporting errors of ob- 
jects range from 30 ft (9 m) for a 1000-nautical-mile- 
altitude (1850-km) low-Earth-orbit (LEO) satellite to 
600 ft (180 m) for a 19,325-nautical-mile-altitude 
(35,800-km) geosynchronous- or geostationary-orbit 
(GEO) satellite. See MILITARY SATELLITES. 
Seymour Levine 
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Strapdown astro-inertial navigation utilizing the opti- 
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37(4):347-362, Winter 1990-1991; International So- 
ciety of Optical Engineering (SPIE), Space Guidance, 
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lando, Florida, April 12-14, 1993; M. Kuritsky and 
M. Goldstein, Inertial navigation, Proc. IEEE, 
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Starburst galaxy 

A galaxy that is observed to be undergoing an unusu- 
ally high rate of formation of stars. It is often defined 
as a galaxy that, if it continues to form stars at the 
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currently observed rate, will exhaust its entire sup- 
ply of star-forming material, the interstellar gas and 
dust, in a time period that is very short compared 
to the age of the universe. For a typical starburst 
galaxy, this gas and dust exhaustion time scale is less 
than 108 years, that is, less than 1% of the age of the 
universe. Since such a galaxy must shortly run out 
of star-forming material, the high star formation rate 
currently observed not only must end soon but also 
must have started relatively recently or the gas sup- 
ply would have run out long ago. It follows that such 
galaxies must be undergoing a passing burst of star 
formation. The rates of star formation that are esti- 
mated to occur in starbursts can exceed 100 times 
the mass of the Sun per year for the most energetic 
bursts observed in the nearby universe. This may be 
compared to about 2-3 solar masses per year in the 
Milky Way Galaxy. However, there are large uncer- 
tainties in such estimates because there is only weak 
evidence on the range of masses of stars that form 
in starburst events. There are theoretical limits to 
the total rate of star formation that can occur in a 
starburst. One limit is derived from the mass of all 
the available interstellar material turning into stars 
during the time taken for that material to fall inward 
under gravity to form a typically sized galaxy. An- 
other limit is related to the expectation that when 
large numbers of newly formed massive stars reach 
supernova stage, they will blow away much of the 
remaining star-forming material, thus inhibiting the 
burst. 

The term “starburst,” coined in the 1981 by Daniel 
W. Weedman and his collaborators, usually implies 
that the burst of star formation is occurring in 
the nuclear regions of the galaxy, the term first 
being used to describe a sample of luminous spiral 
galaxies with bright pointlike nuclei. The prototype 
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starburst nucleus is NGC 7714. Indeed, it has been 
confirmed by extensive studies that starbursts are 
very frequently confined to the inner approximately 
1000-light-year-diameter regions of massive galaxies. 
However, there exist galaxies that meet the defi- 
nition of short gas-exhaustion time scale but ex- 
hibit more galaxy-wide star formation. These include 
extragalactic H II regions, clumpy irregular galax- 
ies, blue compact dwarf galaxies, and the nearest, 
best-studied starburst galaxy, M82 (Colorplate 1). 
The related term “starburst region” is also now 
often used to refer to large regions within galax- 
ies where exceptionally energetic star formation is 
occurring. 

Radiative emission. Starburst galaxies often appear 
blue because of the dominance of their energy out- 
put by hot massive stars. Many blue starburst galaxies 
were discovered in the extensive ultraviolet-sensitive 
objective prism surveys of B. E. Markarian. Most star- 
burst galaxies are also strong emitters of mid- and 
far-infrared radiation, because the newly formed stars 
illuminate their natal clouds of dust and gas with ul- 
traviolet and optical radiation, and as the dust warms, 
it reradiates this energy at far-infrared wavelengths. 
The Infrared Astronomical Satellite RAS), which 
surveyed the entire sky at far-infrared wavelengths 
in 1983, discovered over 100,000 galaxies that emit 
strongly at 60 micrometers, the majority of which 
are likely to be starburst galaxies. The most lumi- 
nous of these galaxies are commonly termed ultra- 
luminous infrared galaxies (ULIRGs). More detailed 
and more sensitive mapping in the mid-infrared with 
the Infrared Space Observatory USO), which oper- 
ated from 1995 to 1998, helped to confirm that a 
large fraction of the newest star formation activity 
in starburst galaxies is hidden from view in the opti- 
cal waveband by the dense, dusty molecular clouds 
which spawn the new stars. The Spitzer Space Tele- 
scope has been able to build on the foundations laid 
by the IRAS and ISO observatories, imaging starburst 
galaxies in the mid- and far-infrared with unprece- 
dented sensitivity and detail (Colorplate 2). Only 
the slightly older young star complexes (a few mil- 
lion years) are clearly visible in the optical as bright 
blue complexes of stars and hot gas; these stars have 
had time to move away from their nurseries, or ex- 
pel their natal clouds by stellar wind or supernova 
wind pressure. One of the most spectacular results 
from Spitzer is the multicolor imaging of starbursts, 
which show regions with intense emission from 
large molecules, probably polycyclic aromatic hydro- 
carbons (PAHs). This emission, usually colored red in 
the Spitzer false color images, dramatically highlights 
regions where the molecules are excited to shine 
by strong shocks occurring in star-forming regions. 
See INFRARED ASTRONOMY; INTERSTELLAR MATTER; 
MOLECULAR CLOUD; PROTOSTAR; SPITZER SPACE TELE- 
SCOPE. 

Strong millimeter-wavelength emission lines of 
carbon monoxide, often highly concentrated to the 
nuclear regions, have also shown that starburst galax- 
ies are rich in molecular gas, the dense, relatively 
cool material in which star formation takes place. 


In several galaxies there is evidence that these nu- 
clear molecular clouds are disklike in shape, rotating 
around the galactic nucleus. At radio wavelengths, 
starburst galaxies show emission that is strongly cor- 
related in strength with the far-infrared emission. 
Supernovae are common in starbursts because the 
most massive young stars have very short lives that 
end spectacularly by supernova. It is thought that 
the radio emission from starbursts arises from the ef- 
fects of the supernova explosions on the surround- 
ing interstellar medium. Extremely high resolution 
observations at radio wavelengths using the tech- 
niques of very long baseline interferometry (VLBI 
have demonstrated the detection of individual young 
and highly luminous radio supernovae in a small 
number of galaxies, including one of the most ener- 
getic local ultraluminous infrared galaxies, Arp 220. 
In the submillimeter wavelength range, polarized 
emission—emission which has a preferred orienta- 
tion for the light waves to oscillate in—has been 
found in Messier 82, thought to be caused by strong 
magnetic fields within the galaxy. The revealed ori- 
entation of the magnetic field lines indicates that the 
fields may help drag interstellar material into the cen- 
ter of the system. In the x-rays, starburst galaxies are 
weak compared to active galactic nuclei, and the x- 
ray emission that they do radiate is thought to come 
from superwinds and from binary-star radiation pro- 
cesses such as seen in the Milky Way Galaxy. See 
BINARY STAR; POLARIZATION OF WAVES; RADIO AS- 
TRONOMY; STELLAR EVOLUTION; SUPERNOVA; X-RAY 
ASTRONOMY. 

Spectrum. The existence of a starburst in a galaxy 
is often deduced from the presence of strong, 
narrow emission lines in the optical spectrum. 
These lines originate in hot gaseous nebulae, called 
H II regions, which form around the hot young stars. 
Stronger proof of the existence of a large population 
of hot young stars in a galaxy comes from the detec- 
tion in the optical spectrum of broad emission lines 
of helium, characteristic of very massive and short- 
lived Wolf-Rayet stars. Over 130 Wolf-Rayet galaxies 
are known. In a small number of galaxies, the 
ultraviolet absorption features caused by hot young 
stars have been detected by the International Ul- 
traviolet Explorer UE) satellite. Starbursts can also 
be recognized spectroscopically in the mid-infrared 
by the detection of broad emission features from 
PAH molecules, using the Infrared Spectrograph on 
board Spitzer. See ASTRONOMICAL SPECTROSCOPY; 
NEBULA; ULTRAVIOLET ASTRONOMY; WOLF-RAYET 
STAR. 

Superwinds and stellar smoke. Energetic outflows 
of hot material have been discovered along the mi- 
nor axes of many nearby starburst galaxies. Revealed 
in x-rays, emission lines of hydrogen, and interstel- 
lar absorption lines, these so-called superwinds are 
believed to be ejected by the combined energy of 
many supernova explosions and stellar winds in the 
starburst. Outflow wind velocities seem to be in the 
range 400-800 km/s (250-500 mi/s), and the rate 
of gas expulsion can be tens of solar masses per 
year, perhaps even exceeding the rates at which 


gas is being converted into stars inside the host 
galaxies. These results have implications for the de- 
velopment over time of the heavy-element enrich- 
ment of the gas, both within galaxies and in the 
intergalactic medium. For example, they may help 
to explain the observation that galaxy spheroids of 
larger mass tend to have stars that are more en- 
riched in heavy elements than the stars in lower-mass 
spheroids. Accompanying the hot x-ray wind in the 
nearest starburst galaxy, M82, Spitzer has discovered 
great plumes of cool “stellar smoke,” revealed in mid- 
infrared images sensitive to dust emission and to the 
PAH molecule (Colorplate 1). 

Super star clusters. In some starburst galaxies, im- 
ages from the Hubble Space Telescope indicate the 
existence of super star clusters. These are knots 
of emission thought to be extraordinarily luminous 
and compact clusters of young stars, possibly young 
analogs of globular clusters. Some of the most beau- 
tiful examples lie along the extended tails of inter- 
acting galaxy systems, like “strings of pearls.” See 
HUBBLE SPACE TELESCOPE; STAR CLUSTERS.. 

Causes of starburst episodes. There are several the- 
ories as to why starbursts occur. One likely cause is 
the interaction between two galaxies as they pass 
close to or collide with one another. The tidal forces 
generated result in shock-wave compression of the 
interstellar material, loss of angular momentum and 
infall of material into the central regions of the galaxy, 
and star formation in the compressed clouds. Many 
starburst galaxies show evidence of interactions, in- 
cluding distorted appearance and long wispy tails 
of material. Not all starbursts can be due to inter- 
actions, however, since some display no evidence 
for any recent disturbance. Other mechanisms that 
are thought responsible for high star-formation rates 
in galaxies are very strong spiral density waves and 
central bar instabilities. See GALAXY FORMATION AND 
EVOLUTION. 

Starbursts in the early universe. In the local uni- 
verse, starbursts are rare events that contribute less 
than 1% of the total amount of energy radiated by 
all galaxies, so they may seem exotic and interesting 
but not very important. However, dramatic obser- 
vational breakthroughs made possible by the Hub- 
ble Space Telescope, the Keck Telescopes on Mauna 
Kea in Hawaii, the Cosmic Background Explorer 
(COBE) satellite, the Submillimeter Common User 
Array (SCUBA) on the James Clerk Maxwell Tele- 
scope in Hawaii, and the Spitzer Space Telescope 
have demonstrated that star formation rates in galax- 
ies were probably much higher at very early times 
in the universe than they are now. In particular, ev- 
idence has been found that ultraluminous infrared 
galaxies were much more common and much more 
important at earlier times in the history of the uni- 
verse than now. It is not known whether starbursts 
are responsible for the tremendous energy of these 
distant, young, highly infrared-luminous galaxies; if 
they are, then, starburst events may even have domi- 
nated the processes of galaxy building long ago, and 
much of the total energy radiated by early galax- 
ies may be detectable only at infrared or submil- 


limeter wavelengths because ultraviolet and optical 
light would have been very strongly obscured by the 
dense clouds of gas and dust involved in the star- 
burst. See SUBMILLIMETER ASTRONOMY; TELESCOPE; 
UNIVERSE. 

Relationship to other objects. Nuclei of starburst 
galaxies resemble active galactic nuclei in being 
bright and pointlike. However, active galactic nu- 
clei are believed to differ from starbursts in harbor- 
ing a central massive black hole, since they display, 
among other phenomena, an emission-line spectrum 
that is excited by a more powerful source of ultra- 
violet energy than can be provided by young stars. 
Many galaxies display the characteristics of both star- 
bursts and active galactic nuclei, with a ring of star 
formation surrounding an active galactic nucleus. It 
is very likely that there exists a physical relation- 
ship between starbursts and the events leading to 
active galactic nuclei because both are frequently 
seen in the same galaxy, but the details of the con- 
nection remain to be fully explained. Possibilities in- 
clude the following: (1) a galaxy-galaxy interaction 
triggers both phenomena by channeling gaseous ma- 
terial into the central regions of the galaxy; (2) the 
starburst event promotes the formation of a nuclear 
massive black hole; (3) the starburst event promotes 
the fueling of an existing, but previously quiescent, 
nuclear massive black hole by providing it with fuel; 
and (4) an active galactic nucleus turns on first, and 
then triggers a starburst in the surrounding interstel- 
lar medium. See ASTROPHYSICS, HIGH-ENERGY; BLACK 
HOLE; GALAXY, EXTERNAL. Carol J. Lonsdale 
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ET 
Starch 


A carbohydrate that occurs as discrete, partially crys- 
talline granules in the seeds, roots (tubers), stems 
(pith), leaves, fruits, and pollen grains of higher 
plants. Starch functions as the main storage or re- 
serve form of carbohydrate; it is second in abun- 
dance only to cellulose, a major structural compo- 
nent of plants. Cereal grains, tuber and root crops, 
and legumes (seeds) have long been used as major 
sources of carbohydrate in human diets. See CELLU- 
LOSE. 

Starch is isolated commercially from the follow- 
ing sources: cereal grain seeds [maize (corn), wheat, 
rice, sorghum], roots and tubers [potato, sweet 


Starch 


363 


364 


Starch 


Fig. 1. Cornstarch as seen by (a) scanning electron 
microscope, (b) light microscope, and (c) polarized light. 
(From R. L. Whistler, E. F. Paschall and J. N. BeMiller, eds., 
Starch: Chemistry and Technology, 2d ed., Academic, 1984) 


potato, tapioca (cassava), arrowroot], and stems and 
pith (sago). Cereal grains are often steeped first to 
loosen the starch granules in the endosperm ma- 
trix, followed by wet grinding or milling. Roots 
and tubers are ground to give a suspension con- 
taining starch granules. Then follows screening 
or sieving, washing, centrifuging, dewatering, and 
drying. 

Characteristics. Starch, normally a white powder 
of 98-99.5% purity, is insoluble in cold water, 
ethanol, and most common solvents. Starch is stained 
ablue to reddish-purple color, depending on the amy- 
lose content, by dilute aqueous iodine. 

The size and shape of starch granules depend on 
the botanical source of the starch. Rice starch, a 





small-granule starch, has polygonal granules ranging 
in size from 3 to 5 micrometers, while potato starch, 
a large-granule starch, has ellipsoidal or oval gran- 
ules from 15 to 120 wm. Normal cornstarch has 
spherical or polygonal granules from 5 to 30 um 
(Fig. 1). When viewed through a microscope under 
polarized light, starch granules are birefringent and 
show a characteristic “maltese cross” (Fig. 1c). 
Starch granules have characteristic x-ray diffraction 
patterns and have been estimated to be 20-50% 
crystalline. 

Structure and architecture. Starch, a polymer of 
glucose, is an alpha-glucan, predominantly contain- 
ing alpha-1,4-glucosidic linkages with a relatively 
small amount of alpha-1,6-glucosidic linkages form- 
ing branch points (Fig. 2). Two major polymeric 
components are present: amylose and amylopectin. 
See GLUCOSE. 

Amylose, long thought to be a linear polymer, con- 
tains a limited amount of long-chain branching. Nor- 
mal corn amylose contains about 900-1000 glucose 
units and appears to be a mixture of nearly equal 
numbers of large branched (5.3 chains on average) 
molecules and small unbranched molecules. In many 
respects amylose behaves like a linear polymer. In 
aqueous solution, it complexes with iodine to yield 
a deep blue color and associates with itself (retro- 
grades) to form precipitates or gels. 

Amylopectin is a high-molecular-weight (107-10°) 
branched polymer with an average chain length of 
20-26 glucose units. The structure is composed of 
clusters of branch points with individual chains in 
each cluster containing 12-16 glucose units. Some 
longer chains extend through more than one cluster. 
Association of chains in amylopectin clusters forms 
the crystalline regions of the granule. The branch 
points are located in the less organized, amorphous 
regions of the granule. 

The amounts of amylose and amylopectin vary 
with the source of the starch. Common starches 
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Fig. 2. Structures involved in starch. (a) Linear chain of 
alpha-1,4-glucan. (b) Alpha-1,6 branch point. (After Y. 
Pomerzanz, ed., Wheat: Chemistry and Technology, 2d ed., 
American Association of Cereal Chemists, 1971) 


contain 15-30% amylose, the remainder being amy- 
lopectin. Special hybrids, with altered ratios of amy- 
lose to amylopectin, have been produced. Waxy 
starches from maize (commercially available), barley, 
rice, and sorghum contain only amylopectin. High- 
amylose maize starches containing 50-70% amylose 
are available. 

Gelatinization. As a dilute aqueous suspension of 
starch is heated above 50°C (122°F), swelling of the 
granule begins in the less organized, amorphous re- 
gions with loss of birefringence and crystallinity. This 
is known as gelatinization and occurs over an inter- 
val of about 10°C (18°F) within the range of 55-75°C 
(131-167°F) for most common starches. As the tem- 
perature increases, irreversible swelling occurs, the 
crystalline areas undergo disorganization, and amy- 
lose leaches from the granule. When a sufficiently 
high temperature is reached (cooking, pasting), gran- 
ule structure is destroyed, with apparent dissolution 
of the amylose and amylopectin components. The 
increase in viscosity occurring during gelatinization 
is an important property of starch, resulting in its 
common use as a thickener in foods. 

Usein food systems. Starches are involved in impor- 
tant roles in foods, either naturally occurring in an 
ingredient or added to achieve a desired functional 
characteristic. Often the desired functional charac- 
teristic (thickening; gelling; adhesive; binding; im- 
proving acid, heat, and shear stability) cannot be 
achieved by using a native starch. Starches may be al- 
tered physically, chemically, or enzymatically to pro- 
duce modified starches with improved functional 
properties. 

Modified starches, commonly known as starch 
derivatives, are controlled in the United States by 
Food and Drug Administration (FDA) regulations. 
Modification of starch may involve a change in phys- 
ical form (pregelatinization), controlled degradation 
(dextrinization, oxidation, acid or enzyme modifica- 
tion) or chemical reaction (oxidation, substitution, 
cross-linking). Most modifications are accomplished 
in aqueous suspensions under conditions in which 
the granule is not destroyed. Often, only minor mod- 
ification is required to generate significant changes 
in starch properties. Modified starches can be pro- 
duced from any starch, but the common commercial 
sources in the United States are normal maize, waxy 
maize, tapioca, and potato starches. See FOOD. 

Starches are modified for the following reasons: to 
improve the resistance of swollen granules to acid, 
heat, and mechanical processing conditions; to elim- 
inate unwanted gelling characteristics; to reduce or 
increase starch gelatinization temperature; to gener- 
ate freeze-thaw stability; and to maximize, reduce, 
or inhibit viscosity. 

Modified starches that produce solutions of lower 
viscosity are obtained by (1) treating aqueous sus- 
pensions of starch with dilute acid below the 
gelatinization temperature (acid-thinned starches); 
(2) treating a gelatinized starch slurry with an 
alpha-amylase (enzymatically modified starch), com- 
mercially accomplished in a continuous process; 
©) heating (roasting) dry starch with small amounts 


of acid (dextrinization) to produce dextrins (py- 
rodextrins, British gums, white dextrins, yellow dex- 
trins) that often have a pale yellow to tan color; or 
(4) oxidizing starch in an aqueous suspension, which 
results in the introduction of carboxyl and carbonyl 
groups and partial depolymerization of the starch. 
Each of these processes produces starch derivatives 
with the desired altered properties, for example, vis- 
cosity, paste clarity, or gel-forming ability, for differ- 
ent uses. 

Cross-linked starch derivatives are produced by 
treating an aqueous suspension of starch granules 
with cross-linking reagents. The resulting modi- 
fied starches have improved cooking characteristics 
chigher gelatinization temperatures); increased sta- 
bility to acid, heat, or shear, with resulting control 
of viscosity during processing; and improved tex- 
tural characteristics of starch pastes. Starch ethers 
or esters (substituted starches) are formed by treat- 
ing an aqueous suspension of starch with ap- 
propriate reagents in the presence of an alkaline 
catalyst. These stabilized starch derivatives have 
freeze-thaw stability and a reduced tendency to un- 
dergo syneresis (formation of water on the surface 
of a gel or paste due to retrogradation). 

Glucose (dextrose), glucose syrups, maltodex- 
trins, maltose syrups, and high-fructose corn syrup 
are obtained from starch by using acid or enzymes. 
Enzymes are used industrially with increasing fre- 
quency because of the mild conditions required for 
processing and the range of products that can be ob- 
tained. High-fructose corn syrup finds extensive use 
as sweetener, particularly in soft drinks. See ENZYME; 
FOOD MANUFACTURING; FRUCTOSE; MALTOSE. 

Nonfood uses. The paper, textile, adhesive, chem- 
ical, pharmaceutical, and polymer industries use 
starch and starch derivatives. Organic acids and or- 
ganic solvents for use as chemical intermediates, 
enzymes, hormones, antibiotics, and vaccines are 
industrially produced from starch. See CARBOHY- 
DRATE. David R. Lineback 
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Stark effect 


The effect of an electric field on spectrum lines. The 
electric field may be externally applied, but in many 
cases it is an internal field caused by the presence of 
neighboring ions or atoms in a gas, liquid, or solid. 
Discovered in 1913 by J. Stark, the effect is most eas- 
ily studied in the spectra of hydrogen and helium, by 
observing the light from the cathode dark space of 
an electric discharge. Because of the large potential 
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Fig. 1. Stark effect for helium lines. (a) The 414.4- and 
416.9-nm lines at field strengths 0-85,000 V/cm. Note the 
disappearance of 'P-'H line at a certain field strength, and 
symmetry of the s components (those with polarization 
perpendicular to the field), indicated by dots. (b) The 
492.2-nm line at 0-40,000 V/cm. Note the crossing-over of 
two 'P-'F components. (J. S. Foster and C. Foster, McGill 
University) 


drop across the region, the lines are split into sev- 
eral components. For observation perpendicular to 
the field, the light of these components is linearly 
polarized. The splitting can be easily resolved with a 
spectrograph of moderate dispersion, amounting in 
the case of the H, line at 434 nanometers to a total 
spread of 3.2 nm for a field of 10,400 kV/m. 

Linear Stark effect. This effect exhibits large, nearly 
symmetrical patterns. Examples are shown in Fig. 1, 
where z and o refer to the two states of polariza- 
tion. The interpretation of the linear Stark effect 
was one of the first successes of the quantum the- 
ory. According to this theory, the effect of the elec- 
tric field on the electron orbit is to split each en- 
ergy level of the principal quantum number n into 
2n — 1 equidistant levels of separation proportional 
to the field strength. Thus the higher members of 
a series show a larger number of components and 
greater overall splittings. The criterion for the occur- 
rence of the linear Stark effect is that the splitting of 
the levels shall be large compared to the natural sep- 
aration of levels of the same n but different L, where 
L is the quantum number of orbital angular momen- 
tum. See QUANTUM NUMBERS; ZEEMAN EFFECT. 

Quadratic Stark effect. This occurs in lines result- 
ing from the lower energy states of many-electron 
atoms. Here the large separation of states of differ- 
ent Z results from the penetration of the valence elec- 
trons into the core of other electrons, with the result 
that the permanent dipole moment associated with 
hydrogenlike orbits no longer exists. There is, how- 
ever, a small induced dipole moment due to polariza- 
tion of the atom. This moment is proportional to the 
electric field strength, and since the energy change 
is proportional to the product of the dipole moment 
and the field strength, the energy levels shift by an 





amount depending on the square of the field. Thus 
all levels have shifts of the same sign and therefore 
are displaced to lower energies. Each field-free level 
is also split, as a result of the space quantization of 
the angular momentum vector J, so that there are 
J + 1 components if J is integral, or J + 1/2 com- 
ponents if it is half-integral. Because the lower lev- 
els are usually less displaced than the higher ones, 
the quadratic Stark effect ordinarily shows itself as 
a shift of the lines toward the red end of the spec- 
trum, with an accompanying separation into several 
components. The quadratic Stark effect is basic to 
the explanation of the formation of molecules from 
atoms, of dielectric constants, and of the broadening 
of spectral lines. 

Intermolecular Stark effect. Produced by the ac- 
tion of the electric field from surrounding atoms or 
ions on the emitting atom, the intermolecular ef- 
fect causes a shifting and broadening of spectrum 
lines. The molecules being in motion, these fields 
are inhomogeneous in space and also in time. Hence 
the line is not split into resolved components but 
is merely widened. Particularly in the electric dis- 
charge through gases with high currents, the large 
ion density may cause very wide lines. The amount 
of the broadening is found to run parallel to the 
sensitivity of the line to the Stark effect and thus 
is greatest for those lines susceptible to the linear 
effect. 

Inverse Stark effect. This is the effect as observed 
with absorption lines. It has been detected, for ex- 
ample, by applying an electric field to potassium 
vapor and measuring a small displacement of the ab- 
sorption lines toward the red. The displacements are 
found to be proportional to the square of the field 
strength, as in the quadratic Stark effect. In addition, 
certain lines of large n, which are normally forbidden 
by the selection rule for L, are observed to appear 
in the presence of the field. This type of transition is 
said to be produced by forced dipole radiation. Such 
forbidden lines also may be obtained in emission. In 
Fig. 1, the lines 2!P-6!S and 2!P-4!F are examples of 
forced dipole transitions. F. A Jenkins; W. W. Watson 

Quantum-confined Stark effect. This is the Stark ef- 
fect observed in structures in which the hydrogenic 
system is confined in a layer of thickness much less 
than its normal diameter. This is not practical with 
atoms, but the effect is observed with excitons in 
semiconductor quantum-well heterostructures. It is 
important that quantum-confined Stark shifts can be 
much larger than the binding energy of the hydro- 
genic system. The resulting shifts of the exciton op- 
tical absorption lines can be used to make optical 
beam modulators and self-electrooptic-effect optical 
switching devices. 

Wannier excitons behave like very large hydrogen 
atoms; for example, in gallium arsenide (GaAs) their 
diameter is approximately 30 nm, and their binding 
energy is approximately 4 meV. In a quantum well, 
consisting, for example, of a 10-nm gallium arsenide 
well layer between aluminum gallium arsenide (Al- 
GaAs) barrier layers, the exciton is confined primar- 
ily in the gallium arsenide layer, becoming overall 


smaller. This smaller size, incidentally, gives the ex- 
citon a larger binding energy (for example, 10 meV), 
and in contrast to most semiconductors, exciton 
absorption resonances can be seen clearly even at 
room temperature in many quantum-well structures. 
When an electric field is applied perpendicular to the 
layer planes, the electron and hole in the exciton are 
polarized, being displaced toward opposite sides of 
the well. This polarization by the field gives essen- 
tially a quadratic Stark shift of the ground-state exci- 
ton in a symmetric quantum well. Such Stark shifts in 
quantum wells can be much larger than the binding 
energy (typically 40 meV). 

In normal semiconductors with unconfined exci- 
tons, some quadratic Stark shifts can also be seen, 
but these are limited to a small fraction (10%) of 
the binding energy because of line broadening re- 
sulting from field ionization. Fields so large that they 
correspond to a potential drop of more than one 
binding energy over the exciton diameter (for ex- 
ample, about 100 kV/m) field-ionize the unconfined 
exciton very rapidly, pulling the electron and hole 
away from one another in much less than a classical 
orbit time. Because the particle lifetime is so short, 
the optical absorption spectrum shows primarily 
strong broadening of the exciton lines, according to 
the Heisenberg uncertainty principle. With the con- 
fined exciton, however, the barriers prevent the ex- 
citon from being rapidly field-ionized, even at fields 
of about 10 MV/m. Hence the exciton absorption 
lines in the quantum-confined Stark effect are not 
strongly broadened, even though the Stark shifts in 
such large fields can be many times the binding en- 
ergy of the exciton (Fig. 2). The exciton absorption 
lines in Fig. 2 correspond to the creation of excitons 
in their ground state, and the shifts of the absorption 
line correspond to the (quadratic) quantum-confined 
Stark shift of this exciton ground state. The absorp- 
tion peak height is reduced with field because there 
is less spatial overlap of the electron and hole as they 
are pulled apart, lowering the absorption strength of 
the transition. 
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Fig. 2. Optical absorption spectra of 9.4-nm gallium 
arsenide (GaAs) quantum wells for different fields 
perpendicular to the layers, showing the quantum-confined 
Stark effect shift of the exciton absorption peak. (After 

D. A. B. Miller, J. S. Weiner, and D. S. Chemla, Electric-field 
dependence of linear optical properties in quantum will 
structures: Waveguide electroabsorption and sum rules, 
IEEE J. Quant. Electron., QE-22:1816-1830, 1986) 


Static electricity 


A typical structure for observing the quantum- 
confined Stark effect would contain about 50 gallium 
arsenide quantum-well layers of total thickness of 
1 micrometer within a semiconductor diode struc- 
ture. Reverse-biasing the diode with about 10 V ap- 
plies a 10-MV/m electric field; this can change the 
transmitted power of a light beam of appropriate 
wavelength (for example, 850 nm) by a factor of 2 
to make an optical modulator. This offers a method 
for modulating light beams with very low operat- 
ing energy in small devices at room temperature. 
See ARTIFICIALLY LAYERED STRUCTURES; ELECTROOP- 
TICS; EXCITON; OPTICAL MODULATORS; SEMICONDUC- 
TOR HETEROSTRUCTURES. David A. B. Miller 

Bibliography. B. H. Bransden and C. J. Joachain, 
Physics of Atoms and Molecules, 2d ed., 2003; E. U. 
Condon and H. Odabasi, Atomic Structure, 1980; 
R. D. Cowan, The Theory of Atomic Structure and 
Spectra, 1981; H. Haug (ed.), Nonlinear Optical 
Properties of Semiconductors, 1988; P. H. Heckman 
and E. Trabert, Introduction to the Spectroscopy of 
Atoms, 1989. 


a SS SSF 
Static 


A hissing, crackling, or other sudden sharp noise that 
tends to interfere with the reception, utilization, or 
enjoyment of desired signals or sounds. Perhaps the 
commonest form of static is that heard in ordinary 
broadcast receivers during electrical storms. Inter- 
ference in radio receivers caused by improperly op- 
erating electric devices in the vicinity is sometimes 
also called static. See SFERICS. 

The crackling sounds heard when long-playing 
plastic phonograph records are played are also called 
static. These sounds are caused by sudden deflection 
of the phonograph needle by dust particles, which 
are attracted to the grooves of the record by surface 
electric charges that are caused by friction on dry 
days. Static appears as momentary white specks in 
a television picture. See ELECTRICAL INTERFERENCE; 
ELECTRICAL NOISE. John Markus 


SSS 
Static electricity 


Electric charge at rest, generally produced by fric- 
tion or electrostatic induction. In the sixth cen- 
tury B.C., Thales of Miletus, who is usually credited 
with the discovery of static electricity, described ex- 
periments in which rubbed amber induced move- 
ment in nearby particles. It was not until the eigh- 
teenth century, when Benjamin Franklin carried out 
his kite-flying experiments in which he “captured” 
some thunderstorm electricity in a Leyden jar, that it 
was verified that thunderstorm electricity and static 
electricity were one and the same. See ELECTRIC 
CHARGE. 

Triboelectrification is the process whereby charge 
transfer between dissimilar materials, at least one 
of which must have a high electrical resistivity, oc- 
curs due to rubbing or mere contact. Contact and 
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separation of materials occur during industrial oper- 
ations such as powder processing and the manufac- 
ture of plastic and other materials in sheet form. See 
ELECTRICAL RESISTIVITY. 

Most organic and polymeric materials have volume 
resistivities greater than 10° ohms per meter and re- 
tain charge for periods of many hours. Triboelectric 
charging is not fully understood, and the magnitude 
of charging may be strongly influenced by surface 
contamination of the contacting materials, relative 
humidity of the atmosphere, and the energy of rub- 
bing. For example, during the grinding of material 
during powder manufacture, charge levels of about 
a microcoulomb per kilogram are typical. This is a 
relatively low level of charge. The more energetic 
process of pneumatically conveying a powder along 
a pipe may give rise to charge levels a thousand times 
greater and an electrostatic hazard may arise. Haz- 
ards are due to sparks to ground after a relatively 
large amount of static charge has accumulated on a 
body that is improperly grounded. See ELECTRIC IN- 
SULATOR; ELECTRIC SPARK. 

In modern industry, highly insulating synthetic ma- 
terials, such as plastic powders and insulating liq- 
uids, are used in large quantities in an ever increas- 
ing number of applications. Such materials charge 
up readily, and large quantities of electrical energy 
may develop with an attendant risk of incendiary 
discharges. When, for example, powder is pneumat- 
ically transported along pipes, charge levels of up 
to about 100 microcoulombs per kilogram can de- 
velop and potentials of thousands of volts are gen- 
erated within powder layers and the powder cloud. 
Energetic sparking from charged powder may initi- 
ate an explosion of the powder cloud. Similar prob- 
lems occur when insulating liquids, such as certain 
fuels, are pumped along pipes, and it is essential 
that strict grounding procedures are followed dur- 
ing the refueling of aircraft, ships, and other large 
vehicles. 

The capacity of a person for retaining charge de- 
pends upon stature, but is typically about 150 pi- 
cofarads. Even the simple operations of removing 
items of clothing or sliding off a chair can lead to 
body discharges to ground of about 0.1 uC, which 
are energetic enough to ignite a mixture of natural 
gas and air. Human body capacitance is sufficiently 
high that, if poorly conducting shoes are worn, body 
potential may rise to 15,000 V or so above ground 
during industrial operations such as emptying bags 
of powder. Sparking may then occur with energy 
exceeding the minimum ignition energy of pow- 
der or fumes, so initiating a fire or explosion. Con- 
ducting footware should be used to prevent charge 
accumulation on personnel in industrial situations 
where triboelectrification may occur. See CAPACI- 
TANCE. 

In the microelectronics industry, extremely low- 
energy discharges, arising from body potentials of 
only a few tens of volts, can damage microelectronics 
systems or corrupt computer data. During the han- 
dling of some sensitive semiconductor devices, it is 
imperative that operators work on metallic grounded 


surfaces and are themselves permanently attached to 
ground by conducting wrist straps. Whenever films 
of insulating material are wound over rollers, such as 
in the photographic industry, surface charging and 
subsequent discharging occur. The discharges ob- 
served from charged insulator surfaces are brushlike 
and are known as brush discharges. See ELECTRO- 
STATICS. A. G. Bailey 

Bibliography. M. Glor, Electrostatic Hazards in 
Powder Handling, 1988; D. M. Taylor and P. E. 
Secker, Industrial Electrostatics: Fundamentals 
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Static var compensator 


A thyristor-controlled (hence static) generator of re- 
active power, either lagging or leading, or both. The 
word var stands for volt ampere reactive, or reac- 
tive power. The device is also called a static reactive 
compensator. 

Need for reactive compensation. Reactive power 
is the product of voltage times current where the 
voltage and current are 90° out of phase with one 
another. Thus, reactive power flows one way for 
one-quarter of a cycle, the other way for the next 
quarter of a cycle, and so on (in contrast to the real 
power, or active power, which flows in one direc- 
tion only). This back-and-forth flow results in no net 
power being delivered by the generator to the load. 
However, current associated with reactive power 
does flow through the conductor and creates extra 
losses. See ALTERNATING-CURRENT CIRCUIT THEORY; 
ELECTRIC POWER MEASUREMENT; VOLT-AMPERE. 

Most loads, particularly motor loads, draw lagging 
reactive power along with active power. Increased 
lagging reactive power causes electric power system 
voltage to sag, and under heavy load conditions the 
voltage can decrease below desirable levels. (Utili- 
ties usually hold system voltage to within +5% of 
the nominal value.) On the other hand, under light 
loads; the capacitance of high-voltage lines can cre- 
ate excessive leading reactive power, causing the 
voltage at some locations to rise above the nominal 
value. 

Finally, it is prudent to keep reactive power flows 
to a minimum in order to allow the lines to carry 
more active power. For example, cables have very 
high capacitance to ground (compared to overhead 
lines), and the resulting leading reactive power loads 
a 10-20-mi (15-30-km) transmission cable to its ther- 
mal limit when voltage is applied, leaving it with no 
ability to carry real power. 

Mechanical versus static compensation. Utilities fre- 
quently install capacitors connected from line to 
ground to compensate for lagging reactive power 
and reactors connected from line to ground to com- 
pensate for leading reactive power. These reactors 
and capacitors are switched in and out with mechan- 
ical switches based on the level of line loading as it 
varies throughout the day. However, frequent opera- 
tion of these mechanical switches may reduce their 
reliability. 
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Fig. 1. One phase of a static var compensator (thyristor- 
controlled reactor). (a) Circuit diagram of thyristor- 
controlled reactor in the electric power system. (b) Voltage 
and current waveforms with partial conduction. (c) Wave- 
forms with greater, but still partial, conduction. (d) Wave- 
forms with full conduction. 


It is desirable to have a controllable source of re- 
active power (leading or lagging); and the static var 
compensator, controlled with static switches, called 
thyristors, for higher reliability, fulfills this function. 
It is more expensive than mechanically switched ca- 
pacitors and reactors (due to the cost of thyristor 
valves and associated equipment), and hence its use 
is based on an economic trade-off of benefits versus 
cost. See SEMICONDUCTOR RECTIFIER. 

Operation. Figure 1a shows one phase of a typical 
static var compensator (thyristor-controlled reactor). 
Lagging reactive power is produced by controlling 
the current flow through a reactor with thyristors. 

By controlling the start of conduction of thyris- 
tor 1, current is allowed to flow at the desired mo- 
ment, say x, (Fig. 1b). The current rises, reaches 
its peak at the voltage zero, and then falls back to 
zero in about the same time period that it took to 
reach its peak. Thyristor 2 similarly controls the cur- 
rent flow in the opposite direction. More current 


Static var 


and hence an increasing amount of lagging reactive 
power can be allowed to flow by firing the thyris- 
tors at an earlier time (Fig. lc). Maximum current 
will flow when the thyristors are fired 90° before the 
respective voltage zero (Fig. 1d). Thus, lagging reac- 
tive power can be rapidly controlled by the appro- 
priate firing of thyristors. Control circuits, including 
firing circuits and sophisticated controls, vary the fir- 
ing time. Since power systems are three-phase, three 
single-phase compensators must be constructed 
(Fig. 2). 

Voltage. Since thyristors for the high voltages of 
transmission lines can be expensive, it is customary 
to provide a step-down transformer to lower the volt- 
age to about 10-20 kV for connection of a static var 
compensator. Often a power transformer connected 
to a transmission circuit is provided with a tertiary 
winding of low voltage, to which the static var com- 
pensator can be connected. The relatively low volt- 
age capability of thyristors requires connecting sev- 
eral of them in series. See TRANSFORMER. 

Use of filters. Except when the thyristors are fired 
at 90° for maximum current, the current is not a pure 
60-Hz sine wave, but rather contains undesirable har- 
monics of the 60-Hz component. Filters must be pro- 
vided to eliminate most of these harmonics (Fig. 2). 
See ELECTRIC FILTER. 

Since these filters have large capacitors, they 
also become a source of leading reactive power. 
Thus, the required rating of the controlled reac- 
tor is correspondingly increased to compensate for 
them. If smooth reactive power control over a wide 
range of leading and lagging reactive power is de- 
sirable, then additional mechanically switched ca- 
pacitors (Fig. 1) and a larger controlled reactor are 
provided. 

If mechanically controlled capacitors are not de- 
sirable because of the need for frequent operation, 
the capacitors can be controlled with thyristors 
(Fig. 2). However, because the current in the 
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Fig. 2. Complete three-phase static var compensator, constructed from three single- 


phase compensators. 
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capacitors cannot be controlled as conveniently as 
in the reactors, several small capacitor banks are pro- 
vided. Narain G. Hingorani 
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Statics 


The branch of mechanics that describes bodies 
which are acted upon by balanced forces and torques 
so that they remain at rest or in uniform motion. 
This includes point particles, rigid bodies, fluids, 
and deformable solids in general. Static point par- 
ticles, however, are not very interesting, and spe- 
cial branches of mechanics are devoted to fluids and 
deformable solids. For example, hydrostatics is the 
study of static fluids, and elasticity and plasticity are 
two branches devoted to deformable bodies. There- 
fore this article will be limited to the discussion of 
the statics of rigid bodies in two and three spatial di- 
mensions. See BUOYANCY; ELASTICITY; HYDROSTAT- 
ICS; MECHANICS. 

Mechanics is the study of motions in terms of 
mass, length, time, and forces. In statics the bodies 
being studied are in equilibrium. Positions of points 
in space, velocities, forces, and torques are all vector 
quantities, since each has direction, magnitude, and 
units. Beams, bridges, machine parts, and so on are 
not really rigid, but whenever the largest change in 
length of a portion of a body AZ is much smaller than 
the length /, that is, / > AZ, then it is a satisfactory ap- 
proximation to treat that body as a rigid body. Many 
of the objects used in architecture, engineering, and 
physics can be satisfactorily idealized as rigid bod- 
ies, and much of building, machine, bridge, and dam 
design is based upon the study of statics. 

The equilibrium conditions are very similar in 
the planar, or two-dimensional, and the three- 
dimensional rigid-body statics. These conditions are 
that the vector sum of all forces acting upon the body 
must be zero, and the resultant of all torques about 
any point must be zero. Thus it is necessary to un- 
derstand the vector sums of forces and torques. 

Force as a vector. The physical effect of a force is 
a push or pull on an object at its point of applica- 
tion. The effect of a force is to change the veloc- 
ity of the body. The SI units of force are newtons 
(1 N = 1 kilogram-meter per second squared). The 
vector nature of forces is expressed by their direc- 
tion in space. This direction is the direction of the 
push or pull exerted by the force. The notation for a 
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Fig. 1. Directed line AB or P is a vector quantity in the 
plane. 


vector quantity is an overarrow, for example, F or P. 
In Fig. 1 a two-dimensional vector P is shown as a 
directed line segment AB in the plane of the paper. A 
three-dimensional vector P is shown in Fig. 2 as the 
directed line segment OD. A complete description 
of a vector is obtained by specifying a coordinate 
system and giving its components in that coordinate 
system. The sum of any two vectors A + Bis the sum 
of their components, so that A and B must have the 
same number of components. 

Superposition and transmissibility. In studying stat- 
ics problems (and general mechanics problems), 
two principles, superposition and transmissibility, 
are used repeatedly on force vectors. They are ap- 
plicable to all vectors, but specifically to forces and 
torques (first moments of forces). 

1 The principle of superposition of d-dimensional 
vectors is that the sum of any two d—vectors is an- 
other d-vector. Of course, some or all components 
can be zero. This principle is illustrated in Fig. 3. The 
two vectors labeled Q, which can be shown applied 
to a rigid structural member D, add up to zero, and 
so do the two labeled R. Superposition applies to all 
sums of vectors, not just sums which vanish. 

2 The principle of transmissibility of a force ap- 
plied to a rigid body is that the same mechanical 
effect is produced by any shift of the application of 
the force. The principle of transmissibility is illus- 
trated in Fig. 3. The solid rectangle C represents a 
solid structural member, and the line AB is the line 





Fig. 2. Rectangular components of a force vector P in 
three-dimensional space. 
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Fig. 3. Illustration of two theorems of statics. 
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Fig. 4. Moments about the axis in rectangular coordinates. 


of action for the equivalent forces P. Since these two 
forces have the same magnitudes and direction and 
are applied along the same line, by transmissibility, 
all of their mechanical effects are equivalent. To use 
the superposition principle to add two vectors, the 
principle of transmissibility is used to move some 
vectors along their line of action in order to add to 
their components. 

Components of a force. Figure 1, the two- 
dimensional example, shows the construction by 
which orthogonal force components are defined. In 
this figure P or AB isa force vector, and LZ a directed 
line whose positive sense is toward its labeled end. 
Construction lines AC and BD are in planes (not 
shown) normal to LZ, and @ is the direction angle of 
P relative to L; it is a plane angle between the di- 
rections in the positive senses of È and L; further, 
0 <98 < 180°. 

The orthogonal vector component of force P on 
directed line Z is a force of direction and magnitude 
given by P; or CD, where P; is in the direction of L. 
Its magnitude is given by Eq. (1). 

P, = CD = AE = AB|cos0| = P| cosôð) œ) 

The component of P on Lis P; = P cos 0, where 
P; is positive if 0° > 90° < 90° and negative if 90° < 0 
< 180°. The absolute magnitude of P; is designated 
|Prl. 

The rectangular components of a force are its com- 
ponents on mutually perpendicular lines. In Fig. 2, 
P or OA, P or OB, and p or OČ are the rectangular 
vector components of P or OD in the directions of 
lines (axes) X, Y, and Z, respectively. 

The corresponding components have the mag- 
nitudes Py = P cos 6x, Py = P cos Oy, and Pz = 


P cos 0z, and are in the X, Y, Z directions, respec- 
tively. The magnitude of the three-dimensional vec- 
tor P is given by Eq. (2). 


P = || P} + P? + P3 (2) 


Moment of a force. The moment of a force about a 
directed line is a signed number whose value can be 
obtained by applying these two rules: 

1 The moment of a force about a line parallel to 
the force is zero. 

2 The moment of a force about a line normal to 
a plane containing the force is the product of the 
magnitude of the force and the least distance from 
the line to the line of the force. Conventionally, the 
moment is positive if the force points counterclock- 
wise about the line as viewed from the positive end 
of the line. 

In Fig. 4 the moments of P about the X Y, 
and Z coordinate axes are My = bPz — cPy about 
the X axis; My = cPy — aPz about the Y axis; 
and Mz = aPy—bPy about the Z axis. The re- 
sultant of the two torques or moments of force, 
T 1 and T 2 about a point P, is shown as T + ifi 
Fig. 5. The components of these torques are due 
to forces F; and F, oriented with respect to T as 
shown. 

Figure 6 illustrates cases where the forces in a 
structure may or may not depend on the relative 
strength of its parts. Case 6a is called statically de- 
terminate, and case 6b statically indeterminate. The 
terms are actually misnomers, for the forces can be 
found in both cases. Better terminology would be 
rigidly determinable and indeterminable. In case 6b, 
if each member were rigid (infinitely strong), the 
forces in each could not be determined. 

In summary, the statics of rigid bodies in stati- 
cally determinate structures is carried out by sum- 
ming vector forces and vector torques and setting the 





Fig. 5. Resultant of two torques. 


Statics 


371 


372 


Statistical mechanics 








(a) 


Fig. 6. Statically determinate and indeterminate structures. 
(a) The forces in members A and B are independent of their 
strengths (for small deformations). (b) The relative strength 
of the added member D will determine how much of the 
load W it supports. 


resultants equal to zero. See COUPLE; EQUILIBRIUM OF 
FORCES; FORCE; TORQUE. Brian De Facio 
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Statistical mechanics 


That branch of physics which endeavors to explain 
the macroscopic properties of a system on the basis 
of the properties of the microscopic constituents of 
the system. Usually the number of constituents is 
very large. All the characteristics of the constituents 
and their interactions are presumed known, it is the 
task of statistical mechanics (often called statistical 
physics) to deduce from this information the behav- 
ior of the system as a whole. 


Scope of Statistical Mechanics 


Elements of statistical mechanical methods are pres- 
ent in many widely separated areas in physics. For in- 
stance, the classical Boltzmann problem is an attempt 
to explain the thermodynamic behavior of gases on 
the basis of classical mechanics applied to the system 
of molecules. Historically, this was the first system- 
atic investigation of a statistical problem, and many 
of the procedures and methods originate from this 
investigation. It is important to realize that statisti- 
cal mechanics gives more than just an explanation 
of already known phenomena. By using statistical 
methods, it often becomes possible to obtain expres- 
sions for empirically observed parameters, such as 
viscosity coefficients, heat conduction coefficients, 
and virial coefficients, in terms of the forces between 
molecules. This kind of result, a direct relation be- 
tween an observed macroscopic entity and an in- 
termolecular potential, constitutes one of the main 
achievements of statistical physics. See BOLTZMANN 





STATISTICS; INTERMOLECULAR FORCES; KINETIC THE- 
ORY OF MATTER. 

Statistical considerations also play a significant role 
in the description of the electric and magnetic prop- 
erties of materials. In this case the objective is to 
deduce such characteristics as the permittivity, elec- 
trical conductivity, and magnetic permeability from 
the known properties of the atom. To obtain the 
Maxwell equations, which are macroscopic equa- 
tions for matter in the bulk, from the Lorentz (elec- 
tron) equations is typically a statistical question; new 
information which can be obtained from this ap- 
proach includes, for example, the temperature de- 
pendence of the permittivity. See ELECTRICAL CON- 
DUCTIVITY OF METALS; MAGNETIC SUSCEPTIBILITY; 
MAXWELL’S EQUATIONS; PERMITTIVITY; RELATIVISTIC 
ELECTRODYNAMICS. 

If the problem of molecular structure is attacked 
by statistical methods, the contributions of internal 
rotation and vibration to thermodynamic properties, 
such as heat capacity and entropy, can be calculated 
for models of various proposed structures. Compar- 
ison with the known properties often permits the 
selection of the correct molecular structure. The sta- 
tistical description of the activated complex in reac- 
tion mechanisms has enabled chemists to formulate 
a theory of absolute reaction rates. See MOLECULAR 
STRUCTURE AND SPECTRA. 

Perhaps the most dramatic examples of phenom- 
ena requiring statistical treatment are the coop- 
erative phenomena or phase transitions. In these 
processes, such as the condensation of a gas, the 
transition from a paramagnetic to a ferromagnetic 
state, or the change from one crystallographic form 
to another, a sudden and marked change of the whole 
system takes place. The appropriate description of 
such processes is one of the most difficult but most 
interesting problems of statistical physics. See PHASE 
TRANSITIONS. 

Statistical considerations of quite a different kind 
occur in the discussion of problems such as the 
diffusion of neutrons through matter. In this case, 
the probability of the various events which affect 
the neutron are known, such as the capture prob- 
ability and scattering cross section. The problem 
here is to describe the physical situation after a 
large number of these individual events. The pro- 
cedures used in the solution of these problems are 
very similar to, and in some instances taken over 
from, kinetic considerations. Similar problems occur 
in the theory of cosmic-ray showers. Here the prob- 
abilities of the basic processes—pair production, 
annihilation, ionization, meson production—are all 
presumed known. The major task is the computa- 
tion of the combined effect of a large number of 
these individual events. Special techniques of statis- 
tical physics, the use of distribution functions and 
transport-type equations, find extensive applications 
in these areas. See BOLTZMANN TRANSPORT EQUA- 
TION; COSMIC RAYS; REACTOR PHYSICS. 

It happens in both low-energy and high-energy 
nuclear physics that a considerable amount of en- 
ergy is suddenly liberated. An incident particle may 


be captured by a nucleus, or a high-energy proton 
may collide with another proton. In either case, 
there is a large number of ways (a large number 
of degrees of freedom) in which this energy may 
be utilized. In the nuclear case, there are usually 
many decay and excitation modes; in the case of 
the proton-proton collision, there is enough energy 
available for the creation of a number of mesons. 
To survey the resulting processes, one can again in- 
voke statistical considerations. The statistical prob- 
lem here is to calculate the probability that, given 
a total amount of energy, a particular process will 
occur. Of course, the statistical method cannot help 
in the investigation of the actual mechanisms of 
these processes. However, the statistical factors must 
be considered in the interpretation of experiments. 
See NUCLEAR REACTION; SCATTERING EXPERIMENTS 
(NUCLED. 

Of considerable importance in statistical physics 
are the random processes, also called stochastic pro- 
cesses or sometimes fluctuation phenomena. The 
brownian motion, the motion of a particle moving 
in an irregular manner under the influence of molec- 
ular bombardment, affords a typical example. The 
process may be described in terms of a fluctuating 
force acting on a particle, perhaps in addition to a 
systematic force. The interest in this problem centers 
around a calculation of the position and velocity of 
a particle after it has experienced many collisions or 
after the fluctuating force has acted for a long time. 
The stochastic processes are in a sense intermedi- 
ate between purely statistical processes, where the 
existence of fluctuations may safely be neglected, 
and the purely atomistic phenomena, where each 
particle requires its individual description. All sta- 
tistical considerations involve, directly or indirectly, 
ideas from the theory of probability of widely dif- 
ferent levels of sophistication. The use of probabil- 
ity notions is, in fact, the distinguishing feature of 
all statistical considerations. See BROWNIAN MOVE- 
MENT; PROBABILITY; PROBABILITY (PHYSICS); STATIS- 
TICS; STOCHASTIC PROCESS. 


Methods of Statistical Mechanics 


In the following sections procedures used in statisti- 
cal mechanics are discussed. 

Phase space. For a system of N particles, each 
of mass m, contained in a volume V, the positions 
of the particles may be labeled x), 11, 21, ..., Xy, 
yy, Zn, their cartesian velocities vy, ..., Uzy, and 
their momenta p,), ..., Dzy. The simplest statisti- 
cal description concentrates on a discussion of the 
distribution function /@,y,2;V;,U),Uz;). The quan- 
tity fOOV.Z5Ux,U;),Uzb) (dxdydzduxdv,dvz) gives the 
(probable) number of particles of the system in those 
positional and velocity ranges where x lies between 
x and x + dx; Ux between vx and vx + dux, and 
so on. These ranges are finite. The six-dimensional 
space defined by x, y, Z, Ux, Uy, Uz is called the u 
space, and the behavior of the gas is represented ge- 
ometrically by the motion of N points in the jz space. 
It was shown by L. Boltzmann that in the course of 
time fapproaches an equilibrium distribution /° as in 
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Eq. (1). Here d*x = dxdydz, A and ß are parameters, 
f°, Vv) Px 
=A exp[—'/.Bmv* — BUG, y, 2] dxd*v (A) 


and U(x,),Z) is the potential energy at the point x, y, 
z. Boltzmann also could interpret Eq. (1) as the most 
probable distribution. 

Actually, important as the use of the distribution 
function is in practice, there are in principle seri- 
ous limitations and difficulties associated with it. The 
limitations refer to the fact that the description by 
means of a single distribution function is correct 
only for noninteracting particles; also, the neglect 
of triple and higher collisions restricts the appli- 
cability to very dilute systems. The indiscriminate 
use of the ideas of Boltzmann leads to paradoxical 
results. 

In the further study of these questions, the phase 
space of a dynamical system plays an important role. 
This is the 6N-dimensional euclidean space, whose 
ON axes are given in notation (2). The state of the 
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system at a given time is completely specified by 
these 6N numbers. These numbers define a point 
(the representative point, or phase point) in the 6N- 
dimensional phase space. In the course of time the 
variables x), ..., Dzy Change, hence the phase point 
moves. If the system is conservative, Eq. (3) defines, 
for the energy E, a (GN — 1)-dimensional surface in 
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the phase space called the energy surface. For a con- 
servative system, the phase point in the course of 
time wanders over the energy surface. Intuitive as 
this geometrical representation of an involved me- 
chanical system might seem, the actual trajectory of 
the phase point is extremely complicated. Also, the 
use of the phase space does not by itself imply any 
statistical considerations. A knowledge of the trajec- 
tory is precisely equivalent to a solution of the com- 
plete mechanical problem, for it determines each p 
and x as a function of time. However, many differ- 
ent microscopic states all correspond to the same 
macroscopic situation. 

Observations made on a system always require a 
finite time; during this time the microscopic details 
of the system will generally change considerably as 
the phase point moves. The result of a measure- 
ment of a quantity Q will therefore yield the time 
average, as in Eq. (4). The integral is along the tra- 
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jectory in phase space; Q depends on the variables 
X1, ..., Pzy and t. To evaluate the integral, the tra- 
jectory must be known, and as mentioned above, 
this requires the solution of the complete mechanical 
problem. 
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For many years one of the major problems in sta- 
tistical mechanics was precisely to recast the expres- 
sion for a time average in a more tractable form. Gen- 
erally attempts were made to replace time averages 
by phase space averages, by trying to show that the 
time spent in a region was proportional to the size 
of the region. The legitimacy of this procedure was 
studied extensively by mathematicians, but although 
these studies opened up significant areas in mathe- 
matics, they did not really help elucidate problems 
of physical interest. 

The subtlety of the problems involved is illustrated 
by the so-called ergodic theorem of G. D. Birkhoff, 
which asserts that lim, Q; exists for almost all tra- 
jectories, although this limit varies discontinuously 
from trajectory to trajectory. The Poincaré recur- 
rence theorem, which states that a system within 
a finite time will return as closely as desired to its ini- 
tial state, is of special interest in statistical mechan- 
ics. Apart from providing the mathematical frame- 
work for these theorems, the phase space provides 
also the appropriate framework for the formulation 
of ensemble theory, which is the basis of modern 
statistical mechanics. 

Ensembles; Liouville’s theorem. J. Willard Gibbs first 
suggested that instead of calculating a time average 
for a single dynamical system, a collection of systems, 
all similar to the original one, should instead be con- 
sidered. Such an ensemble of systems is to be con- 
structed in harmony with the available knowledge 
of the single system, and may be represented by an 
assembly of points in the phase space, each point 
representing a single system. If, for example, the en- 
ergy of a system is precisely known, but nothing else, 
the appropriate representative example would be 
a uniform distribution of ensemble points over the 
energy surface, and no ensemble points elsewhere. 
An ensemble is characterized by a density function 
PM, -3 ZN; Px -> Peni D = Pp, À. The signifi- 
cance of this function is that the number of ensem- 
ble systems dN, contained in the volume element 
dx,... dzy;dp x... dpzy of the phase space (this vol- 
ume element will be called dl), at time t is as given 
in Eq. (5). 
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The ensemble average of any quantity Q is given 
by Eq. (6). The basic idea now is to replace the time 
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foa 
average of an individual system by the ensemble aver- 
age, at a fixed time, of the representative ensemble. 
Stated formally, the quantity QO, defined by Eq. (4, 
in which no statistics is involved, is identified with 
Qens defined by Eq. (6), in which probability assump- 
tions are explicitly made. Another form of this same 
connection between the behavior of the individual 
system and the ensemble is that the probability that 
the individual system at time ¢ will be in a region R of 
the phase space is given by the fraction of ensemble 


systems contained in R at time ¢, Eq. (7). 


See aT 
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It is clear, on the basis of these relations, that the 
complete statistical behavior of a system is known 
once its representative ensemble has been obtained. 
In the construction of such ensembles, it is always 
assumed that accessible parts of the phase space 
should be weighted equally. There is one other gen- 
eral requirement that the density function must sat- 
isfy. Inasmuch as the number of ensemble mem- 
bers remains constant (no ensemble members are 
created or destroyed in the course of time), there 
is a continuity equation, Eq. (8), for the density 
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function p which simply states that the change in 
the number of systems in a volume of phase space 
per unit time equals the difference of the number 
of systems flowing in and flowing out of that vol- 
ume. If the time derivatives dx,/dt and dp,/dt are 
now expressed through the hamiltonian equations, 
Eq. () is obtained. Here dp/dt is the substantial, 
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or total derivative. See EQUATION OF CONTINUITY; 
HAMILTON’S EQUATIONS OF MOTION. 

Equation (9), called the Liouville theorem, asserts 
that the time rate of change of p is zero along a 
streamline in phase space. It can also be deduced 
from Eq. (9) that the ensemble systems move in 
the course of time in a volume-preserving fashion 
through the phase space. If at some initial time a set 
of ensemble points occupy a volume Vo, then in the 
course of time each representative point will move 
in a complicated fashion. However, at time ¢ the vol- 
ume V, made up by the ensemble points which ini- 
tially were in Vo is still equal to Vo. The shape of the 
volume changes, but the actual volume is constant. 
In the attempts which have been made to justify 
the postulate that O; = Ogns, this volume-preserving 
property plays a dominant role. The proof of the 
Poincaré recurrence theorem also depends crucially 
on this property. Yet in spite of the general signifi- 
cance of the Liouville theorem, Eq. (9) generally does 
not help in the task of setting up an appropriate rep- 
resentative ensemble. (it is relevant, however, to the 
subsequent discussion on nonequilibrium theory of 
liquids.) The choice of the ensemble is usually deter- 
mined by physical considerations and by the already 
mentioned postulate of equal a priori probabilities 
in phase space. Perhaps most important is the a pos- 
teriori justification of the statistical procedures, ob- 
tained through a comparison with the experimental 
facts. It is important that inasmuch as the methods 
are statistical, not only average values but also fluctu- 
ations around these average values can be computed. 
It turns out that usually these fluctuations are 


extremely small; however, in those instances in 
which they are appreciable, they can be compared 
with experiments, providing additional support for 
the statistical procedures. Hence, despite the fact 
that no completely rigorous mathematical justifica- 
tion of the ensemble methods exists, these meth- 
ods are physically so plausible and lead to such 
agreement with experiments for a variety of systems 
that they reasonably must be considered as a well- 
established part of statistical physics. 

Relation to thermodynamics. It is certainly reason- 
able to assume that the appropriate ensemble for a 
thermodynamic equilibrium state must be described 
by a density function which is independent of the 
time, since all the macroscopic averages which are 
to be computed as ensemble averages are time- 
independent. Thus an equilibrium ensemble is re- 
alized when 0p/dt = 0. In that case it follows from 
Liouville’s theorem that Eq. (10) holds. It is easy to 
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see that if p is a function of a quantity œ (which is 
a function of x and p and which is a constant of the 
motion, such as the energy, so that da/dt = 0), then 
any (a) satisfies Eq. (10). Hence any density func- 
tion p which is a function of x and p through the 
dependence of the energy E on p and x satisfies the 
Liouville equation (9). The functional form of p(£) is 
left completely unspecified. 

Microcanonical ensemble. In the case of an isolated sys- 
tem, where the energy is specified within a well- 
defined range, the most obvious example to use 
would be the so-called microcanonical ensemble de- 
fined by Eq. (11a), where c is a constant, for the en- 
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ergy E between Eo and Ey + AE; for other energies 
Eq. (110) holds. By using Eq. (6), any microcanonical 
average may be calculated. The calculations, which 
involve integrations over volumes bounded by two 
energy surfaces, are not trivial. Still, many of the re- 
sults of classical Boltzmann statistics may be obtained 
in this way. 

Canonical ensemble. For applications and for the in- 
terpretation of thermodynamics, the canonical en- 
semble is much more preferable. This ensemble de- 
scribes a system which is not isolated but which is 
in thermal contact with a heat reservoir. By describ- 
ing the complete system (system plus reservoir) by a 
microcanonical ensemble, it may be shown that the 
system itself may be represented by Eq. (12a). Here 


p(x, p) = N. exp | ; 
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Ne is the total number of ensemble systems, E(x, p) is 
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the mechanical energy of the system, and y and 0 are 
parameters independent of x and p characterizing 
the canonical ensemble. From the fact that Eq. (12b) 
is valid, it follows that the two parameters y and 
0 are not independent, but that Eq. (13) applies. 
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The quantity Z is usually called the partition function. 
It is customary to define Z for a system of N identi- 
cal particles with a different multiplicative constant 
by Eq. (14). Here 4 is the Planck constant. It is now 
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very important that the parameters 0 and y be iden- 
tified with definite thermodynamic functions. This 
may be done by considering some of the thermody- 
namic relations. If 7 is the entropy, € the thermody- 
namic energy (internal energy), and w’ the free en- 
ergy (Helmholtz free energy), then some examples 
of thermodynamic relations are, for gases, described 
by pressure, volume, and temperature (P V, T) as in 
Eqs. (15). 
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See ENTROPY; FREE ENERGY; INTERNAL ENERGY; 
THERMODYNAMIC PRINCIPLES. 

From Egs. (12a), (6), and (13) the average energy 
E may be calculated, Eq. (16). For an ideal gas, in 
which the energy of a molecule is p*,/2m, Eq. (17) 
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holds. (The evaluation of the integrals is elementary; 
the space integrations merely contribute VY, and the 
momentum integrals are gaussian.) Equation (184) is 
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then obtained as the energy of an ideal gas, having 
N molecules in a vessel of any volume V. Now it is 
known by experiment that the thermodynamic en- 
ergy of 1 mole of an ideal gas is given by Eq. (180). 
Here R is the ideal gas constant, T is the absolute 
temperature, N is Avogadro’s number, and k is the 
Boltzmann constant. Comparison of Eqs. (18a) and 
(8b) leads to the identification of 0 = kT. It is also 
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easy to deduce the Maxwell-Boltzmann distribution. 
Applying Eq. (7) yields Eq. (19) for the probability 
P (pix) Pp, Ax, 
_ J- f dxi dapi Bxz--- Ppye 
- fo [ Bx» Bpye=/ 
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that molecule 1 will have a given range of positions 
and momenta. The canceled symbols in the numer- 
ator mean that the coordinates and momenta of the 
molecule 1 are not integrated over. The result given 
by Eq. (19) is again true for an ideal gas. The identi- 
fication of 6 = kT, however, is made in general. By 
differentiating Eq. (13) with respect to @, it is possible 
to establish from Eq. (18a) the general connection 
(using 0 = kT) written as Eq. (20). When this general 

E=-y- rê% =€ (20) 
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relation, deduced from the canonical ensemble, is 
compared with the general thermodynamic relation 
(15d), the parameter 6 and the free energy 6’ are in- 
deed seen to play identical roles. Again, for an ideal 
gas, w can be computed, and it is the same as the 
ideal gas free energy. Hence y and y’ are identified 
in general. Equation (13) relates the thermodynamic 
free energy to an integral containing the mechanics 
of the microscopic problem. It is also interesting to 
observe that a quantity n’ defined by Eq. (21) plays in 
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all instances the role of the entropy. In fact, it can be 
shown that the equation y = € — Tn’ is in harmony 
with Eq. (15a). These thermodynamic analogies can 
be followed through in all detail to see that the canon- 
ical ensemble combined with the relevant definitions 
does indeed reproduce the formal aspects of ther- 
modynamics. The average energy E for a canonical 
ensemble has already been calculated. It is impor- 
tant to know just how often appreciable deviations 
from this average energy can be expected. For this, 
the fractional fluctuations in energy given by nota- 
tion (22) must be calculated. The definition of E?, in 
Eq. (23), follows again from Eq. (6). From Eq. (23) 
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the quantity E? may be expressed directly in terms 
of Z, the partition function. For an ideal gas it is then 
possible to obtain, using Eq. (17), an explicit expres- 
sion, Eq. (24),for the fluctuations in energy. Since the 
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number of particles N is about 107 in systems dealt 


with in practice, the chance of observing a sizable 
deviation from the average energy is extremely small. 
Using these same methods, defining in addition the 
specific heat C, = dE/0T, it may be shown gener- 
ally that the fractional fluctuation in energy is given 
by Eq. (25). Even though the energy fluctuations are 
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negligible for an ideal gas, this is not always so for 
other systems. For solids at low temperatures, the 
fluctuations may be appreciable. When phase tran- 
sitions (of the first order) take place, C, becomes 
infinite, indicating via Eq. (25) that the fluctuations 
become very large. It is clear that the main problem 
of the applications is reduced to the calculation of 
the partition function Z; all thermodynamic entities 
follow from Eqs. (16) and (13). 

Grand canonical ensemble. There is yet another ensem- 
ble which is extremely useful and which is particu- 
larly suitable for quantum-mechanical applications. 
Much work in statistical mechanics is now based 
on the use of this so-called grand canonical ensem- 
ble. The grand ensemble describes a collection of 
systems; the number of particles in each system is 
no longer the same, but varies from system to sys- 
tem. The density function poWy,p,xX¥) dI y gives the 
probability that there will be in the ensemble a sys- 
tem having N particles, and that this system, in its 
6N-dimensional phase space Iy, will be in the re- 
gion of phase space dI y. The function p is given by 
Eq. (26). Here 0, Q, and m are the parameters char- 
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acterizing the ensemble, just as yy and 6 characterize 
the canonical ensemble. It is again true that these pa- 
rameters are directly related to thermodynamic state 
functions. The detailed argument follows the identi- 
cal pattern indicated in the discussion of the canon- 
ical ensemble. The normalization condition is now 
given by Eq. (27). The grand canonical average of 
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any quantity Q is given by Eq. (28) and the grand 
partition function by Eq. (29a). Again Eq. (29b) is 


Ox = Xf arvoove. x) On, x) (28) 
N 


fy oe f aye F (29a) 
N 


ii n 
Z= y = ae f dTye — (296) 


N 


customarily used for a system consisting of N iden- 
tical particles. From Eq. (27) it follows immediately 


that Eq. (30) holds. From Q, sometimes called the 
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grand potential, all other thermodynamic functions 
may be computed. The parameter ju is the chemical 
potential. It is defined by u = (0&/0.N)p 7 where & is 
the Gibbs free energy or thermodynamic potential; 
E€ = w+ PV. Formally, these results are written as 
Eqs. (31). From a knowledge of the grand partition 
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function as a function of V T, and p, all thermody- 
namic functions follow. For an ideal gas, for exam- 
ple, Eq. (29b) in conjunction with Eq. (17) yields 
Eq. (2a). Here A is the thermal de Broglie wave- 
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length, defined by Eq. 2b). From Eqs. (2a) and 
(30) it follows that Eq. G3) holds. Application of 
V 
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Eqs. (31) yields the usual results for ideal gases, such 
as PV = NkT: however, the grand average of N en- 
ters the relations now, rather than just V. From the 
definitions (28) and (29b) it can be shown that the 
fractional fluctuation in the number N is given by 
Eq. (4). Thus for gases, the fluctuations in num- 
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ber are negligibly small, showing that the number of 
particles in the grand canonical ensemble is sharply 
peaked around the average value. The physical re- 
sults deduced from the canonical and grand canon- 
ical ensemble can therefore be expected to be the 
same; the calculations, however, are frequently (es- 
pecially in quantum problems) simpler in the grand 
canonical than in the canonical scheme. See CHEMI- 
CAL EQUILIBRIUM; CHEMICAL THERMODYNAMICS. 

Applications. Two of the newest and most impor- 
tant applications of statistical physics involve the the- 
ory of nonideal gases and the theory of nonequilib- 
rium states of dense gases and liquids. 
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Theory of nonideal gases. The importance of relations 
such as Eq. (13), which connects the thermodynamic 
free energy to the partition function, lies in the gen- 
eral validity of these connections. Specifically, when 
dealing with a gas of interacting molecules, where 
the energy is given by Eq. (55) with the potential U 
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a known function, the relation is still valid. In that 
case, the partition function may be obtained directly 
in terms of Uby performing the integrations over the 
momenta as in Eq. (36). Since from Zy all thermody- 
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namic relations follow, Eq. (36) is the appropriate 
starting point for all studies of nonideal gases. The 
canonical ensemble has been used in obtaining Zy, 
which depends on V through the integration limits, 
on T through å [see Eq. (32b)], and on N through the 
number of integrations. Since Zy gives the free en- 
ergy immediately, the equation of state (the relation 
between P V, and T) can be derived by computing 
P=-—dw/dV. The experimental results on the equa- 
tions of state may be expressed in terms of the so- 
called virial expansion, Eq. (57). Here B(T) and CCT) 
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are the second and third virial coefficients; they are 
determined experimentally. It is clear that Eq. (37) is 
a development starting from the ideal gas equation. 
A first task for statistical mechanics is to deduce the 
form (37), as well as an explicit expression for BC1), 
and so on. A first step in carrying this out is to set 
up relation (38a), and to assume that the potential 
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U(x, ..., Zn) is such that it is possible to make an un- 


ambiguous distinction between interacting and non- 
interacting particles. Any potential which vanishes 
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for a separation larger than a critical amount has 
this property, and any short-range potential approx- 
imates it. A separated configuration may be defined 
as one in which none of the molecules of one group 
interacts with any of another. If two such groups are 
called a and £, Eq. (380) is satisfied for such a config- 
uration. Hence, by relation 38a), Eq. G8c) holds. A 
set of functions $ may now be defined by Eqs. (38d). 
From Eqs. (38c) and @8d) the following important 
property can be proven: the S functions vanish for a 
separated configuration. For instance S(1,2, ..., D is 
zero, unless the molecules 1, 2, ..., / form a nonsep- 
arated configuration. This is an important property, 
for it allows the integration of the S functions in a sim- 
ple fashion. This can be illustrated by the example 
of integral 39a). To evaluate this integral it is useful 
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to first imagine that molecule 1 is fixed somewhere 
in the middle of the vessel. Since the S functions 
vanish for a separated configuration, the / molecules 
must all be quite near to molecule 1. Roughly speak- 
ing, if a is the range of the molecular forces, integral 
(9a) gets contributions only from a range of about 
la around molecule 1. If /a is smaller than V'”, the 
first 7 — 1 integrations are independent of V. The 
last integrations over the coordinates of 1 just con- 
tribute V, for molecule 1 can be placed anywhere in 
V. (Wall effects are ignored.) Thus Eq. (396) holds. 
The /! is a normalization factor and b; is indepen- 
dent of V. The b; are called cluster integrals. Using 
Eqs. (36), G8a), G8d), and (39b), the pressure may 
be obtained in terms of the cluster integrals as in 
Eq. (40). Thus, the experimental form of the virial 
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development has been deduced, while the cluster 
integrals are related to the experimental virial coef- 
ficients by Eqs. (41a) and (41b). Using Eq. (9b) for 
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l = 2 gives a direct relation between B and the inter- 
molecular force potential U(r) as in Eq. (41c). This is 
a “perfect” statistical formula. In the case of helium, 
where both U(r) and B are known, Eq. (410 may 
in fact be checked. [At low temperatures, quantum 
effects, not included in Eq. (410), begin to play an 
important role.] 


The complete equation of state can be obtained 
using a similar procedure. The result comes out in 
an implicit form, as in Eqs. (42a) and (42b), derived 


P=kTXD,z! (42a) 
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by J. E. Mayer. In principle, the auxiliary variable z 
can be eliminated between Eqs. (42a) and (42b) to 
obtain a relation between P V, and T. Of course, 
to have a useful relation it is necessary to know the 
general character of the cluster integrals, and, in fact, 
this is not known, except for special molecular mod- 
els. A question which has concerned physicists for 
some time is whether or not the system of equa- 
tions (42) actually predicts a condensation pheno- 
menon. It is well known that every real gas at a low 
enough temperature will condense when the vol- 
ume is decreased. During the condensation process 
the pressure remains constant, the onset of conden- 
sation being marked by a discontinuity in the slope 
of the isotherm. The problem is now whether this 
information can be obtained from Eqs. (42). An inter- 
esting clue was obtained by Mayer. He showed that 
for low enough temperatures, bT) & constant x b’. 
Substituting this relation in Eq. (42b) yields Eq. (43). 
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Since the last power in the series is N & 10°%, the se- 
ries given by Eq. (43) is radically different for boz < 1 
and boz > 1. If boz > 1, boD" is a very large number 
and a change in N/V will cause a very slight change 
in z, hence a very slight change in P by Eq. (42a). 
Therefore, boz = 1 separates two ranges: If boz < 
1, changes in N/V cause reasonable changes in P; if 
boz > 1, P becomes quite insensitive to changes in 
N/V. This qualitative idea has been refined in many 
ways, by rigorously studying limit (44). These stud- 
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ies have clarified some aspects of the situation, but 
even so the problem is still not completely settled. 
See GAS. 

Nonequilibrium theory of liquids. It was observed in the 
earlier discussion of Liouville’s theorem that the Liou- 
ville equation usually does not help in setting up 
an appropriate ensemble. Yet the Liouville equa- 
tion has become the starting point for an impor- 
tant development aimed at an understanding of 
the nonequilibrium states of dense gases and liq- 
uids. This development starts from the Liouville 
equation (45). The interpretation that p@, ..., 
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Pzy;bdT is the probability of finding the molecules 
of the system in their prescribed momentum and po- 
sition ranges may now be used. A set of probability 


functions (or distribution functions) can be defined 
by Eqs. (46). The function /; is (apart from constants) 
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the Boltzmann distribution function. The canceled 
symbol d(1) means that the variables x1, y1, 21, Dx, 
Py, Pa are not integrated over. The higher distribu- 
tion functions become important for dense systems. 
Ifit is assumed that the potential energy of the system 
consists of an external potential Vo, and a potential 
U which is additive, then integration of Eq. (45) over 
all coordinates except those of molecule 1 yields an 
equation for fı, Eq. (47). 
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The equation for f; involves f1, and so on. 
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The discussion of this hierarchy forms the basis 
of the study of the nonequilibrium phenomena in 
dense gases. The basic difficulty is that the equations 
for f; always involve a function f1. To obtain an equa- 
tion for a single /; function, it is necessary to make a 
guess or assumption about the way in which a higher 
Jf function can be expressed in terms of lower ones, if 
indeed this can be done at all. A frequently discussed 
possibility is the so-called superposition approxima- 
tion relation (48). An appeal to probabilities of inde- 
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pendent events makes relation (48) appear reason- 
ably plausible. However, it has not been proven that 
the approximation of relation (48) is indeed consis- 
tent with the system of Eq. (47). The use of relation 
(48) in Eq. (47) leads to (nonlinear) integral equa- 
tions. In spite of the considerable amount of work 
done with these equations, the results still have not 
led to great advances in the theory of dense gases. 
However, for additive central potentials, defined by 
Ud, ..., N) = X U(r;), it is possible to express the 
equation of state in terms of the pair distribution 
function. This function is defined by Eq. (49a). Here 
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gives the positional distribution of molecular pairs. 
As such it is less detailed than /2, which gives the 
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distribution of pairs both in positions and momenta. 
Apart from combinatorial factors, Eq. (49b) is valid. 
For central additive potentials, it may now be shown 
that 772 is in fact a function of the distance 1,2 between 
the molecules: 72 « g(r), where g(r) is defined as giv- 
ing the number of molecules between r and r + dr. 
See INTEGRAL EQUATION. 

An analysis similar to the one given in the preced- 
ing discussion on nonideal gases now leads to an 
equation of state, Eq. (50), in terms of g(r) alone. 


pv = nkr — ZAO D f o Er dr (50) 

3V dr 
The integral equation referred to for the f2 functions 
(once the superposition approximation is made) 
may now be expressed as an approximate integral 
equation for g(r). This equation, obtained by M. 
Born, H. S. Green, and J. Yvon, leads to approxi- 
mations for the virial coefficient, which are fairly 
good but not excellent. The pair distribution func- 
tion g(r) may be obtained directly from experimen- 
tal x-ray scattering data. In principle experimen- 
tal x-ray data could be used to obtain via Eq. (50) 
data about the equation of state. For this to be a 
feasible procedure would demand accuracies much 
beyond the present limits, as well as extensive mea- 
surements over a range of temperatures and densi- 
ties. Unfortunately, therefore, this is not a possible 
procedure at present, although the formal relation 
(50) is generally valid. A current theory of liquid 
helium makes use of the relation between the pair 
distribution function and neutron-scattering data. 
See LIQUID HELIUM; NEUTRON DIFFRACTION; X-RAY 
DIFFRACTION. 

Quantum statistical mechanics. The ensemble tech- 
niques can also be applied to systems which are de- 
scribed in terms of quantum mechanics. Consider 
an ensemble, where each system is described by a 
hamiltonian operator H. Let w°(x,D be a wave func- 
tion ( stands for x), ... Xy, and œ characterizes 
an ensemble member). The Schrödinger equation is 
written as Eq. (51). 

Q 
HY” (x, DH =~ es) 
i ot 
See NONRELATIVISTIC QUANTUM THEORY. 
If ~,@) is a complete orthogonal set, Eq. (52), the 
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function Y“ may be developed in terms of the set 
{ọ} as in Eq. (53). Here |a%(|? is the probability 
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that ensemble member g, at time t, is in state n. Of 
course, Eq. (54 is valid. 
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If G is a quantity which is defined for each en- 
semble system by G“, then an ensemble average of 
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G is defined by Eq. (55a), where N, is the number 


— 1 
G= Wee (55a) 


G= Í WO Gop * 55b) 


of ensemble members. It is important to distinguish 
the ensemble average from the ordinary quantum- 
mechanical average, which is defined for a single sys- 
tem as in Eq. (55b). The notation, double bar for en- 
semble average, single bar for quantum-mechanical 
average, stresses the difference. Of special impor- 
tance in the calculation of averages is the density 
matrix. The matrix elements of p, the density ma- 
trix, relative to the set of functions g are defined by 
Eq. (56a). The sum of the diagonal elements of p, 
called the trace of p, is given by Eq. (56b). The den- 
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sity matrix p is the counterpart of the classical den- 
sity function. From the Schrödinger equation (53) 
the analog of the Liouville theorem, Eq. (57), can 


dPmn 
dt 





be deduced immediately. Here [a,b] = ab — ba is 
the commutator. Finally the basic relation giving the 
ensemble average of any operator, Eq. (58), can be 


G = Tr(pG) (58) 


obtained. The fact that observable entities, such as 
ensemble averages, come out in the form of a trace, 
implies that the calculated results are independent of 
the set of functions Q. This is true because a change 
in this basic set will induce a similarity transforma- 
tion on both p and G, and the trace operation is 
invariant under such a transformation. See DENSITY 
MATRIX. 

An ensemble in which all ensemble members are 
in the identical state (% is independent of œ) con- 
stitutes a pure case. An ensemble in which the y“ 
do depend on « is a statistical mixture. A neces- 
sary and sufficient condition for a pure case is that 
p? =p. 

At equilibrium, the system is the counterpart of the 
canonical ensemble. Then the density operator [see 
Eq. (12a@)] is written as Eq. (59a). The free energy is 


p = eh Dirt (59a) 
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again directly related to the partition function as in 
Eq. (59b). In Eq. (60),Q is an operator e? defined by 
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the series expansion. The thermodynamic relations 
follow as before. 

The construction of an ensemble in quantum me- 
chanics must be performed in harmony with the 
knowledge of the system. If the system is known to 
be in either one of two quantum states, the ensem- 
ble members must be evenly distributed over these 
states with, in addition, random phases. It is impor- 
tant to include the requirement of random phases; 
if this were not done, a uniform stationary ensem- 
ble would not even remain stationary. See QUANTUM 
STATISTICS. Max Dresden 
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A collection of strategies and statistical methods for 
ensuring high-quality results in manufacturing and 
service processes. Traditionally, the statistical meth- 
ods associated with statistical process control (SPC) 
have focused on process monitoring. As a result, 
some experts argue that statistical process control 
is a misnomer and that the term statistical process 
monitoring is more appropriate. Others argue that 
statistical quality control methods have become so 
entwined with strategic decisions and daily opera- 
tions that the term statistical process control is en- 
tirely appropriate. Regardless, statistical process con- 
trol methods and philosophies are essential tools for 
survival in today’s highly competitive environment. 
See PROCESS CONTROL; QUALITY CONTROL; STATIS- 
TICS. 

The origins of statistical process control coincide 
with a shift in focus by manufacturing companies 
from quantity to quality. The pioneers in this transfor- 
mation include W. Shewhart, W. E. Deming, J. Juran, 
K. Ishakawa, G. Taguchi, and others. At the heart 
of this transformation were two simple ideas: un- 
derstanding the customer’s needs and meeting the 
customer’s needs consistently. Accomplishing both 
of these tasks while minimizing wasted time, effort, 
and resources insured a competitive advantage. The 
success of organizations employing quality-oriented 
management styles led to a revolution in manage- 
ment tools and philosophies. 

Total quality management. Total quality manage- 
ment (TQM) gained strength in the 1980s. The ex- 
pression total quality management refers to both 
the philosophy and practices associated with con- 
tinuously improving processes within a company. 
An all-encompassing view of processes included not 


only manufacturing operations but also service areas 
such as sales and accounting functions. Companies 
adopted various strategies for continuous improve- 
ment with varied levels of commitment under an as- 
sortment of names. Total quality control, continuous 
quality improvement, and total quality excellence 
are synonyms for total quality management. These 
programs had several common themes. An impor- 
tant commonality was reliance upon actual process 
data to monitor systems, inform decision makers, and 
guide the continuous improvement activities. The 
tools associated with statistical process control are 
fundamental building blocks in these quality-driven 
programs. International standards were eventually 
established to ensure that total quality management 
programs, while varied, followed certain basic prin- 
ciples and practices. The standards known as ISO 
9000:2000 are a product of the International Stan- 
dards Organization (SO), the American Society for 
Quality (ASQ), and the American National Standards 
Institute (ANSD. 

Six Sigma. The ideas surrounding total quality 
management grew into the Six Sigma quality move- 
ment, which continues to evolve. Among the unique 
features of Six Sigma programs are the certification 
levels for managers trained in statistical problem solv- 
ing and the use of the DMAIC process. The acronym 
DMAIC refers to five steps: define/deploy, measure, 
analyze, improve, and control. 

The certification process begins with training 
in basic statistical methods and continuous quality 
improvement concepts. Managers mastering these 
initial topics are called “green belts.” Through a se- 
ries of training classes and real-time projects, man- 
agers ascend to the ranks of “black belt” or “master 
black belt.” Managers receiving the higher-level des- 
ignations must demonstrate an understanding of the 
philosophies and tools of continuous improvement, 
as well as the ability to put these ideas into practice 
with substantial success, as measured in long-term 
savings in real dollars. 

The DMAIC problem-solving approach is central 
to Six Sigma’s successful use of statistical process 
control methods throughout an organization. The de- 
fine/deployment phase involves selecting a project 
leader (black belt), defining the project, and cre- 
ating a project support structure. The measure- 
ment phase captures current project-related wis- 
dom. Helpful tools used in this phase include many 
traditional quality control tools such as flow charts, 
check sheets, cause-and-effect diagrams, histograms, 
scatter plots, and control charts. These tools help 
identify key input and output variables used to con- 
trol and measure process performance. The analy- 
sis phase provides a comprehensive understanding 
of process data and procedures. Process changes 
(improvements) are then recommended and tested 
using designed experiments or optimization tech- 
niques. Changes that result in improvements are 
permanently incorporated in the control phase, as 
are activities such as determining control plans, up- 
dating standard operation procedures, and mistake- 
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proofing processes. See CONTROL CHART. 

The preceding paragraphs highlight the seamless 
incorporation of statistical tools into the day-to-day 
working of management. Traditional quality control 
techniques such as control charts are now com- 
plemented by appropriate statistical methods such 
as designed experiments and modern technologies. 
Hence, statistical process control tools are used be- 
yond quality engineering. The result is a blurring of 
the boundaries that traditionally defined statistical 
process control. 

Quality control charts. All processes display vari- 
ability. Processes include all meaningful activities 
that may be measured in some way. The process of 
getting to work in the morning or serving a customer 
may be measured in time. The process of surgery may 
be measured by survival rates, recovery times, or in- 
fection rates. The process of manufacturing an object 
may be measured by yield rates, dimensional proper- 
ties of the object, or conformance to customer speci- 
fications. Determining whether a process is perform- 
ing in a stable and predictable manner or displaying 
unusual behavior is the traditional domain of statisti- 
cal process control. 

All measurable processes exhibit variation that can 
be characterized as belonging to one of two classes: 
common-cause variation and special-cause variation. 
Common-cause variation is often unexplainable and 
unavoidable without fundamental changes to the 
underlying process. Special-cause variation is at- 
tributable to a specific cause that may be identified 
and corrected. The quality control chart is used to de- 
tect and address special-cause variation. Additionally, 
the control chart is used to quantify and avoid overre- 
action to common-cause variation. Overadjustment 
of equipment by operators can result in a deteriora- 
tion of overall quality. 

Underlying logic. To illustrate the usefulness of con- 
trol charting, consider a process for manufacturing 
an automobile part. The logic of control charting is 
simple, as outlined in the following steps: 

1. Select an appropriate measure for a given qual- 
ity characteristic. One may be interested in the 
length of the manufactured part. 

2. Collect samples at regular intervals. One might 
randomly select four parts every 2 hours for inspec- 
tion. 

3. Calculate an appropriate statistic from the sam- 
ple data. The sample mean (average of the lengths 
for the four parts) would help determine if parts are 
of appropriate lengths. The sample range (difference 
in the lengths of the longest and shortest parts in the 
sample) would help determine if the parts are con- 
sistently the correct length. 

4. Plot the calculated statistic(s) over time and 
compare the plotted values to predetermined val- 
ues, called control limits. Unusual patterns or ex- 
treme observations beyond the control limit could 
indicate special-cause variation to be identified and 
corrected by operators. Such patterns or extreme 
values are called signals. Values falling between the 
control limits indicate common-cause variation, and 
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operators would allow the process to run without 
interruption. 

The control chart is displayed graphically to allow 
easy use by operators. For example, the illustration 
shows the control charts for the mean length of parts 
and the range in the length of parts. For each sample, 
the average length for four randomly selected parts 
is calculated and plotted on the mean chart, and the 
sample range for the lengths is plotted on the range 
chart. Observations falling beyond the control limits 
of a control chart are called signals. The mean chart 
signal at time = 21 suggests that something unusual 
has occurred that requires the operator’s attention. 
No values are beyond the control limits of the range 
chart, which indicates that the problem affects only 
the part length, not the process consistency (that is, 
the process is consistently making bad parts). 

Guidelines for important questions. While the steps out- 
lined in the previous section appear straightforward, 
careful consideration is required at each step. 

Step 1: Get the right data. Selecting the best mea- 
surement method for a quality characteristic with re- 
gard to process information and practical considera- 
tions, such as safety, time, and ease of obtaining mea- 
surements, is obviously process-specific. There are, 
however, guidelines governing variable selection. 

Control charts typically fall into two groups: at- 
tributes and variables. Attribute data involve counts 
or proportions. For example, a sample of 100 items 
might be inspected and the number of items not 
meeting customer requirements reported. Each item 
would be simply classified as good or bad depend- 
ing on some quality attribute dength is acceptable 
or not acceptable). On the other hand, continu- 
ous variable measurements involve lengths, weights, 
times, or measurements that could obtain any value 


within a given range. For example, actual lengths of 
parts might be recorded. Continuous variable mea- 
surements are inherently more informative than at- 
tribute measurements. For example, suppose cus- 
tomer specifications require that parts have lengths 
of 12 + 0.03 in. Simply knowing that a part is not 
the correct length is less informative than knowing 
that the part length is 12.04 in., which indicates both 
the direction and magnitude of the problem. The ad- 
ditional information contained in variable measure- 
ment allows more powerful monitoring schemes and 
helps operators in diagnosing problems. Continuous 
measurements should be used where possible. See 
INSPECTION AND TESTING. 

Step 2: Get the right amount of data from the 
right place. The idea of rational subgroups is help- 
ful in determining sampling locations. One should 
collect data at locations that allow operators to iso- 
late potential problems. For example, if three pro- 
duction lines produce identical parts, sampling at the 
end of each line is preferred to sampling after parts 
from the three lines have been merged. After the 
parts have been mixed together, it may be difficult 
to determine which production line is experiencing 
problems. 

Selecting how many items should be in each sam- 
ple and how often samples should be collected re- 
quires technical knowledge of control chart perfor- 
mance. Generally, small samples taken frequently are 
preferred to large samples taken infrequently. 

Step 3: Summarize the data in an appropriate man- 
ner. This involves the statistics that should be cal- 
culated. Data summaries (sample statistics) should 
answer two questions: Is the process on target (for 
example, parts are on average the correct length)? Is 
the process behaving in a consistent manner? 


Continuous variables data provide better answers 
to these questions than do attribute data. Data sum- 
marized by averaging sample values (or weighted av- 
erages of past values) are used to determine if a pro- 
cess is on target. Practitioners refer to these statistics 
as measures of location. The sample range or sam- 
ple standard deviation is useful in quantifying consis- 
tency of process behavior. Such statistics are referred 
to as measures of variation. 

Step 4: Plot the summary statistics and control 
limits. The value(s) of the control limit(s) depends 
on the statistic being plotted, the sample size, and 
the tolerable frequency of false alarms. A signal on 
a control chart is called a false alarm if no actual 
problem exists. The average number of samples re- 
quired for a control chart to signal a process problem 
is called the average run length (ARD). A large ARL 
is desired for problem-free (in-control) processes, 
while small ARLs are desired for processes with 
problems (out-of-control processes). General guide- 
lines for determining control limits are provided in 
most introductory texts on statistical process control 
methods. 

Additional comments. The appropriate selection and 
design of control charts has received much atten- 
tion over the years. Performances of the most com- 
mon control charts are well understood. Information 
concerning the performance and selection of control 
charts may be found in many textbooks. In addition 
to common control charts, there are many special- 
ized monitoring techniques available to practition- 
ers. Multivariate control charts are used when several 
important process variables are related to one an- 
other. Forecast-based or regression-based charts are 
useful when data values change systematically over 
time. Process change due to tool wear is an exam- 
ple. Practitioners are advised to select an appropriate 
technique for each process. 

A second area of caution associated with con- 
trol charting is the relationship between control 
limits and specification limits. The relationship is 
extremely simple because there is no relationship. 
Specification limits reflect the customer’s require- 
ments. Control limits reflect the common-cause vari- 
ation in a process and the statistic plotted on the 
control chart. The customer’s needs are not deter- 
mined by the process, nor are process variations de- 
termined by the customer. 

The primary goals of control charting are to iden- 
tify and eliminate special causes of variation, ensure 
that the process remains in control, and quantify 
common-cause variation. Once a process is behaving 
in a stable and predictable manner, one may evalu- 
ate the capability of the process and use designed 
experiments to gain further improvements. 

Capability indices. One is always interested in 
knowing if inherent process variation allows one to 
meet customer needs. A capability index is a numer- 
ical measure relating the demands of the customer 
to the capability of the producer. There are many 
versions of capability indices, just as there are many 
types of control charts. The appropriate index de- 
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pends on the properties displayed by process data. 
Most capability indices are a ratio of customer spec- 
ifications and common-cause process variation. For 
example, one capability index is defined by 


(USL — LSL) 
G=— 


where ô represents the estimated standard deviation 
associated with common-cause variation, and USL 
and LSL represent the customer’s upper specifica- 
tion limit and lower specification limit, respectively. 
A large C, value indicates that the inherent process 
variation (66) is small relative to customer specifi- 
cation limits (VSL—LSL). Thus, the process is very 
capable of meeting customer requirements. 

A small capability ratio indicates that a process 
is not capable of meeting customer requirements 
without fundamental changes in the process and/or 
operation procedures. Process capability must be im- 
proved by identifying the most efficient and appro- 
priate process changes that ensure that the output is 
on target, with minimal variation. 

Designed experiments. Designed experiments 
are powerful techniques for improving process 
capability. Statistically designed experiments allow 
systematic investigation of factors that affect process 
performance. These efforts can improve processes in 
several ways. Greater understanding of process vari- 
ables aids diagnostic efforts, leads to process optimi- 
zation, and increases process robustness to nui- 
sance variables that are difficult or impossible to 
control. As processes are improved, control charts 
must be updated to reflect the new and improved 
process behavior. Process capability ratios must 
be recalculated to reflect the increased process 
capability. 

The cycle of monitoring and stabilizing processes, 
determining their capability, and improving process 
capability through statistically designed experiments 
is a powerful approach to quality improvement. 
Understanding customer needs in a quantifiable man- 
ner, communicating these requirements to opera- 
tions, and focusing attention on consistently meet- 
ing customer needs seem only logical. Statistical 
process control techniques not only maximize the 
chance of gaining a competitive advantage but also 
allow gaining an advantage at a minimum of expense. 
These techniques have been useful in pure research 
and development for many years. As understanding 
of their usefulness grows statistical process control 
methods will remain a key tool in the management 
of manufacturing and service processes. See EXPER- 
IMENT; INDUSTRIAL ENGINEERING; MANUFACTURING 
ENGINEERING. Benjamin M. Adams 
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Statistics 


The field of knowledge concerned with collecting, 
analyzing, and presenting data. Not only workers in 
the physical, biological, and social sciences, but also 
engineers, business managers, government officials, 
market analysts, and many others regularly use sta- 
tistical methods in their work. The methods range 
from simple counting to complex mathematical sys- 
tems designed to extract the maximum amount of 
information from very extensive data. 

In an important sense statistics may be regarded as 
a field of application of probability theory. The com- 
mon problem faced by a physicist reading a meter, an 
engineer testing a material, an agronomist measuring 
the yield of a hybrid corn, a chemist determining the 
concentration of ascorbic acid, and an interviewer 
studying public opinion is the problem of random 
variation which prevents repetition of exactly the 
same result when a measurement is repeated. Sta- 
tistical methods are employed to assess the magni- 
tude of random variation, to minimize it, to balance 
it out, to remove it by calculation procedures, and to 
analyze it by suitably arranged patterns of observa- 
tion. The theory of probability is concerned with the 
properties of random variables and hence furnishes 
the basis for developing techniques for controlling 
them. See PROBABILITY. 

Viewing statistics from another direction, it is the 
science of deriving information about populations by 
observing only samples of those populations. A pop- 
ulation is any well-specified collection of elements. 
Thus, one may refer to the population of adults in the 
continental United States viewing television screens 
at 8:14 P.M. on August 6, 1970; the population of 
automobiles less than 2 years old registered in Los 
Angeles County on a certain date; the population of 
vineyards in France; the hypothetical population of 
outcomes of tossing a given coin endlessly. Popula- 
tions may be finite or infinite. An element of a univari- 
ate population is characterized by the value of a ran- 
dom variable which measures some single attribute 
of interest in the population. Thus, one may be inter- 
ested in whether or not individuals of the television 
audience were or were not viewing program A; with 
each individual one may associate a random variable, 
let it be X, which takes on the value of 1 if the indi- 
vidual is watching A and 0 if he is not. If one were 
interested in a second characteristic of the elements 
of the television audience (such as age), one would 
be said to be dealing with a bivariate population; a 
third characteristic (such as economic status) would 
make it a trivariate or, less specifically, a multivariate 
population. 

Random variables are either continuous, which 
means they can take on any numerical value (the 
length of a room), or discrete, which means they 
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Fig. 1. Distribution of incomes. 


can take on only a restricted set of values (QUmber 
of windows in a room). 

Distributions. In a univariate population, the pop- 
ulation distribution is a curve (function of the ran- 
dom variable which characterizes the elements of 
the population) from which one can determine the 
proportion of the population which has elements in 
a certain range of the random variable. For exam- 
ple, the curve of Fig. 1 provides the distribution of 
annual incomes of family units. The total area under 
the curve is 1. The area under the curve between any 
two vertical lines gives the proportion of the fami- 
lies having annual incomes between the two values 
marked on the horizontal scale by the two vertical 
lines. Thus, the fact that the area under the curve be- 
tween $2000 and $6000 is 0.541 means that 54.1% 
of the family units have incomes in that range. 

The distribution is also referred to as the distri- 
bution function, the density function, the frequency 
function, or the probability density. 

The total area under the distribution curve to the 
left of each point can also be plotted to give a curve 
which starts at zero and reaches unity as the vari- 
able becomes large; the resulting curve is sometimes 
called the cumulative distribution function, the prob- 
ability distribution, or simply the distribution. The 
cumulative form of the curve of Fig. 1 is shown in 
Fig. 2; the height of the curve at any point on the 
horizontal scale equals the area to the left of that 
point under the curve of Fig. 1 and is the proportion 
of the population having incomes less than the value 
at that point. The distribution (in either frequency or 
cumulative form) gives complete information about 
the way the characterizing variable is spread through 
the population. 
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Population parameters. Populations (or population 
distributions) are often specified incompletely by 
certain population parameters. Some of these param- 
eters are location parameters or measures of central 
tendency; a second class of important parameters 
consists of measures of dispersion or scale parame- 
ters. 

The most widely used location parameters are the 
mean, the median, and the mode. The mean is 
the average over all the population of the values of 
the random variable. It is often represented by the 
Greek letter u. In mathematical terms, if x is the ran- 
dom variable, fx) the frequency function for a given 
population, and F(x) its cumulative form, then the 
mean is as shown in Eq. (1). 
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The median, often designated by Med, M, or X50, 
is anumber such that, at most, one-half the values of 
the variable associated with the elements of the pop- 
ulation fall above or below it, as in Eq. (2). The mode 
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is the most frequent value of the random variable; if 
the frequency function has a unique maximum value, 
the mode is the value of the random variable at which 
the frequency function reaches its maximum. Loca- 
tion parameters are numbers near the center of the 
range over which the random variable of the popu- 
lation varies; different ones arise from different def- 
initions of center; generally the mean is used unless 
special circumstances make some other location pa- 
rameter more appropriate. 

The extent to which a population is scattered 
on either side of its center is roughly indicated by 
measures of dispersion such as the standard devi- 
ation, the mean deviation, the interquartile range, 
the range, and sometimes others. The standard de- 
viation is the square root of the mean square of the 
deviations from the mean; it is usually denoted by 
the Greek letter o; o° is called the variance and is 
expressed by Eq. (3). The mean deviation, shown 
in Eq. (4), is the average over the population of the 
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deviations from the mean, all taken to be positive. 
The interquartile range (often denoted by Q) is the 
difference X75 — X25, where X75 is the value of the 
random variable such that one-quarter of the pop- 
ulation has values larger than X55, and X25 is the 
number such that one-quarter of the population has 


values smaller than X25. The three numbers, X 5, 
X50, X75, are called quartiles; these divide the pop- 
ulation into quarters. The range is the difference be- 
tween the largest and the smallest of the population 
elements. 

Samples. If one examines every element of a pop- 
ulation and records the value of the random variable 
for each, complete information is obtained about the 
distribution of the random variable in the population, 
and there is no statistical problem. It is usually im- 
possible or uneconomical to make a complete enu- 
meration (or census) of a population, and one must 
therefore be content to examine only a part or sam- 
ple of the population. On the basis of the sample, 
one draws conclusions about the entire population; 
the conclusions thus drawn are not certain in the 
sense that they would likely have been somewhat 
different if a different sample of the population had 
been examined. The problem of drawing valid con- 
clusions from samples and of specifying their range 
of uncertainty is known as the problem of statistical 
inference. 

Statisticians distinguish two kinds of samples. A 
survey sample is one chosen from a population, all 
the elements of which actually exist. An experiment 
is a sample chosen from a hypothetical population. 
Thus if one was interested in the total number of pigs 
being fattened in a given season by the farmers of the 
state of Iowa, the researcher would survey-sample 
the existing population of pigs in Iowa at that time. 
If the effect of a certain hormone on the growth 
rate of pigs was the subject, an experiment would 
be performed by giving a group of pigs the hormone 
for a period of time; using the sample of observations 
obtained from the experiment, the researcher would 
draw inferences about the hypothetical population 
of observations that would have resulted had all pigs 
been given the hormone. 

Random sampling. In planning a sample survey, the 
manner in which the data to be gathered will ful- 
fill the purpose should be clearly stated. The pop- 
ulation to be sampled must be explicitly defined. 
The method of sampling should be efficient and 
lead to a straightforward analysis. The question of 
what elements should be included in the popula- 
tion depends on the purpose of the survey. Thus 
the population of vineyards in France might include 
as a vineyard a dozen vines in the backyard of a per- 
son living in the heart of Paris if the purpose were 
to estimate total grape production of France; but 
the population might be defined to exclude such 
a small vineyard if the purpose were to estimate 
the size of the harvest to be available to commercial 
wineries. 

The type of sampling of interest in statistics is prob- 
ability sampling, because it eliminates subjective as- 
pects from the selection of the sample. In probabil- 
ity sampling, all possible distinct samples are known, 
the selection of the sample is done randomly accord- 
ing to a preassigned probability, and the method of 
analysis is predetermined and unambiguous. Only 
from such samples can inferences about populations 
be made with measurable precision. 
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Simple random sampling is a method of selecting 
a sample of n elements out of a population of N ele- 
ments so that all such samples have an equal proba- 
bility of being drawn. This may be done by selecting 
a first element at random from the population, then a 
second element at random from the remaining pop- 
ulation, and so on until the n elements are selected. 
Because an element cannot appear more than once 
in the sample, this is a form of sampling without re- 
placement. The sampling ratio or sampling fraction 
is n/N. 

Often the purpose of drawing a sample is to esti- 
mate the mean or average of a characteristic of the 
population. If y; is the value of the characteristic of 
the 7th unit, then the population mean is as shown in 
Eq. (5). The sample mean is the average of the n units 
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in the sample and is expressed in Eq. (6). Here the n 
xs are the n y’s selected from the population as the 
sample. The population total is Nu and is estimated 
by Nx. The population variance is defined as shown 
in Eq (7), and the sample variance is usually defined 
as in Eq. (8), although some authors use n instead 
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of n — 1. The quantities x and s?, the sample mean, 
and the sample variance are called sample statistics; 
they are the first and second sample moments and 
are also estimators of the corresponding population 
parameters, u and o°. 

The estimators are themselves random variables. If 
one repeatedly drew samples of size n from the pop- 
ulation and computed x from each sample, he would 
obtain a population of x’s with its own distribution 
which would differ from the distribution of x. It can 
be shown that the * population has exactly the same 
mean u as the x population. Further, the variance of 
the X population is as shown in notation (9), where 
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o? is the variance of the x population. The first frac- 
tion is ordinarily nearly unity so that the variance of 
X is approximately the fraction 1/n of the original 
population variance. As n, the sample size, becomes 
large, the X population becomes more concentrated 
about u and the reliability of a particular value of x 
as an estimate of the population mean increases. 
The observations of a sample, besides providing 
estimates of population parameters, can also be used 





Fig. 3. Histogram. 


to obtain an estimate of the population’s frequency 
function. This estimate is determined by dividing the 
range of the sample observations into several inter- 
vals of equal length Z and counting the number of 
observations occurring in each interval; these num- 
bers are then divided by nL to determine fractions 
giving the relative density of the sample occurring 
in each interval; then on a sheet of graph paper one 
lays out the intervals on a horizontal axis and plots 
horizontal lines above each interval at a height equal 
to the fraction corresponding to the interval; finally 
the successive plotted horizontal lines are connected 
by vertical lines to form a broken line curve known 
as a histogram (Fig. 3). The area under the curve is 
unity, and the area between any two points gives the 
fraction of the sample observations lying between 
those two points. If one takes larger and larger sam- 
ples, the chosen intervals can be made smaller and 
the broken line curve will come closer and closer 
to the underlying population frequency function. 
Often one does not trouble to divide the interval 
frequencies by nL to normalize the area of the his- 
togram, but merely plots the frequencies themselves; 
the resulting broken line curve is still referred to as a 
histogram. 

Sampling techniques. When a population can be re- 
garded as being made up of several nonoverlapping 
subpopulations, one may draw a sample from it by 
drawing a simple random sample from each subpop- 
ulation (or stratum); this procedure is called stratified 
random sampling. The method is employed when it 
is desired to have specific information about each 
stratum individually, when it is administratively con- 
venient to subdivide the population, when there are 
natural strata, or when a gain in precision would be 
realized because each stratum is more homogeneous 
than the whole population. 

Let n be the number of units sampled in the en- 
tire population of N units, and let n, be the number 
of units sampled in stratum þ containing N, units. 
Therefore, the sum of all the n, is n, and of all the N, 
is N. Let o, be the true standard deviation within stra- 
tum b. When n,/n = N;,/N,, the ny are said to be pro- 
portionately allocated. When 1,/n = N,o7,/UNpO p, 
the n, are said to be optimally allocated because 
the variance of the estimated mean is then small- 
est for fixed total size of sample. Ordinarily, if the op 
are well estimated, optimum allocation gives greater 
precision than proportional allocation, and propor- 
tional allocation gives greater precision than simple 
random sampling. 

Systematic sampling may be regarded as a sam- 
pling of units at regular intervals in the population, 


for example, every tenth unit. Usually this is done 
witha random start; that is, the first unit in the sample 
is selected at random from a small group at the begin- 
ning. A systematic sample is usually relatively easy 
and fast to execute, and if the analogy to stratified 
sampling is valid, a systematic sample will be more 
accurate. However, because of the essentially non- 
random nature of a systematic sample, it is difficult 
to estimate the sample variance unless certain as- 
sumptions are made about randomness in the order 
of the population. If the population has any peri- 
odicity, then the estimates made from a systematic 
sample can be quite poor. 

When the elements of a population are in mutu- 
ally exclusive groups called the primary units of the 
population, then a sample of these primary units 
might be made, and then, from those selected, a 
sample of the individual elements would be made. 
This procedure is called two-stage sampling or sub- 
sampling. Multistage sampling can involve more than 
two stages of sampling. Although more complicated 
and difficult to apply and to analyze, multistage 
sampling offers considerable flexibility for balanc- 
ing between statistical precision and the cost of 
sampling. 

When too little is known about a population to 
plan a sample, then two samples are made. The man- 
ner in which the second sample is taken is deter- 
mined from the results of the first sample. In Stein’s 
method of two-stage sampling, the size of the addi- 
tional sample is decided by using the estimate of the 
variance from the first sample. In double sampling or 
two-phase sampling, the first sample is used to gather 
information on a variate associated with the variate 
of interest. This information is used to establish strata 
for drawing a stratified random sample involving the 
variate of interest. 

Ratio and regression estimates are often used 
when observations are made on one or more vari- 
ates related to the variate of primary interest. 

Sequential sampling and sequential analysis refer 
to a method of sampling in which the size of the 
sample is not specified in advance. Observations are 
drawn one by one until a specified degree of con- 
fidence in the information to be obtained has been 
achieved. 

Sampling distributions. Many important sampling 
distributions are derived for random samples drawn 
from a normal or gaussian distribution, which is a 
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Fig. 4. Normal distribution. 


bell-shaped symmetrical distribution centered at its 
mean 4. The distribution is illustrated in Fig. 4 for 
three different values of o, the standard deviation. 
Equation (10) is the mathematical equation for these 
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curves. The area under the curves for the indicated 
limits is as follows: 


Area Limits 
0.50 u — 0.6750 to 4+ 0.6750 
0.683 u—o touto 
0.90 u — 1.6450 to u + 1.6450 
0.95 u — 1.9600 to u + 1.9600 
0.99 u — 2.3260 to u + 2.3260 


Referring to the above limits, 0.6750 is called the 
probable error inasmuch as the odds are even that a 
randomly drawn observation will lie within 0.6750 
of the mean. 

If samples of size n are drawn from a normal pop- 
ulation and the sample mean ¥ is computed for each, 
the x’s will have a normal distribution with the same 
mean and with variance o2/n. Thus, if samples of size 
4 were drawn from a population distributed by the 
curve marked o = 1 in Fig. 4, then the means of those 
samples would be distributed by the curve marked o 
= 0.5 (that is, 1/4). Further, the probability would 
be 0.95 that a particular x so drawn would lie with 
(1.960)(0.5) = 0.98 of the population mean p. 

The central limit theorem of the theory of prob- 
ability states that under very general conditions the 
sample mean X is approximately normally distributed 
whatever may be the distribution function for the un- 
derlying population. This powerful theorem enables 
one to make probability statements such as the one 
at the end of the preceding paragraph in ignorance 
of the actual population distribution. 

Besides the distribution of the mean, two other 
sampling distributions derived from the normal dis- 
tribution have wide application. One is the chi- 
square distribution, x7, which provides the distri- 
bution of the sample variance, as shown in Eq. (11). 
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Here x1, X2, ... , Xn are the observations of a random 
sample of size n drawn from a normal population. 

The quantity given in Eq. (12) has a distribution 

2 
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illustrated in Fig. 5 for three values of n. The math- 
ematical form for the distribution curve is shown in 
notation (13) and is often referred to as the chi-square 


@-1)/2 
ao) 
' a3) 


distribution with n — 1 degrees of freedom. 
The most useful random variable for interval 


Statistics 


387 


388 


Statistics 














Fig. 5. Chi-square distributions. 


estimation of a population mean is shown in Eq. (14), 
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which has a symmetrical distribution very similar in 
appearance to the curves plotted in Fig. 4. The math- 
ematical form for the distribution is notation (15). 
This is referred to as the ¢ distribution or the Student 
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distribution with n — 1 degrees of freedom. The fol- 


lowing gives limits which include 95% of the area 
under the curve for a few values of n: 





n Limits 
2 —12.71 to + 12.71 
3 —4.30 to + 4.30 
4 —3.18 to + 3.18 
10 —2.20 to + 2.20 
25 —2.06 to + 2.06 
120 —1.98 to + 1.98 


For very large sample sizes, the limits become —1.96 
to +1.96, as for the normal distribution. An illustra- 
tion of the use of these limits is given below. 

Estimation. In making an estimate of the value of a 
parameter of a population from a sample, a function 
(called the estimator) of the observations is used. 
For example, for estimating the mean of a normal 
population the mean of the sample observations is 
usually taken as the estimator. Another estimator is 
the average of the two most extreme observations. In 
fact, there is an infinitude of estimators. The problem 
of estimation is to find a “good” estimator. 

A good estimator may be regarded as one which re- 
sults in a distribution of estimates concentrated near 
the true value of the parameter and which can be 
applied without excessive effort. There is no single 
way of deciding how good an estimator is, but there 
are several criteria by which an estimator may be 
judged. 

An unbiased estimator is one which results in a 
distribution of estimates which has a mean exactly 
equal to the value of the parameter being estimated. 
Otherwise the estimator is called biased. The bias is 
the mean of the distribution of the estimator minus 
the value of the parameter it estimates. 


An estimator is said to be consistent if the prob- 
ability that an estimate will differ from the value of 
the parameter by more than any fixed amount can 
be made arbitrarily small by increasing the number 
of observations. 

The variance of an estimator is the mean squared 
deviation of the estimates from the value of the pa- 
rameter. The estimator with the smallest variance is 
called most efficient. The relative efficiency of two 
estimators is the ratio of the variances. When the nu- 
merator of this ratio is the variance of a most efficient 
estimator, this ratio is simply called the efficiency of 
the other estimator. 

An estimator is said to be sufficient if it contains 
all the information in the sample regarding the pa- 
rameter. This is so when the conditional distribution 
of the sample for a given estimate is independent of 
the parameter. 

There are several methods of constructing estima- 
tors of parameters. The method of moments is ap- 
plied by assuming that the first few sample moments 
are equivalent to the moments of some distribution, 
and then solving for the parameters of that distribu- 
tion. The methods known as least squares, minimum 
variance, and minimum chi-square all have as their 
basis the estimation of the values of the parameters 
which minimize some linear function of the squares 
of deviations of the observations from the values of 
the parameters. In applying Bayes’ method, the dis- 
tribution of possible values of the parameter before 
the sample is taken, called the a priori distribution, 
is used in conjunction with the observations in the 
sample to yield an estimator. The method of max- 
imum likelihood uses as the estimate that value of 
the parameter for which the probability of the sam- 
ple is highest. 

A confidence interval is an interval constructed in 
such a way that the true parameter value is within 
this interval with a predetermined probability in re- 
peated sampling; this probability is called the con- 
fidence level of the interval. For example, a sample 
mean u known to be normally distributed with vari- 
ance o? will differ less than 1.960 from the true but 
unknown mean u with probability 0.95. Thus ex- 
pression (16) can be formulated. On solving the two 
inequalities for 4, expression (16) may be written as 
expression (17). 


Probability that 
(-1.90 <xX—p < 1.960) = 0.95 (16) 


Probability that 
(* — 1.960 < w <*+ 1.960) = 0.95 a7) 


When a particular value of x” is computed from the 
observations, then x” + 1.960 is a pair of numbers 
which define a particular interval called the confi- 
dence interval. Because jz is a fixed number, it is 
within this particular interval with probability zero 
or one. However, u is unknown, so that the confi- 
dence to be associated with the interval is stated in 
terms of the proportion of all intervals constructed 
in this manner which would include n, rather than 


this particular interval. A probability of this sort, 
valid for a population of outcomes, when used in 
reference to a particular outcome is called a fiducial 
probability. 

Ordinarily o is unknown and the estimate s derived 
from the sample must be used to form a confidence 
interval; in this case, the ¢ distribution rather than the 
normal distribution must be used. As an example, 
suppose a chemist has made four determinations of 
the atomic weight yz of hydrogen as follows: 1.0066, 
1.0090, 1.0084, and 1.0086. The average of these val- 
ues is given in Eq. (18), and the sample estimate of 


= 1/4(1.0066 + 1.0090 + 1.0084 + 1.0086) 
= 1.0082 a8) 


the variance of his technique is as shown in Eq. (19), 
the estimate of o being therefore Eq. (20). It follows 


s? = "3 [1.0066 — 1.0082)? + (1.0090 — 1.0082)” 
+(1.0084 — 1.0082)? + (1.0086 — 1.00827] 
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from the definition of ¢ in the preceding section that 

in this case Eq. (21) holds. Using the short ¢ table 
_ 21.0082 — w) 
~ 0.0011 


CD 


given in the preceding section, one finds that expres- 
sion (22) can be formulated. Solving these inequali- 
ties for u gives expression (23), and the chemist can 


Probability that 
3.18 eed 3.18) =0.95 (22) 
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Probability that 


(1.00645 < pu < 1.00995) = 0.95 (23) 


assert with 95% confidence that the atomic weight 
of hydrogen lies between 1.00645 and 1.00995. For 
greater precision, more observations are required. 

In a similar fashion one may obtain a confidence 
interval for o by using s? in connection with the chi- 
square distribution. 

A confidence region is the generalization of a con- 
fidence interval and refers to the simultaneous esti- 
mation of several population parameters. The con- 
fidence level is the proportion of the time that the 
region actually includes the true values of the param- 
eters. 

In general, the most desirable confidence inter- 
val or region is the smallest one which can be con- 
structed for the selected confidence level. 

Tests of hypotheses. Besides estimation of param- 
eters, another major area of statistical inference is 
the testing of hypotheses. A hypothesis is merely an 
assertion that a population has a specific property. 
The test consists of drawing a sample from the pop- 
ulation and determining whether or not it is con- 
sistent with the assertion. Very often the hypothe- 
sis is a statement about the mean of a population; 


f(x) g(x) 





acceptance region b critical region X axis 


Fig. 6. Simple test of a hypothesis. 


that it has a given value, that it is the same as that 
of another population, that it exceeds that of an- 
other population by at least 10 units, and the like. 
Thus, one may be comparing a new blend of gaso- 
line with a current blend, a new drug with a standard 
one, a new manufacturing process with an existing 
one. 

For a very simple illustration of the basic ideas 
involved, suppose a population is known to have 
either of two distributions which differ mainly in lo- 
cation. Let the random variable be x and let the two 
functions of x which determine the two distributions 
be represented by the symbols f(x) and g(x) as indi- 
cated in Fig. 6; that is, fox) represents the height of 
the left curve at x, and g(x) represents the height 
of the right curve at x. Assume that the population 
has the distribution f(x); there is in this case only a 
single alternative, g(x), so that rejection of the hy- 
pothesis implies acceptance of the alternative. Sup- 
pose that for testing the hypothesis one can afford 
only a single observation, that is, a sample of size 
one. 

It is obvious in this instance that one should 
choose some number b in advance, then accept the 
hypothesis if the observation falls to the left of b and 
reject it if the observation falls to the right. Values of 
x to the left of b are said to constitute the acceptance 
region of the test; values to the right of b constitute 
the critical region. The area under f(x) to the left of b, 
which is denoted mathematically by notation (24), is 


b 
f f) dx (24) 


the probability that the hypothesis will be accepted 
when it is true. The area to the right of b under fœ) 
is the probability that the hypothesis will be rejected 
when it is true; this probability is called the type I 
error of the test. The area to the left of b under g(x) is 
the probability that the hypothesis will be accepted 
when it is false and is called the type II error of the 
test. The area to the right of b under g(x) is called 
the power of test and is, of course, one minus the 
type II error. 

By moving the point b to the right one can make 
the type I error as small as one likes, but in doing so 
the type II error is increased. This dilemma is char- 
acteristic of the construction of tests of hypotheses. 
Ordinarily one arbitrarily chooses the type I error to 
be some small number such as 0.05 or 0.01 and then 
chooses the critical region so as to minimize the type 
II error. 

The fraction f/()/g(x) is called the likelihood ratio 
or simply the likelihood function of x. From Fig. 6 it 
is evident that the likelihood is relatively large when 
the hypothesis is true and small when it is false. For 
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a sample of size n with observations x), x2, ... , 
Xn the sample likelihood is defined to be the prod- 
uct of the individual likelihoods given in expression 
(25). Let all possible samples of size n be divided 


OSG) fO os 

8X1) BOX2) Xn) 
into two sets, with one set (the acceptance region) 
containing those for which the likelihood is larger 
than some number b and with the other set (the crit- 
ical region) containing those samples for which the 
likelihood is less than b. It can be proved mathemat- 
ically that this is the best critical region for testing 
the hypothesis in the sense that it minimizes the type 
II error for given type I error. The likelihood crite- 
rion therefore furnishes a procedure for construct- 
ing specific tests of hypotheses. Other approaches 
to the testing problem, such as Bayes’ method or the 
minimax principle of game theory, lead to the same 
criterion. 

If, for example, one applies the likelihood crite- 
rion to the problem of testing whether a normal 
population has a given mean, perhaps 10, and spec- 
ifies that the type I error shall be 0.01, then the fol- 
lowing procedure results: Draw a random sample of 
size n, construct an 0.99 confidence interval for the 
population mean py; if the number 10 lies within 
the interval, accept the hypothesis; otherwise, 
reject it. 

An important class of hypotheses has to do with 
tests of independence in multivariate populations. As 
an example, one may consider the population of reg- 
istered voters in the United States as a bivariate pop- 
ulation with one variable being their opinions (yes, 
no, undecided) on some political proposition of the 
moment, and the other variable being geographical 
location. Elements of the population may be classi- 
fied into a so-called contingency table as shown in 
Fig. 7. The hypothesis of independence asserts that 
the division of political opinion is unaffected by loca- 
tion. To test that hypothesis, a random sample of the 
population may be interviewed and classified into 
the 12 categories or cells of the contingency table. 
Let nj be the number of individuals falling in the cell 
in the 7th row and the jth column of the table. The 
sum of all the ny is, of course, n, the sample size. 
The row sums may be denoted by r1, r2, r3, r4, and 
the column sums by cı, C2, c3; the sum of each of 
the sets is n. On applying the likelihood criterion to 
the test of the hypothesis, one finds that it rests on 

















location yes no undecided 
east ni1 n12 ni3 
south n21 n22 n23 

midwest n31 n32 n33 
west N41 Nga n43 




















Fig. 7. Contingency table. 


expression (26), which is approximately distributed 


2 (» logn + 5 nij log nij 


ij 
— y r;logr; — > cjloge) (26) 
i 7 


according to the chi-square distribution with six de- 
grees of freedom. To test at the 0.05 level for the 
type I error, one would compute the above expres- 
sion and compare it with 12.6, which is the value 
that marks 95% of the area under the chi-square dis- 
tribution curve. If the expression turned out to be 
less than 12.6, one would accept the hypothesis of 
independence; otherwise, one would reject it. 

Had there been R rows and C columns instead of 
four and three, one would have used the chi-square 
distribution with (R — 1) (C — 1) degrees of freedom. 
For a trivariate population there would have been a 
three-way contingency table and four hypotheses of 
independence that could have been tested; three of 
them assert that a given criterion of classification is 
independent of the other two, and the fourth asserts 
that all three are mutually independent. 

Design of experiments. An experiment is per- 
formed to obtain information about the relations be- 
tween several variables. For example, one may study 
the effect of storage temperature and duration of stor- 
age on the flavor of a frozen food. Three variables 
(flavor, temperature, and duration) are involved; 
one (flavor) is called the subject of the experiment; 
the other two are called factors which influence 
the subject. Sometimes the factors have intrinsic 
value in themselves; sometimes they are merely nui- 
sance variables which must be taken into account 
because it is impossible to perform the experiment 
without them. 

There exist in the statistical literature great num- 
bers of specific experimental designs. These are pat- 
terns for making experimental observations; the ac- 
tual construction of the designs requires quite ad- 
vanced mathematics based on group theory, finite 
geometries, and combinatorial analysis. The mathe- 
matical problem is to find a pattern from which it is 
possible to extract the desired information and yet 
minimize the number of observations. 

Experimental designs are most important to the 
experimenter when observations are expensive to 
make and when more than one factor is involved 
in the experiment. In the past it was believed that 
the best experimental procedure was to vary fac- 
tors one at a time. Thus, in the frozen food exper- 
iment one might have held storage temperature con- 
stant and studied the effect of duration of storage 
only on flavor. Having determined that relationship, 
one would then hold duration constant and study 
the temperature effect. This procedure is not only 
wasteful of time and resources but may very well 
lead to erroneous conclusions because in all likeli- 
hood there is interaction between the two factors; 
that is, duration effect probably changes in a not 
obvious way when temperature is changed. Even 
if one believes that one can extrapolate accurately 


the duration effect to other temperatures, there is 
much to be said for checking the extrapolation pro- 
cedure in the experiment because it can probably be 
done at no additional cost if the experiment is well 
designed. 

As an illustration of the use of a design, consider 
a very simple and useful one called the randomized 
block design. Suppose a manufacturer contemplates 
purchasing a machine which can be obtained from 
one of three sources, X, Y, or Z. He obtains one of 
each on trial in order to compare their performance 
using several operators from his own plant. Perhaps 
five operators, A, B, C, D, and E, are to be used in the 
proposed experiment, which is a two-factor exper- 
iment with performance being the subject and the 
two factors being humans and machines. The oper- 
ators are not really a factor of interest in evaluating 
the machines but are, of course, necessary to the 
experiment. 

It would be a mistake to use perhaps 15 operators, 
putting 5 on each machine, because operators differ 
in ability and a machine that turned out well might 
have been fortunate in having a good set of opera- 
tors assigned to it. It is necessary that the machines 
be compared in as nearly equivalent circumstances 
as possible; in this case, that is done by having each 
operator operate every machine. The machines may 
then be compared in blocks (operators) which are in- 
dividually homogeneous although they may be quite 
different among themselves. 

The data are obtained by measuring production of 
each operator on each machine for a specified pe- 
riod of time and filling in the table of Fig. 8. It is 
essential that the order in which an operator works 
on the three machines be randomized (by tossing 
dice, for example) so that minor factors not taken 
into account in the design will not bias the results. 
If, for example, learning is an important factor in op- 
erating the machines, then the experiment must be 
three-factor and a more elaborate design is required. 

In the resulting experimental observations the ef- 
fects of humans and of machines are entangled, 
but there exists a computational technique for well- 
designed experiments known as the analysis of vari- 
ance which will disentangle them. This enables the 
total variation to be broken down into parts, a vari- 
ance attributable to machines, one to humans, and 
one to interactions; if the experiment were dupli- 
cated, one could also break out a variance corre- 
sponding to experimental error. One can estimate 
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Fig. 8. Data form for randomized block experiment. 


the individual main effects of machines, the main ef- 
fects of humans, and individual interaction effects. In 
experiments involving more factors, one may be able 
(depending on the experimental design selected) 
to estimate higher-order interactions such as the 
second-order interactions between the main effects 
of one factor and the first-order interactions of two 
other factors. 

A common hypothesis in an analysis of variance 
is the null hypothesis that the variance associated 
with some factor or interaction is not larger than 
the experimental error variance; the test criterion is 
essentially the ratio of the two variances and has the 
so-called F distribution when the null hypothesis is 
true. Rejection of the null hypothesis implies that 
the factor or interaction in question had a significant 
effect on the subject of the experiment. 

Regression and correlation. The regression prob- 
lem is that of estimating certain unknown constants 
or parameters occurring in a function which relates 
several variables; the variables may be random or 
not. By far the most easily handled cases are those in 
which the function is linear in the unknown parame- 
ters, and it is worth considerable effort to transform 
the function to that form if at all possible. 

The eventual adult height H of a 5-year-old boy 
may be quite well predicted by a linear function of 
three variables: his own present height B, his father’s 
height F, and his mother’s height M. The linear func- 
tion is given in Eq. (27), where a, b, c, and d are 


H=a+bB+cF+dM (27) 


the unknown parameters. The three variables B, F, 
and M are called independent variables, the variable 
H is called the dependent variable, and the param- 
eters b, c, and d are often referred to as regression 
coefficients. 

To estimate the parameters, one might draw a ran- 
dom sample of 5-year-old boys, measure their heights 
and those of their parents, then some years later 
measure their adult heights. One would then have 
sufficient data from which the parameters could be 
estimated by procedures entirely analogous to the 
methods for estimating location and scale parame- 
ters described above. In practice, one would not take 
so long to get the data but would use men whose 
childhood records were available. This method in- 
volves subtle sampling problems, however; for exam- 
ple, persons whose records are available may have 
been better cared for on the average and hence taller 
on the average. 

The data which supply the estimates of regression 
coefficients can also be used to estimate the stan- 
dard deviation, or standard error of regression, o. 
Using that estimate and a table of the ¢ distribution, 
one can compute a prediction interval (analogous to 
a confidence interval) which will have the desired 
probability of including the correct adult height of a 
given boy. 

Tall fathers sometimes have short sons and short 
fathers sometimes have tall sons, but generally a fa- 
ther’s height is a good indicator of his son’s height; 
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that is, fathers’ heights and their sons’ heights are 
positively correlated. For a different example: The 
price of a commodity in a free economy is negatively 
correlated with the supply of that commodity; there 
is not a fixed relation between price and supply, but 
as a general proposition one goes up when the other 
goes down. 

Statisticians have developed measures of the de- 
gree of such imprecise relationships called coeffi- 
cients of correlation. The most widely used one is the 
Pearson, or product moment, correlation, which is 
generally denoted by p. It measures the degree of lin- 
ear association or correlation between two random 
variables of a bivariate or multivariate population and 
is defined as the mean over all the population of the 
product of the deviations of the two variables from 
their means divided by the product of their standard 
deviations. Mathematically, it is as shown in Eq. (28), 


dy 


OxOy 





p= / (x — LY — vf, y) dx (28) 


where x and y are the random variables, u and v 
their means, o, and o, their standard deviations, and 
S(@y) their frequency distribution. 

If the two variables are completely unrelated, then 
p = 0; if they have a fixed linear relation so that one 
can be calculated directly from the other, then p = 
+ 1 or p = —1 depending upon whether their rela- 
tion is direct or reverse. Otherwise p will be some 
fraction between —1 and +1 with the fraction being 
near zero if there is poor correlation between them. 

When the independent variables of a regression 
function are random variables, there is an equiva- 
lence between correlation coefficients and regres- 
sion coefficients; after one set has been defined, the 
definition of the other follows automatically; mathe- 
matical formulas connecting them may be found in 
any statistics textbook. 

Nonparametric inference. Most techniques of statis- 
tical inference rely on the central limit theorem or on 
sampling distributions derived from normal popula- 
tions. They are practically always valid for large sam- 
ples and are often valid for samples of intermediate 
size. However, there are occasions when one cannot 
rely on these techniques, particularly when samples 
are small or when some evident peculiarity of the 
population (such as marked asymmetry) makes ordi- 
narily used sampling distributions suspect. In such 
instances one uses nonparametric methods, which 
are valid whatever form the population distribution 
might take. 

Nonparametric techniques use the so-called order 
Statistics, which are merely the sample observations 
arranged in ascending order of magnitude. One may 
let x, be the smallest sample observation, x2 the 
next smallest, and so on with x, being the largest. 
A basic theorem states that on the average the sam- 
ple divides the population into 7 + 1 equal parts, 
that is, that 1/@ + 1) of the population lies be- 
tween any two successive order statistics. Many 
nonparametric methods rest on this fact and its 
consequences. 


As a simple illustration of a nonparametric esti- 
mate and confidence interval, one may consider the 
estimation of the population median, M, given an or- 
dered sample of size 5 consisting of x1, X2, X3, X4, and 
xs. The estimate of M among the observations is sim- 
ply the central sample observation x;. The extreme 
observations x and xs provide limits for a confidence 
interval for M; the probability level for the interval is 
calculated as follows. One-half of the population lies 
to the right of M, hence the probability is 1/> that a 
randomly drawn observation will lie to its right. The 
probability is ('/.)° that all five observations will lie 
to its right. Similarly, the probability is ('/.)° that all 
five will lie to its left. In all other cases the sample 
will have at least one observation on each side of M; 
therefore, expression (29) can be formulated. 


Probability that 
@ < M < x5) = 1 — 1/2% — 1/25 


= 30/32 = 0.94 (29) 


See ANALYSIS OF VARIANCE; BIOMETRICS; DISTRIBU- 
TION (PROBABILITY); EXPERIMENT; QUALITY CON- 
TROL. Alexander M. Mood 
Bibliography. K. A. Brownlee, Statistical Theory 
and Methodology in Science and Engineering, 2d 
ed., 1965, reprint 1984; W. J. Dixon and FE J. Massey, 
Introduction to Statistical Analysis, 4th ed., 1983; 
M. G. Kendall and A. Stuart, The Advanced Theory of 
Statistics, vol. 2, 5th ed., 1991, vol. 3, 4th ed., 1983, 
reprint 1987, vol. 4, 1983; A. M. Mood, Introduc- 
tion to the Theory of Statistics, 3d ed., 1974; D. H. 
Sanders, Statistics: A First Course, 5th ed., 1995; 
M. E Triola, Elementary Statistics, 8th ed., 2000. 


a] 
Statoblasts 


Chitin-encapsulated bodies, resistant to freezing and 
a limited amount of desiccation. They serve as a spe- 
cial means of asexual reproduction in the Phylac- 
tolaemata, a class of fresh-water Bryozoa. They are 
0.01-0.06 in. (0.26-1.5 mm) long, and their shape 
and structure are important to the taxonomy of the 
group. 

Statoblasts are classified as follows: sessoblasts, 
those which attach to zooecial tubes (structures of 
the outer layer or zooecial of an individual in the 
colony) or the substratum; floatoblasts and spino- 
blasts, both of which have a float of “air” cells and 
therefore are free-floating; and piptoblasts, which are 
free but have no float (see illus.). 

These bodies are produced in enormous quanti- 
ties from spring to autumn. They develop by orga- 
nization of masses of peritoneal cells and epidermal 
cells that bulge into the coelom. Each mass then se- 
cretes protective upper and lower chitinous valves, 
the rims of which often project peripherally. Stato- 
blasts are therefore somewhat disk-shaped. 

These bodies remain dormant for variable periods 
of time and serve to tide species over adverse eco- 
logical conditions, such as freezing or drying, which 





Some types of statoblasts. (a) Spinoblast of Pectinatella 
magnifica. (b) Floatoblast of Plumatella repens. 

(c) Piptoblast or sessoblast of Fredericella sultana. 

(d) Sessoblast of Stolella indica. 


kill the colony. During this time they may be dis- 
persed over considerable distances, being carried by 
animals, floating vegetation, or the action of water 
currents. When environmental conditions become 
favorable, which is usually in the spring, statoblasts 
germinate and a zooid develops from the mass of 
cells lying between the two valves. Statoblasts of 
Lopbhopodella have germinated after 50 months of 
drying. See BRYOZOA. Sybil P. Parker 


E) 
Staurolite 


A nesosilicate mineral occurring in metamorphic 
rocks. The chemical formula of staurolite may be 
written as A4B4Cı8D4 T8040Xgs, Where A = Fe**, Mg, 
= vacancy, > 2); B = Fe**, Zn, Co, Mg, 
Li, Al, Fe*+, Mn?+, (1; C = Al, Fe**+, Cr, V, Ti; D = 
Al, Mg, > 2); T = Si, Al; X = OH, E O. 
Staurolite is an order-disorder series between an or- 
dered monoclinic structure (with B = 90.64°) anda 
disordered orthorhombic structure (with B = 90°). 
The structure of staurolite consists of slabs of the 
kyanite structure [AlSiO;] and layers of Fe-Al oxy- 
hydroxide [~Alp ¢, !Fe, (OH),] alternating along 
[010], the monoclinic symmetry axis. The kyanite 
slab consists of edge-sharing chains of AlOg octahe- 
dra (Fig. 14, regular dots) extending along the c di- 
rection, flanked by AlOg octahedra (random dots) 
and cross-linked by SiO, tetrahedra (crosses); only 
small amounts of Mg and Fe substitute for Al in 
this slab. The oxide-hydroxide layer consists of edge- 
sharing chains of (Al,_lNO¢ octahedra (Fig. 1b, reg- 
ular dots) extending along the c axis, alternating 
with chains of (Al,_))O¢ octahedra (random dots) and 
(FeO, tetrahedra (crosses). In the latter chains, the 
(AL|_)O¢ octahedra share edges; the (FeO,) tetrahe- 
dra share faces with adjacent (Al|_)O¢ octahedra, 
and polyhedra adjacent across a shared face can- 
not both be locally occupied. The extensive cou- 














































































































pling of substitutions involving the cations Al, Fe?*, 
H and| | (vacancy) gives rise to extensive short-range 
order, and local bond-valence arrangements are opti- 
mal for the formula [Fe?+4Al;s- Sig044(0H)4]?+. How- 
ever, at the long-range level, a crystal cannot have 
a net charge, and hence the principal heterovalent 
substitutions in staurolite [Al < Si, Mg < Al, U < 
H, Li = Fe?+, LF] © Fe?*] all act such as to reduce 
this charge to zero. Thus the chemical complexity of 
staurolite results from the interaction of long-range 
and short-range charge-balance requirements. 

Staurolite occurs as well-formed, often-twinned, 
prismatic crystals (Fig. 2). It is brown-black, reddish 
brown, or light brown in color and has a vitreous 
to dull luster. Light color and dull luster can result 
from abundant quartz inclusions. There is no cleav- 
age, specific gravity is 3.65-3.75, and hardness is 7- 
7.5 (Mohs scale). 

Staurolite occurs in pelitic (former shale) schists 
of medium grade (degree) of metamorphism. In typi- 
cal rocks, it forms as temperature increases from the 
reaction of muscovite, garnet, and chlorite to form 
staurolite, biotite, and water, or from the breakdown 
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Fig. 1. Staurolite crystal structure: (a) kyanite slab and 
(b) oxide-hydroxide layer, where a is the shortest axis and c 
is the longest axis. 
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Fig. 2. Staurolite crystal habits. (After C. Klein and C. S. 
Hurlbut, Jr., Manual of Mineralogy, 21st ed., John Wiley, 
1993) 


of chloritoid to form staurolite, garnet, and water 
at temperatures of about 930°F (500°C) as shown 
by laboratory experiments. With increasing temper- 
ature in such rocks, staurolite reacts with muscovite 
to form garnet, biotite, and aluminosilicate miner- 
als (kyanite, sillimanite, or andalusite, depending 
on pressure) at temperatures near 1110°F (600°C). 
Thus, most staurolite forms between 930 and 1110°F 
(500 and 600°C). Staurolite may form at pressure 
as low as 2.5 kilobars (250 megapascals), corre- 
sponding to depths in the Earth’s crust of 5.5. mi 
( km); at lower pressure, cordierite and andalusite 
take its place in pelitic schists or contact metamor- 
phic rocks. Staurolite may occur at pressures corre- 
sponding to the deepest levels in the Earth’s crust 
(>10 kbar). However, it is less common and tends 
to be Mg-rich in deep crustal rocks. Typical miner- 
als occurring with staurolite are quartz, micas (mus- 
covite and biotite), garnet (almandine), tourmaline, 
and kyanite, sillimanite, or andalusite. Staurolite is 
common where pelitic schists reach medium-grade 
metamorphism. Examples are the Swiss and Italian 
Alps (notable at Saint Gotthard, Switzerland), and 
all the New England states, Virginia, the Carolinas, 
Georgia, New Mexico, Nevada, and Idaho. See META- 
MORPHISM; SILICATE MINERALS. Frank C. Hawthorne 

Bibliography. W. A. Deer, R. A. Howie, and J. 
Zussman, Rock Forming Minerals: Orthosilicates, 
vol. 1A, 1997; E C. Hawthorne et al., The crystal 
chemistry of staurolite, Can. Mineral., 31:551-616, 
1993; M. J. Holdaway, B. L. Dutrow, and P. Shore, A 
model for the crystal chemistry of staurolite, Amer 
Mineral., 71:1142-1159, 1986; C. Klein and C. S. 
Hurlbut, Jr., Manual of Mineralogy, 21st ed., 1993. 


cE 
Stauromedusae 


An order of the cnidarian class Scyphozoa, usually 
found in circumpolar regions. Haliclystus auricula 
is typical. The egg develops into a planula which can 
only creep since it lacks cilia. The planula changes 
into a polyp that metamorphoses directly into a com- 
bined polyp and medusa form. The medusa is com- 
posed of a cuplike bell called a calyx (medusan part) 
and a stem (polyp part) which terminates in a pedal 
disk (see illus.). The calyx is eight-sided and has eight 
groups of short, capped tentacles and eight sensory 
bodies, called anchors (rhopalioids), on its margin. 
The mouth, situated at the center of the calyx, has 
four thin lips and leads to the stomach in which gas- 
tral filaments are arranged in a row on either side 
of each interradius. Though sessile, the medusa can 





Stauromedusae. (a) Lucernaria. (b) Haliclystus. (After L. H. 
Hyman, The Invertebrates, vol. 1, McGraw-Hill, 1940) 


move in a leechlike fashion by alternate attachment 
and release of the pedal disk, using the substratum 
as an anchor. See SCYPHOZOA. Tohru Uchida 


ES 
Steam 


Water vapor, or water in its gaseous state. Steam is 
the most widely used working fluid in external com- 
bustion engine cycles, where it will utilize practi- 
cally any source of heat, that is, coal, oil, gas, nuclear 
fuel (uranium and thorium), waste fuel, and waste 
heat. It is also extensively used as a thermal trans- 
port fluid in the process industries and in the comfort 
heating and cooling of space. The universality of its 
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Fig. 1. Vapor-tension curve for water. 1 Ib/in.? abs. = 
6.895 kPa, absolute pressure. °C = (°F — 32)/1.8. 






















































































Steam 395 
Thermal properties of steam (approximate)* 
Enthalpy, Btu/Ib* 
Pressure, Ib/in.? Temperature, Saturated Latent Saturated 
abs. (kPa) °F (°C) liquidt heat vaport 
1 (6.895) 102 (39) 69.7 1036 1106 
10 (68.95) 193 (89) 161 982 1143 
14.7 (101.3) 212 (100) 180 970 1150 
100 (689.5) 328 (164) 298 889 1187 
1000 (6895) 545 (285) 542 649 1192 
3208 (22,120) 705 (374) 906 0 906 
*For more exact values see J. H. Keenan and F. G. Keyes, Thermodynamic Properties of Steam, Wiley, 1936, or American Society of Mechanical 
Engineers, Thermodynamic and Transport Properties of Steam (tables and charts), 1967. 
tEnthalpy is measured from saturated liquid at 32°F (0°C). 
#1 Btu/lb = 2326 J/kg. 
availability and its highly acceptable, well-defined in the wet steam decreases. Dry steam has a quality 
physical and chemical properties also contribute to of 100% (see the table and Fig. 2). See ENTHALPY. 
the usefulness of steam. The engineering profession Properties as a gas. Superheated steam at temper- 
has developed so much evaporative and condens- atures well above the boiling temperature for the 
ing equipment for changing the state of water that existing steam pressure follows closely the laws of a 
its properties can be utilized from pressures ranging perfect gas. Thus, pv © 85.87 and k © 1.3, where p 
from 0.18 in. Hg or 0.61 kilopascal (ice point) to su- is pressure in Ib/ft?, v is volume in ft*/Ilb, T is abso- 
percritical (>3208.2 lb/in.? absolute or 22.120 mega- lute temperature in °R, and k is the ratio of specific 
pascals absolute pressure and 705.47°F or 374.15°C). heat at constant pressure to specific heat at constant 
These temperature ranges are readily adaptable for volume. (In SI units, p’v’ © 461.5 T; where p’ is pres- 
use with ferrous and nonferrous metal structures at sure in Pa, v’ is volume in m?/kg, and T is absolute 
costs which dominate the design, selection, and use temperature in K.) However, the behavior of dry, sat- 
of many applications. urated steam departs from that of a perfect gas, and 
The temperature at which steam forms depends wet steam is a mixture. Therefore, the properties of 
on the pressure in the boiler. The steam formed in steam near its vaporization temperature are deter- 
the boiler (and conversely steam condensed in a mined experimentally. 
condenser) is in temperature equilibrium with the 
water. The relation is rigorously specified by the 
vapor tension curve (Fig. 1). (The table gives a 800 critical point 7 
few selected values in the first two columns.) Under 3208 Ib/in.2, 705°F, 906 Btu/lb = 
these conditions, with steam and water in contact (22.12 MPa, 374.15°C, 2.107 MJ/kg) z 
and at the same temperature, the steam is termed 700 | | S z 
saturated. Steam can be entirely vapor when it is S/ |R 
: ; : compressed fy ice 
100% dry, or it can carry entrained moisture and be liquid phase —/ fs 
wet. It may also be contaminated, as in all practi- 600 000 Ib/in.2 abs. (6895 kPa S 
cal and engineering applications, with other gases, constant pressure E 
especially air, in which case it is a mixture and Dal- S 
ton’s law prevails. After the steam is removed from 500 
contact with the liquid phase, the steam can be fur- s- D wet-vapor phase 
ther heated without changing its pressure. If initially £ O 
wet, the additional heat will first dry it and then t 400 F 
raise it above its saturation temperature. This is a 2 &/ |100 Ib/in.? abs. (689.5 kPa) ea 
sensible heat addition, and the steam is said to be § F constant pressure ae Pees 
superheated. 300 
The heat energy in steam can be divided into three latent heat 
parts: (1) enthalpy of the liquid required to raise the of evaporation 5 
water from its initial temperature (usually considered 200 S 
to be 32°F or 0°C) to the boiling temperature, (2) en- 3 
thalpy of vaporization required to convert the water s 
to steam at the boiling temperature, and (3) enthalpy 100 = 
of superheat that raises the steam to its final tem- 
perature. As the steam performs its thermodynamic 
function of giving up its heat energy, it loses its su- 0 
0 200 400 600 800 1000 1200 1400 1600 


perheat, becomes wet, and finally condenses to hot 
water. While wet, the steam has a quality that de- 
creases as the percent of dry, saturated steam present 


enthalpy, Btu/Ib 


Fig. 2. Temperature-enthalpy chart for steam. 1 Btu/Ib = 2326 J/kg. °C = (°F — 32)/1.8. 
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Fig. 3. Temperature-entropy chart for steam. °C = (°F — 32)/1.8; 1 Btu/Ib °F = 4186.8 J/kg 
°C; 1 Btu/Ib = 2326 J/kg; 1 ft?/Ib = 62.4 x 10~° m°/kg; 1 Ib/in.? = 6.895 kPa. (After J. H. 
Keenan and F. G. Keyes, Thermodynamic Properties of Steam, John Wiley and Sons, 1936) 


Data can be presented in tabular form, giving pres- 
sure, volume, entropy, enthalpy, and temperature for 
saturated liquid (water at the boiling point), satu- 
rated steam, and steam vapor at various temperatures 
of superheat. Alternatively, thermodynamic proper- 
ties of steam can be presented diagrammatically. For 
analysis of thermodynamic cycles, the temperature- 
entropy chart is widely used (Fig. 3). Area on this 
chart is proportional to heat energy. At the critical 
point, steam condenses directly into water without 
releasing energy. In the uncharted area the pressure 
is higher than is usually encountered in commercial 
practice. 

For engineering application the Mollier diagram 
presents steam data in a convenient form (Fig. 4). In 
the wet region, lines of constant temperature and of 
constant pressure are straight and coincide with each 
other. Total enthalpies above 32°F (0°C) are plotted 
as ordinates and total entropies as abscissas. 

The specific volume of steam varies widely with 
temperature and pressure. At 1 Ib/in.? absolute 
(6.895 kPa absolute pressure) the specific volume 
of saturated steam is 334 ft? (20.85 m3/kg), at atmo- 
spheric pressure it is 26.8 (1.29), at 1000 Ib/in.? abso- 
lute (6.895 MPa) it is 0.45 (28 x 1074), at 2000 Ib/in.” 


absolute (13.79 MPa) it is 0.19 (12 x 1075), and at 
the critical pressure (3208 Ib/in.? absolute or 22.12 
MPa) it is 0.05 (@ x 1079). Superheating substantially 
increases values of specific volume of a steam at sat- 
uration conditions. 

Viscosity of steam increases with temperature and 
pressure. Saturated steam at atmospheric pressure 
has a viscosity of about 2.6 x 107 db)(s)/ft? (1.2 x 
10° Pa-s). At 1000°F (538°C) and 2000 lb/in.? 
(13.79 MPa) pressure it is 13.4 x 107 b)(s)/ft? (6.4 x 
10° Pa-s). Similarly, the heat conductivity of steam in- 
creases with temperature and pressure. The thermal 
conductivities for the conditions for which viscosi- 
ties are given are about 14, 37, and 109 Btu/()@0CF) 
or 24, 64, and 189 W/Gm)C-C). 

Application. Chiefly because of its availability, but 
also because of its nontoxicity, steam is widely 
used as the working medium in thermodynamic pro- 
cesses. It has a uniquely high latent heat of vapor- 
ization: 1049 Btu/lb (2.440 MJ/kg) at 1 in. Hg abs. 
(3.39 kPa) and 79°F (26°C), 970 (2.256 MJ/kg) at 
14.7 Ib/in.? abs. (101 kPa) and 212°F (100°C), 889 
(2.068 MJ/kg) at 1000 lb/in.? abs. (6.895 MPa) and 
328°F (164°C), 649 (1.510 MJ/kg) at 1000 Ib/in.2 
abs. (6.895 MPa) and 545°F (285°C), and 463 (1.077 
MJ/kg) at 2000 Ib/in.? abs. (13.79 MPa) and 636°F 
(336°C). Steam has a specific heat in the vicinity of 
half that of water. For comparison, its specific heat 
is about twice that of air and comparable to that of 
ammonia. Except for a few gases such as hydrogen, 
with a specific heat seven times that of steam, the 
specific heat of steam is relatively high so that it can 
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Fig. 4. Mollier diagram for steam. 1 Btu/Ib = 2326 J/kg; 

1 Btu/Ib °F = 4186.8 J/kg °C; °C = (°F — 32)/1.8; 1 Ib/in.? = 
6.895 kPa. (After Mollier’s Steam Tables and Diagrams, 
Pitman, 1927) 


carry more thermal energy at practical temperatures 
than can other usable gases. See BOILER; BOYLE’S 
LAW; CHARLES’ LAW; DALTON’S LAW; ENTROPY; STEAM 
ENGINE; STEAM-GENERATING UNIT; STEAM HEATING; 
STEAM TURBINE; THERMODYNAMIC CYCLE; THERMO- 
DYNAMIC PRINCIPLES; WATER. Theodore Baumeister 
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tals, 1994, Systems, 1992; E. A. Avallone and T. 
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Keenan et al., Steam Tables: Thermodynamic Prop- 
erties of Water Including Vapor, Liquid, and Solid 
Phases, 1978; K. S. Pitzer, Thermodynamics, 3d ed., 
1994; E. Schmidt, Properties of Water and Steam, 
1981. 
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Steam condenser 


A heat-transfer device used for condensing steam to 
water by removal of the latent heat of steam and 
its subsequent absorption in a heat-receiving fluid, 
usually water, but on occasion air or a process fluid. 
Steam condensers may be classified as contact or sur- 
face condensers. 

In the contact condenser, the condensing takes 
place in a chamber in which the steam and cooling 
water mix. The direct contact surface is provided by 
sprays, baffles, or void-effecting fill such as Rashig 
rings. In the surface condenser, the condensing takes 
place separated from the cooling water or other heat- 
receiving fluid (or heat sink). A metal wall, or walls, 
provides the means for separation and forms the con- 
densing surface. 

Both contact and surface condensers are used 
for process systems and for power generation serv- 
ing engines and turbines. Modern practice has con- 
fined the use of contact condensers almost entirely 
to such process systems as those involving vacuum 
pans, evaporators, or dryers, and to condensing 
and dehumidification processes inherent in vacuum- 
producing equipment such as steam jet ejectors and 
vacuum pumps. The steam surface condenser is used 
chiefly in power generation but is also used in pro- 
cess systems, especially in those in which conden- 
sate recovery is important. Air-cooled surface con- 
densers are used increasingly in process systems and 
in power generation when the availability of cooling 
water is limited. 

Water-cooled surface condensers used for power 
generation are of the high-vacuum type. Their main 
purpose is to effect a back pressure at the turbine 
exhaust. To achieve an economical station heat rate 
and fixed cost, these condensers must be designed 
for high heat-transfer rates, minimum steam-side- 
pressure loss, and effective air removal. The usual 
power plant steam condenser incorporates a sin- 
gle steam-condensing chamber into which the steam 
turbine exhausts. Multipressure condensers, having 
two or more steam-condensing chambers operat- 


Steam condenser 


ing at different pressures, have been found to be 
more economical than single-pressure condensers 
for large installations, which generally use cooling 
towers as the final heat sink. Most multipressure 
condensers are compartmented internally, on the 
steamcondensing side, with the cooling water ar- 
ranged to flow through the tubes in one direc- 
tion from inlet to outlet. The mean cooling water 
temperature progressively increases as the water 
flows from compartment to compartment, with back 
pressure on the turbine increasing proportionately. 
Economical performance is effected with multiple- 
exhaust pressure turbines when each exhaust pres- 
sure section of the turbine is designed for the back 
pressure produced in its respective pressure sec- 
tion in the condenser. A rise in water tempera- 
ture in the order of 20-30°F (11-17°C) is neces- 
sary to effect required pressure differences in the 
turbine. 

Air-cooled surface condensers used for power gen- 
eration are designed to operate at higher back pres- 
sures than water-cooled condensers. This is con- 
sistent with their lower overall heat-transfer rate, 
which is of the order of 10-12 Btu/CF)(ft?)(h) 
[57-68 W/CC)(m?)] in contrast with the nor- 
mal 600-700 Btu/CF)ft)ch) [3.4-4.0 kW/CC)an”)] 
characteristic of water-cooled steam condensers. To 
compensate for these large differences in heat trans- 
fer, greater temperature differences are required be- 
tween the condensing steam and the cooling air 
than those needed with conventional water cool- 
ing. Steam is condensed on the inside of the heat- 
transfer tubes, and airflow is usually produced by 
fans across their outside surface. The tubes are 
of the extended-surface type with fins on the out- 
side. The normal ratio of outside to inside surface is 
on the order of 10 or 12 to 1. The use of air as the cool- 
ing medium for steam condensers used in smaller 
industrial steam power plants is increasing rapidly. 
Air-cooled steam condensers for the production of 
electric power have been confined to smaller plants; 
however, with the decreasing availability of water 
for cooling, the application of air-cooled steam con- 
densers to the larger steam electric plants may be 
anticipated. 

Condenser sizes have increased with the increase 
in size of turbine generators. The largest, in a sin- 
gle shell, is approximately 485,000 ft? (45,000 m7); 
the steam-condensing space occupies a volume of 
80,000 ft? (2265 m%). Large units have been built 
with the surface divided equally among two or three 
shells. Power plant condensers require 35-100 lb of 
cooling water to condense 1 Ib (35-100 kg for 2 kg) 
of steam, or, if air-cooled, 2500-5000 ft? of air for 
1 Ib 150-300 m3 for 1 kg) of steam. Normally, about 
0.5 ft? (0.05 m?) of surface is required for water- 
cooled condensers, and about 10 ft? (0.9 m7) of inside 
tube surface for air-cooled condensers is required 
for each kilowatt of generating capacity. See STEAM; 
STEAM TURBINE; VAPOR CONDENSER. Joseph F. Sebald 

Bibliography. H. P Bloch, A Practical Guide to 
Steam Turbine Technology, 1995; J. P. Holman, Heat 
Transfer, 9th ed., 2001. 
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Steam electric generator 


An alternating-current (ac) synchronous genera- 
tor driven by a steam turbine for 50- or 60-Hz 
electrical generating systems operation. See STEAM 
TURBINE. 

Basic operation and construction. The synchronous 
generator is made of two parts: a stator (stationary) 
and a rotor (rotating) [see illus.]. The stator is a 
cylindrical steel frame, inside of which a cylindri- 
cal iron core made of thin insulated laminations is 
mounted on a support system. The iron core has 
equally spaced axial slots on its inside circumference, 
and wound within the core slots is a stator winding. 
The stator winding copper is electrically insulated 
from the core. The rotor consists of a forged solid 
steel shaft. Wound into axial slots on the outside di- 
ameter of the shaft is a copper rotor winding that is 
held in the slots with wedges. Retaining rings sup- 
port the winding at the rotor body ends. The rotor 
winding, referred to as the field, is electrically in- 
sulated from the shaft and is arranged in pole pairs 
(always an even number) to form the magnetic field 
which produces the flux. The rotor shaft (supported 
by bearings) is coupled to the steam turbine, and ro- 
tates inside the stator core. See ELECTRIC ROTATING 
MACHINERY; MAGNETIC FLUX; WINDINGS IN ELECTRIC 
MACHINERY. 

The stator winding (armature) is connected to the 
ac electrical transmission system through the bush- 
ings and output terminals. The rotor winding (field) 
is connected to the generator’s excitation system. 


The excitation system provides the direct-current 
(dc) field power to the rotor winding via carbon 
brushes riding on a rotating collector ring mounted 
on the generator rotor. The synchronous generator’s 
output voltage amplitude and frequency must remain 
constant for proper operation of electrical load de- 
vices. During operation, the excitation system’s volt- 
age regulator monitors the generator’s output voltage 
and current. The voltage regulator controls the rotor 
winding dc voltage to maintain a constant generator 
stator output ac voltage, while allowing the stator 
current to vary with changes in load. Field windings 
typically operate at voltages between 125 and 575 V 
dc. The synchronous generator’s output frequency 
is directly proportional to the speed of the rotor, 


Frequency = (rpm x number of poles) + (120) 


and the speed of the generator rotor is held con- 
stant by a speed governor system associated with 
the steam turbine. See ALTERNATING-CURRENT GEN- 
ERATOR; GENERATOR. 

The first commercial use of the ac synchronous 
generator was in the mid-1880s. Synchronous gen- 
erators range in size from a few kilovolt-amperes 
to 1,650,000 kVA. 60-Hz steam-driven synchronous 
generators operate at speeds of either 3600 or 
1800 rpm; for 50-Hz synchronous generators these 
speeds would be 3000 or 1500 rpm. These two- and 
four-pole generators are called cylindrical rotor units. 
For comparison, water (hydro)-driven and air-driven 
synchronous generators operate at lower speeds, 
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Cross section of a typical hydrogen- and water-cooled, two-pole, 3600-rpm, steam electric generator. The dimensions 
shown are for a 900,000-kVA synchronous generator. The hydrogen coolers are not shown. 


some as low as 62 rpm (116 poles). The stator output 
voltage of large (generally greater than 100,000 kVA) 
units ranges 13,800-27,000 V. The operating char- 
acteristics of the steam-turbine-driven synchronous 
generators are defined in IEEE Std. C50.13-2005, 
IEEE Standard for Cylindrical-Rotor 50 Hz and 
60 Hz Synchronous Generators Rated 10 MVA and 
Above. See ELECTRIC POWER GENERATION; HYDRO- 
ELECTRIC GENERATOR. 

The physical size of a steam generator is de- 
termined by the transportation limitations of size 
and weight. For the generator shown in the illus- 
tration, the stator would weigh about 800,000 Ib 
(363,000 kg), and the rotor about 140,000 Ib 
(64,000 kg). 

Cooling. There are five sources of heat loss in a syn- 
chronous generator: stator winding resistance, rotor 
winding resistance, core, windage and friction, and 
stray losses. Removing the heat associated with these 
losses is the major challenge to the machine designer. 

As the demand for power increased, generators 
grew in output rating and physical size. The cooling 
requirements for the stator windings, rotor wind- 
ings, and core increase proportionally to the cube 
of the machine size. The early synchronous genera- 
tors were air-cooled. Later, air-to-water coolers were 
required to remove the heat. 

In 1937, hydrogen gas was introduced as a cool- 
ing medium in place of air. The generator frame 
had to be sealed against hydrogen leakage. The first 
units were filled with pure hydrogen at pressures 
of 0.5-5 psi 6.5-35 kPa), with a hydrogen purity 
of greater than 97%. In today’s large generators, hy- 
drogen pressures of 75 psi (520 kPa) are common. 
An elaborate seal-oil system (gland seal) is used to 
form a hydrogen seal near the bearings where the 
rotating shaft exits the frame. The seal-oil system op- 
erates at a pressure that is slightly higher than the 
hydrogen pressure. Hydrogen is the most effective 
gas available for generator cooling, and is used be- 
cause it removes approximately ten times more heat 
than air; its density is one-tenth that of air; windage 
losses are lower; hydrogen is a better electrical insu- 
lator; it is a nonoxidizing agent; and pure hydrogen 
will not support combustion in the case of a wind- 
ing failure. The hydrogen is circulated throughout 
the stator frame by blowers or fans mounted on the 
rotor, and is cooled by hydrogen-to-water heat ex- 
changers mounted within the stator frame. In the 
early machines, the hydrogen gas cooled the surface 
of the stator and rotor windings (as did air), relying 
on conduction for heat transfer through the wind- 
ing insulation. This cooling method is called indi- 
rect cooling. For larger machines, the stator wind- 
ings were designed with hydrogen cooling passages 
manufactured within the copper winding, which al- 
lowed the hydrogen to flow through the winding in 
direct contact with the copper. These designs were 
called directly cooled windings. The rotor windings 
were also directly cooled with hydrogen gas. Helium 
is not used because it has higher density, has higher 
specific heat, and costs five times more than hydro- 
gen. See HYDROGEN. 


Synchronous generators continued to increase in 
output, and in 1957 oil was introduced as a cool- 
ing medium for stator windings. Oil was quickly 
replaced by water in 1961. The stator winding 
losses are responsible for about 25-40% of the 
total losses. The copper windings were manufac- 
tured with hollow copper conductors (strands). 
Water flows through the conductors in direct con- 
tact with the copper windings, thus removing the 
heat. Water enters and leaves the stator winding 
through Teflon hoses, which electrically insulate the 
steel water piping from the stator winding copper. 
A stator cooling water system contains a storage 
tank and pumps to move the water through the sta- 
tor winding, filters, strainers, heat exchangers, and 
demineralizers to maintain low conductivity. The 
demineralizer contains a resin bed which removes 
minerals and other conductive impurities from the 
water. James R. Michalec 
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Steam engine 


A machine for converting the heat energy in steam 
to mechanical energy of a moving mechanism, for 
example, a shaft. The steam engine dominated the 
industrial revolution and made available a practical 
source of power for application to stationary or trans- 
portation services. The steam power plant could be 
placed almost anywhere, whereas other means of 
power generation were more restricted, experienc- 
ing such site limitations as an elevated water sup- 
ply, wind, animal labor, and so on. The steam engine 
can utilize any source of heat in the form of steam 
from a boiler. It was developed in sizes which ranged 
from that of children’s toys to 25,000 hp (18.6 MW), 
and it was adaptable to pressures up to 200 Ib/in.? 
(1.4 megapascals). It reached its zenith in the nine- 
teenth century in stationary services such as drives 
for pumping plants; drives for air compressor and 
refrigeration units; power supply for factory opera- 
tions with shafting for machine shops, rolling mills, 
and sawmills; and drives for electric generators as 
electrical supply systems were perfected. Its adapt- 
ability to portable and transportation services rested 
largely on its development of full-rated torque at any 
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Steam engine 


speed from rest to full throttle; its speed variability at 
the will of the operator; and its reversibility, flexibil- 
ity, and dependability under the realities of stringent 
service requirements. These same features favored 
its use for many stationary services such as rolling 
mills and mine hoists, but the steam engine’s great 
contribution was in the propulsion of small and large 
ships, both naval and merchant. Also, in the form of 
the steam locomotive, the engine made the railroad 
the practical way of land transport. Most machine 
elements known today had their origin in the steam 
engine: cylinders, pistons, piston rings, valves and 
valve gear crossheads, wrist pins, connecting rods, 
crankshafts, governors, and reversing gears. See LO- 
COMOTIVE. 

The twentieth century saw the practical end of the 
steam engine. The steam turbine with (1) its high 
speed (for example, 3600 rpm); (2) its utilization of 
maximum steam pressures (2000-5000 lb/in.? or 14- 
34 MPa), maximum steam temperatures (~1100°F 
or 600°C), and highest vacuum (~29 in. Hg or 98 
kilopascals); and (3) its large size (~1,000,000 kW) 
led to such favorable weight, bulk, efficiency, and 
cost features that it replaced the steam engine as 
the major prime mover for electric generating sta- 
tions. The internal combusion engine, especially the 
high-speed automotive types which burn volatile 
(gasoline) or nonvolatile (diesel) liquid fuel, offers a 
self-contained, flexible, low-weight, low-bulk power 
plant with high thermal efficiency that has com- 
pletely displaced the steam locomotive with the 
diesel locomotive and marine steam engines with 
the motorship and motorboat. It is the heart of 
the automotive industry, which produces in a year 
10,000,000 vehicles that are powered by engines 
smaller than 1000 hp (750 kW). Because of the steam 
engine’s weight and speed limitations, it was ex- 
cluded from the aviation field, which has become 
the exclusive preserve of the internal combustion 
piston engine or the gas turbine. See DIESEL ENGINE; 
GAS TURBINE; INTERNAL COMBUSTION ENGINE; STEAM 
TURBINE; TURBINE. 

Cylinder action. A typical steam reciprocating en- 
gine consists of a cylinder fitted with a piston 
(Fig. 1). A connecting rod and crankshaft convert 
the piston’s to-and-fro motion into rotary motion. A 
flywheel tends to maintain a constant-output angular 
velocity in the presence of the cyclically changing 
steam pressure on the piston face. A D slide valve 
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Fig. 1. Principal parts of horizontal steam engine. 
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Fig. 2. Single-ported slide valve on counterflow double- 
acting cylinder. 
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Fig. 3. Events during one cycle of a piston operation. (a) In 
ideal engine. (b) As depicted on the indicator card of a 
noncondensing steam engine. 


admits high-pressure steam to the cylinder and al- 
lows the spent steam to escape (Fig. 2). The power 
developed by the engine depends upon the pressure 
and quantity of steam admitted per unit time to the 
cylinder. 

Indicator card. The combined action of valves and 
piston is most conveniently studied by means of a 
pressure-volume diagram or indicator card (Fig. 3). 
The pressure-volume diagram is a thermodynamic 
analytical method which traces the sequence of 
phases in the cycle. It may be an idealized oper- 
ation (Fig. 3a), or it may be an actual picture of 
the phenomena within the cylinder (Fig. 3b) as ob- 
tained with an instrument commonly known as a 
steam engine indicator. This instrument, in effect, 
gives a graphic picture of the pressure and volume 
for all phases of steam admission, cutoff, expansion, 
release, exhaust, and compression. It is obtained as 
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the engine is running and shows the conditions F -> 
which prevail at any instant within the cyclinder. ce I | i 
The indicator is a useful instrument not only for H <= 
studying thermodynamic performance, but for the | 
equally important operating knowledge of inlet and 
exhaust valve leakage and losses, piston ring tight- = 
ness, and timing correctness. See THERMODYNAMIC = cylinder port 
PROCESSES. 

The net area of the indicator card shows, thermo- E — zero exhaust lap 
dynamically, the work done in the engine cylinder. <— steam lap 





By introducing the proper dimensional quantities, plain slide valve (D-valve) 


power output can be measured. Thus if the net area 
within the card is divided by the length, the conse- 
quent equivalent mean height is the average pres- 


plan 


Indicated horsepower = ——_ 
33, 000 











sure difference on the piston during the cycle and is S Ka 
generally called the mean effective pressure p, usu- 
ally expressed in Ib/in.* With a cylinder dimension of 

piston area a (in in.”), length of piston stroke / (in fo), 

and n equal to the number of cycles completed per 

minute, the equation given below holds. It has often 

been referred to as the most important equation in 

mechanical engineering. 

Engine types. Engines are classified as single- or 
double-acting, and as horizontal (Fig. 1) or vertical Corliss steam valve 
depending on the direction of piston motion. If the double-ported, in opening position 
steam does not fully expand in one cylinder, it can be 
exhausted into a second, larger cylinder to expand 
further and give up a greater part of its initial energy. 

Thus, an engine can be compounded for double or ataust 
triple expansion. In counterflow engines, steam en- k— exhaust lap 
ters and leaves at the same end of the cylinder; in uni- pas 

flow engines, steam enters at the end of the cylinder I i r 
and exhausts at the middle. s <| 

Steam engines can also be classed by functions, f x = 
and are built to optimize the characteristics most de- a 4 _ = 
sired in each application. Stationary engines drive wv sf T H 
electric generators, in which constant speed is im- x 
portant, or pumps and compressors, in which con- <— steam lap 
stant torque is important. Governors acting through piston valve 
the valves hold the desired characteristic constant. 
Marine engines require a high order of safety and 
dependability. 

Valves. The extent to which an actual steam pis- 
ton engine approaches the performance of an ideal 
engine depends largely on the effectiveness of its 
valves. The valves alternately admit steam to the 
cylinder, seal the cylinder while the steam expands 
against the piston, and exhaust steam from the cylin- 
der. The many forms of valves can be grouped as 
sliding valves and lifting valves (Fig. 4). See CARNOT 
CYCLE; THERMODYNAMIC CYCLE; VALVE. 

D valves (Fig. 2) are typical sliding valves where 
admission and exhaust are combined. A common 
sliding valve is the rocking Corliss valve; it is driven 
from an eccentric on the main shaft like other valves 
but has separate rods for each valve on the en- double-beat poppet valve seated in cage 


gine. After a Corliss valve is opened, a latch auto- 
Fig. 4. Steam engine valves in closed positions. The arrows 


matically disengages the rod and a separate dashpot indicate the path steam will travel when the valves (colored 
abruptly closes the valve. Exhaust valves are closed parts) are open. 
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(a) 





by rods, as are other sliding valves. 

Lifting valves are more suitable for use with high- 
temperature steam. They, too, are of numerous 
forms. The poppet valve is representative. 

Valves are driven through a crank or eccentric on 
the main crankshaft. The crank angle is set to open 
the steam port near dead center, when the piston 
is at its extreme position in the cylinder. The angle 
between valve crank and connecting-rod crank is 
slightly greater than 90°, the excess being the angle 
of advance. So that the valves will open and close 
quickly, they are driven at high velocity with conse- 
quently greater travel than is necessary to open and 
close the ports. The additional travel of a sliding valve 
is the steam lap and the exhaust lap. The greater the 
lap, the greater the angle of advance to obtain the 
proper timing of the valve action. 

Engine power is usually controlled by varying the 
period during which steam is admitted. A shifting 
eccentric accomplishes this function or, in releasing 
Corliss and in poppet valves, the eccentric is fixed 
and cutoff is controlled through a governor to the 
kickoff cams or to the latch that allows the valves to 
be closed by their dashpots. 

For high engine efficiency, the ratio of cylinder 
volume after expansion to volume before expansion 
should be high. The volume before expansion into 
which the steam is admitted is the volumetric clear- 
ance. It may be determined by valve design and other 
structural features. For this reason, valves and ports 
are located so as not to necessitate excessive volu- 
metric clearance. See BOILER; STEAM. 

Theodore Baumeister 

Bibliography. E. A. Avallone and T. Baumeister III 
(eds.), Marks’ Standard Handbook for Mechanical 
Engineers, 10th ed., 1996. 


SSS Se 
Steam-generating furnace 

An enclosed space provided for the combustion of 
fuel to generate steam. The closure confines the 
products of combustion and is capable of withstand- 
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(e) 


"O O O 


Water-cooled furnace-wall constructions. (a) Tangent tube. (b) Welded membrane and 
tube. (c) Flat stud and tube. (d) Full stud and refractory-covered tube. (e) Tube and tile. 
(f) Tubes backed by refractory. (After T. Baumeister, ed., Marks’ Standard Handbook for 
Mechanical Engineers, 8th ed., McGraw-Hill, 1978) 





ing the high temperatures developed and the pres- 
sures used. Its dimensions and geometry are adapted 
to the rate of heat release, to the type of fuel, and to 
the method of firing so as to promote complete burn- 
ing of the combustible and suitable disposal of ash. 
In water-cooled furnaces the heat absorbed materi- 
ally affects the temperature of gases at the furnace 
outlet and contributes directly to the generation of 
steam. See STEAM. 

Furnace walls. Prior to 1925, most furnaces were 
constructed of firebrick. As the capacity and the 
physical size of boiler units increased and the sus- 
pension burning of pulverized coal was developed, 
limitations were imposed by the height of the re- 
fractory walls that could be made self-supporting at 
high temperatures and by the inability of refractories 
to resist the fluxing action of molten fuel ash. Thus 
refractory is used primarily for special-purpose fur- 
naces, for the burner walls of small boilers, or for the 
arches of stoker-fired boilers to promote the ignition 
of the fuel. See REFRACTORY. 

The limitations of refractory constructions can be 
extended somewhat by cooling the brickwork with 
air flowing through channels in the structure, or by 
sectionalizing the wall into panels and transferring 
the structural load to external air-cooled steel or cast- 
iron supporting members. The heat absorbed can be 
recovered by using the cooling air for combustion, 
and the low rate of heat transfer through the refrac- 
tory helps to maintain high furnace temperatures, 
thus accelerating ignition and the burning of the fuel. 

The low tensile strength of refractories restricts 
the geometrical shapes that can be built, making it 
difficult to provide overhanging contours or roof 
closures. Thus sprung arches or shaped tiles sus- 
pended from steel must be used. Many refractory 
mixtures having air-setting or hydraulic-setting prop- 
erties are available, and they can be used to form 
monolithic structures by ramming, guniting, or pour- 
ing in forms. 

Water-cooling of all or most of the furnace walls 
is almost universally used in boiler units of all capac- 
ities and for all types of fuel and methods of firing. 
The water-cooling of the furnace walls reduces the 
transfer of heat to the structural members and, conse- 
quently, their temperatures are within the limits that 
provide satisfactory strength and resistance to oxida- 
tion. Water-cooled tube constructions facilitate large 
furnace dimensions and optimum arrangements of 
the roof, hopper, and arch, as well as the mount- 
ings for the burners and the provision for screens, 
platens, or division walls to increase the amount of 
heat-absorbing surface exposed in the combustion 
zone. External heat losses are small and are further 
reduced by the use of insulation. 

Furnace wall tubes are spaced on close centers 
to obtain maximum heat absorption and to facilitate 
ease of ash removal (see illus.). The so-called tan- 
gent tube construction, wherein adjacent tubes are 
almost touching with only a small clearance provided 
for erection purposes, has been frequently used. 
However, most new boiler units use membrane-tube 
walls, in which a steel bar or membrane is welded 


between adjacent tubes. This construction facilitates 
the fabrication of water-cooled walls in large shop- 
assembled tube panels. Less effective cooling is ob- 
tained when the tubes are wider spaced and ex- 
tended metal surfaces, in the form of flat studs, are 
welded to the tubes; if even less cooling is desired, 
the tube spacing can be increased and refractory in- 
stalled between or behind the tubes to form the wall 
enclosure. 

Furnace walls must be adequately supported, with 
provision for thermal expansion and with reinforc- 
ing buckstays to withstand the lateral forces result- 
ing from the difference between the furnace pres- 
sure and the surrounding atmosphere. The enclosure 
walls must prevent air infiltration when the furnace 
is operated under suction, and must prevent gas leak- 
age when the furnace is run at pressures above at- 
mospheric. 

Additional furnace cooling, in the form of tubular 
platens, division walls, or wide-spaced screens, often 
is used; in high heat input zones the tubes may be 
protected by refractory coverings anchored to the 
tubes by studs. See FURNACE CONSTRUCTION. 

Heat transfer. Heat-absorbing surfaces in the fur- 
nace receive heat from the products of combustion 
and, consequently, lower the furnace gas tempera- 
ture. The principal mechanisms of heat transfer take 
place simultaneously and include intersolid radiation 
from the fuel bed or fuel particles, nonluminous radi- 
ation from the products of combustion, convection 
from the furnace gases, and conduction through any 
deposits and the tube metal. The absorption effec- 
tiveness of the surface is greatly influenced by the 
deposits of ash or slag. 

Analytical solutions to the transfer of heat in the 































































































Steam-generating unit 


furnaces of steam-generating units are extremely 
complex, and it is very difficult, if not impossible, 
to calculate furnace-outlet gas temperatures by theo- 
retical methods. Nevertheless, the furnace-outlet gas 
temperature must be predicted accurately because 
this temperature determines the design of the re- 
mainder of the unit, particularly that of the super- 
heater. Calculations therefore are based upon test 
results supplemented by data accumulated from op- 
erating experience and judgment predicated upon 
knowledge of the principles of heat transfer and the 
characteristics of fuels and slags. See HEAT TRANSFER; 
STEAM-GENERATING UNIT. George W. Kessler 

Bibliography. E. A. Avallone and T. Baumeister III, 
Marks’ Standard Handbook for Mechanical Engi- 
neers, 10th ed., 1996; R. D. Reed, Furnace Opera- 
tions, 3d ed., 1981; M. W. Thring, The Science of 
Flames and Furnaces, 1962. 


See 
Steam-generating unit 


The wide diversity of parts, appurtenances, and func- 
tions needed to release and utilize a source of heat 
for the practical production of steam at pressures to 
5000 Ib/in.? (34 megapascals) and temperatures to 
1100°F (600°C), often referred to as a steam boiler 
for brevity. See STEAM. 

The essential steps of the steam-generating pro- 
cess include (1) a furnace for the combustion of fuel, 
or a nuclear reactor for the release of heat by fission, 
or a waste heat system; (2) a pressure vessel in which 
feedwater is raised to the boiling temperature, evap- 
orated into steam, and generally superheated beyond 
the saturation temperature; and (3) in many modern 
















































































(a) Lif 





(b) (c) 


Fig. 1. Large, central-station steam-generating unit, with pump-assisted circulation, corner-fired with tangential, tilting 
burners for pulverized coal, oil, or gas. (a) Plan; (b) burners tilted down; (c) burners tilted up. Output is 2,000,000 Ib steam/h 
(250 kg/s) at 2400 Ib/in.? (16.5 MPa), 1000°F (540°C), and 1000°F (540°C) reheat. (Combustion Engineering, Inc.) 
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central station units, a reheat section or sections for 
resuperheating steam after it has been partially ex- 
panded in a turbine. This aggregation of functions 
requires a wide assortment of components, which 
may be variously employed in the interests, primar- 
ily, of capacity and efficiency in the steam-production 
process. The selection, design, operation, and main- 
tenance of these components constitute a complex 
process which, in the limit, calls for the highest tech- 
nical skill. 

Service requirements. The wide diversity of com- 
mercial steam-generating equipment stems from at- 
tempts to accommodate design to the dictates of the 
imposed service conditions. 

Safety. The pressure parts, a potential bursting haz- 
ard, must be designed, maintained, and operated 
under the most stringent codes, such as the ASME 
Boiler and Pressure Vessel Code. The codes and reg- 
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Fig. 2. Boiling-water reactor (BWR) for a large central-station electric generating plant 
showing essential components. Typical performance is 6,000,000 Ib/h (750 kg/s) at 
1000 Ib/in.? (7 MPa), saturated steam. (General Electric Co.) 


ulations are backed by the police power of the state. 

Shape. Physical shape must fit the limits imposed. 
The differences between marine and stationary ap- 
plications are an example. 

Bulk. Space occupied has various degrees of value, 
and the equipment must best comply with these dic- 
tates. 

Weight. Frequently, weight is the lowest common 
denominator in determining portability and eco- 
nomic suitability. 

Setting. Confinement of heat source operations and 
the ensuing transport function will vary the details 
of the setting, for example, the use of metal or of 
refractory enclosure. 

Character of labor. Highly skilled labor, as in atomic 
power plants, can utilize design details that would 
be prohibitive in areas where the labor is primitive. 

Cleanability. Impurities on the heat-transfer and heat- 
release surfaces must not impair safe or efficient op- 
eration; surfaces must be cleanable to give accept- 
able reliability and performance. 

Life. Short or long life and short or long operating 
periods between overhauls are vital to the selection 
of a preferred design. 

Efficiency. Inherent high efficiency can be built into 
a design, but economics will dictate the efficiency 
chosen; the full realization of built-in high efficiency 
requires the services of skilled operators and fire- 
fighters. 

Cost. The overall cost of steam in cents per 1000 Ib 
or per 1000 kg is the ultimate criterion, reflecting 
investment, operating, and maintenance expenses in 
commercial operations. 

Adaptation to use. These service requirements dif- 
fer in each installation so that a wide variety of de- 
signs is available in the commercial markets (Figs. 1 
and 2). There is no one best design for universal 
application, but propriety rests in the proper selec- 
tion (see table). The designs are characterized by 
the various degrees of applicability of component 
parts such as steam drums and tubes; fuel-burning 
equipment and furnaces; draft systems for air sup- 
ply and flue gas removal; blowers, fans, and stacks; 
heat traps such as economizers and air preheaters; 
structural steel for the support of parts with ample 
provision for expansion; a casing or setting with pos- 
sible utilization of water walls, refractory, and insula- 
tion; pumps for boiler feed and for water circulation; 
fuel- and refuse-handling systems; feedwater purifi- 
cation systems; blowdown systems and soot-blowing 
equipment; and a wide assortment of accessories and 
instruments such as pressure gages, safety valves, 
water-level indicators, and sophisticated automatic 
control with its elaborate interlocks for foolproof, 
efficient operation and complete computerization. 

The maintenance of a safe working temperature 
for metal parts requires ample circulation of water 
over the steam-generating parts and ample circu- 
lation of steam over the superheating and reheat- 
ing parts. Water circulation in the generating sec- 
tions may be by natural convection processes where 
ample physical height gives the circulatory pres- 
sure difference between a column of water in the 
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Performance characteristics of some steam-generating units 











Steam 
Steam output, Steam pressure, temperature, Boiler 
Boiler type Fuel (potential) Ib/h (kg/s) Ib/in.? (MPa) Te (Ce) efficiency, % 
Horizontal return tubular (HRT), fire-tube Coal, oil, gas 10,000 (1.3) 150 (1.0) Dry and saturated 70 
Large industrial, natural-circulation, water-tube Coal, oil, gas 300,000 (38) 1400 (9.7) 900 (480) 88 
Large central station, natural-circulation Coal, oil, gas 2,000,000 (250) 2700 (18.6) 1050/1050 90 
(570/570) 
Large central station, supercritical, Coal, oil, gas 5,000,000 (630) 3600 (24.8) 1000/1000/1000 90 
once-through (540/540/540) 
Boiling-water reactor (BWR) Uranium 6,000,000 (750) 1000 (6.9) Dry and saturated 98+ 
(550 £°F or 
290 °C) 
Pressurized-water reactor (PWR), hot water, Uranium 6,000,000 (750) 700 (4.8) Dry and saturated 98 
4 loops (500 + °F or 
260 °C) 
Electric boiler, small 200+ volts 2,000 (0.25) 150 (1.0) Dry and saturated 98+ 





downcomer and a column of steam mixed with 
water in the riser parts. With high operating steam 
pressures, such as 1400 Ib/in.? (9.7 MPa), pump- 
assisted circulation is often selected. With supercrit- 
ical operation (above 3200 Ib/in.? or 22.1 MPa) there 
is no difference in density between liquid and vapor 
so that forced-flow, once-through principles super- 
sede water recirculation practice. Such supercritical 
steam generators have no steam or separating drums 
and, like nuclear steam generators, require the high- 
est purity of feed and boiler water (typically 10~? 
part of impurities) to avoid deposits in heated cir- 
cuits or the transport of solids to the turbine. See 
BOILER; BOILER ECONOMIZER; FIRE-TUBE BOILER; MA- 
RINE BOILER; RAW WATER; REHEATING; STEAM SEPARA- 
TOR; STEAM TEMPERATURE CONTROL; SUPERHEATER; 
WATER-TUBE BOILER. Theodore Baumeister 

Bibliography. American Society of Mechanical Engi- 
neers, Boiler And Pressure Vessel Code 2004, 2003; 
E. A. Avallone, T. Baumeister III, and A. Sadegh (eds.), 
Mark’s Standard Handbook for Mechanical Engi- 
neers, 11th ed., 2006; J. J. Jackson, Steam Boiler Op- 
eration; Principles and Practices, 2d ed., 1987; E. B. 
Woodruff, H. B. Lammers, and T. E Lammers, Steam 
Plant Operation, 8th ed., 2004. 
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Steam heating 


A heating system that uses steam generated from 
a boiler. The steam heating system conveys steam 
through pipes to heat exchangers, such as radiators, 
convectors, baseboard units, radiant panels, or fan- 
driven heaters, and returns the resulting condensed 
water to the boiler. Such systems normally operate 
at pressure not exceeding 15 lb/in.? gage or 103 kilo- 
pascals gage, and in many designs the condensed 
steam returns to the boiler by gravity because of the 
static head of water in the return piping. With utiliza- 
tion of available operating and safety control devices, 
these systems can be designed to operate automati- 
cally with minimal maintenance and attention. 
One-pipe system. In a one-pipe steam heating sys- 
tem, a single main serves the dual purpose of sup- 
plying steam to the heat exchanger and conveying 


condensate from it. Ordinarily, there is but one con- 
nection to the radiator or heat exchanger, and this 
connection serves as both the supply and return; 
separate supply and return connections are some- 
times used. Because steam cannot flow through the 
piping or into the heat exchanger until all the air is 
expelled, automatic air-venting valves must be pro- 
vided on all exchangers and at the ends of all mains. 
These valves may be ofa type which closes whenever 
steam or water comes in contact with the operating 
element but which also permits air to flow back into 
the system as the pressure drops. A vacuum valve 
closes against subatmospheric pressure to prevent 
return of air. 

Two-pipe system. A two-pipe system is provided 
with two connections from each heat exchanger, and 
in this system steam and condensate flow in sepa- 
rate mains and branches (Fig. 1). A vapor two-pipe 
system operates at a few ounces above atmospheric 
pressure, and in this system a thermostatic trap is 
located at the discharge connection from the heat 
exchanger which prevents steam passage, but per- 
mits air and condensation to flow into the return 
piping. 

When the steam condensate cannot be returned 
by gravity to the boiler in a two-pipe system, an alter- 
nating return lifting trap, condensate return pump, 
or vacuum return pump must be used to force the 
condensate back into the boiler. In a condensate 
return-pump arrangement, the return piping is ar- 
ranged for the water to flow by gravity into a collect- 
ing receiver or tank, which may be located below the 
steam-boiler waterline. A motor-driven pump con- 
trolled from the boiler water level then forces the 
condensate back to the boiler. 

In large buildings extending over a considerable 
area, it is difficult to locate all heat exchangers above 
the boiler water level or return piping. For these sys- 
tems a vacuum pump is used that maintains a suction 
below atmosphere up to +25 in. or £64 cm (max) 
of mercury in the return piping, thus creating a pos- 
itive return flow of air and condensate back to the 
pumping unit. Subatmospheric systems are similar 
to vacuum systems, but in contrast provide a means 
of partial vacuum control on both the supply and 
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Fig. 1. Two-pipe up-feed system with automatic return trap. 


return piping so that the steam temperature can be 
regulated to vary the heat emission from the heat 
exchanger in direct proportion to the heat loss from 
the structure. 

Figure 2 depicts a two-pipe vacuum heating sys- 
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tem which uses a condensation pump as a mechan- 
ical lift for systems where a part of the heating sys- 
tem is below the boiler room. Note that the low 
section of the system is maintained under the same 
vacuum conditions as the rest of the system. This is 
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Fig. 2. Layout of a vacuum heating system with condensati 








on and vacuum pumps. 


accomplished by connecting the vent from the re- 
ceiver and pump discharge to the return pipe located 
above the vacuum heating pump. 

With the wide acceptance of all-year air condi- 
tioning, low-pressure steam boilers have been used 
to produce cooling from absorption refrigeration 
equipment. With this system the boiler may be used 
for primary or supplementary steam heating. 

Exhaust from gas- or oil-driven turbines or engines 
may be used in waste heat boilers or separators, 
along with a standby boiler to produce steam for 
a heating system. 

Another source for steam for heating is from 
a high-temperature water source (350-450°F or 
180-230°C) using a high-pressure water to low- 
pressure steam heat exchanger. See BOILER; COM- 
FORT HEATING; OIL BURNER; STEAM-GENERATING FUR- 
NACE, John W. James 

Bibliography. M. Greenwald, Residential Hot Water 
and Steam Heating with Gas, Oil and Solid Fuels, 
1987; E. B. Woodruff, H. B. Lammers, and T. E 
Lammers, Steam-Plant Operation, 8th ed., 2004. 
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Steam jet ejector 


A steam-actuated device for pumping compressible 
fluids, usually from subatmospheric suction pressure 
to atmospheric discharge pressure. A steam jet ejec- 
tor is most frequently used for maintaining vacuum 
in process equipment in which evaporation or con- 
densation takes place. Because of its simplicity, com- 
pactness, reliability, and generally low first cost, it is 
often preferred to a mechanical vacuum pump for re- 
moving air from condensers serving steam turbines, 
especially for marine service. See VACUUM PUMP. 
Principle. Compression of the pumped fluid is ac- 
complished in one or more stages, depending upon 
the total compression required. Each stage consists 
of a converging-diverging steam nozzle, a suction 
chamber, and a venturi-shaped diffuser. Steam is sup- 
plied to the nozzle at pressures in the range 100- 
250 Ib/in.” gage (0.7-1.7 megapascals). A portion of 
the enthalpy of the steam is converted to kinetic en- 
ergy by expanding it through the nozzle at substan- 
tially constant entropy to ejector suction pressure, 
where it reaches velocities of 3000-4500 ft/s (900- 
1400 m/s). The air or gas, with its vapor of saturation, 
which is to be pumped and compressed is entrained, 
primarily by friction in the high-velocity steam jet. 
The impulse of the steam produces a change in the 
momentum of the air or gas vapor mixture as it mixes 
with the motive steam and travels into the converg- 
ing section of the diffuser. In the throat or most re- 
stricted area of the diffuser, the energy transfer is 
completed and the final mixture of gases and vapor 
enters the diverging section of the diffuser, where its 
velocity is progressively reduced. Here a portion of 
the kinetic energy of the mixture is reconverted to 
pressure with a corresponding increase in enthalpy. 
Thus the air or gas is compressed to a higher pressure 
than its entrance pressure to the ejector. The com- 
pression ratios selected for each stage of a steam jet 
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Fig. 1. Typical multistage steam jet ejector with contact 
barometric condensers; first and second stages 
condensing and third stage noncondensing. 


ejector usually vary from about 4 to 7. See DIFFUSER. 

Application. Two or more stages may be arranged 
in series, depending upon the total compression 
ratio required (Fig. 1). Two or more sets of series 
stages may be arranged in parallel to accommodate 
variations in capacity. 

Vapor condensers are usually interposed between 
the compression stages of multistage steam jet ejec- 
tors to condense and remove a significant portion 
of the motive steam and other condensable vapors 
(Fig. 2 ). This action reduces the amount of fluid to 
be compressed by the next higher stage and results 
in a reduction in the motive steam required. Both 
surface and contact types of vapor condensers are 
used for this purpose. See VAPOR CONDENSER. 

Ejectors used as air pumps for steam condensers 
that serve turbines are usually two-stage and are 
equipped with inter- and aftercondensers of the sur- 
face type. The steam condensed is drained through 
traps to the main condenser and returned to the 
boiler feed system. Ejectors used as vacuum pumps 
in process systems may be equipped with either sur- 
face condensers or contact condensers of the baro- 
metric type between or after stages or both. They 
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Fig. 2. Multistage steam jet ejector with surface inter- and aftercondenser; first and second stages condensing. 


may be single or multistage machines. High-vacuum 
process ejectors with as many as seven stages in 
series have been built. Industrial or process ejec- 
tors are frequently used instead of mechanical vac- 
uum pumps to pump corrosive vapors because they 
can be manufactured economically from almost any 
corrosion-resistant material. 

The number of stages of compression usually used 
for various suction pressures with atmospheric dis- 
charge pressure is as follows: 


No. of Range of suction pressure 

ejector stages 

1 3-30 in. Hg (10-102 kPa) abs. 

2 0.4-4 in. Hg (1.4-14 kPa) abs. 

3 1-25 mmHg (0.13-3.3 kPa) abs. 

4 0.15-3 mmHg (20-400 Pa) abs. 

5 20-300 um (2.7-40 Pa) 

6 5-20 um (0.7-2.7 Pa) 

7 1-5 um (0.13-0.7 Pa) 


Ejectors are also made which use air or other gases 
instead of steam as the energy source. 


Joseph F. Sebald 
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Steam separator 

A device for separating a mixture of the liquid and 
vapor phases of water. Steam separators are used in 
most boilers and may also be used in saturated steam 


lines to separate and remove the moisture formed 
because of heat loss. 

In boilers, steam separators must perform effi- 
ciently because both steam-free water and water- 
free steam are required. The force promoting water 
circulation in a natural-circulation boiler is derived 
from the density difference between the water in 
the downcomer tubes and the steam-water mixture 
in the riser tubes. Solid (steam-free) water in the 
downcomers maximizes this force. Steam-free water 
is equally important in a forced-circulation boiler to 
prevent cavitation in the pump. 

Water-free steam is quite necessary because dis- 
solved and suspended solids entering the boiler with 
the feedwater concentrate in the boiler water, and 
such solids contaminate the steam if all the water is 
not separated. Impurity concentrations as low as a 
few parts per billion can cause solid deposits to form 
in turbines or can cause product contamination if 
the steam is used in direct contact with a product. 
Solid deposits in turbines decrease turbine capacity 
and efficiency and may even cause unbalance of the 
rotor in extreme cases. Consequently, the sampling 
of steam and the measurement of impurities in steam 
have received much attention. 

Many low-pressure boilers operate satisfactorily 
without steam separators because their capacity rat- 
ings are quite conservative. Steam separation in such 
boilers may be compared with a pan of water boiling 
on a stove at a moderate rate. However, it is known 
that if the fire under the pan is turned too high or 
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Steam dryers and cyclone steam separators separate steam and water in high-capacity boiler unit. Arrows show direction of 


flow. 


if the water contains certain solids dike milk, for ex- 
ample), separation becomes inadequate and the pan 
boils over. Similar phenomena take place in natural 
separation boilers if the load is too high or the con- 
centration of solids in the water is too great. 

Operating pressure also influences natural sepa- 
ration. The force effecting separation results from 
the difference in density between water and steam. 
This difference is quite great at low pressures, but as 
pressure is increased, the density of water decreases 
while that of steam increases, resulting in a steadily 
declining density difference. 

Because of these effects, steam separators are used 
on the majority of boilers. Steam separators have 
many forms and may be as fundamental as a sim- 
ple baffle that utilizes inertia caused by a change of 
direction. Most modern, high-capacity boilers use a 
combination of cyclone separators and steam dryers, 
as shown in the illustration. 

Cyclone separators have two major advantages. 
First, they slice the steam-water mixture into thin 


streams so that the steam bubbles need travel only 
short distances through the mixture to become dis- 
engaged. Second, they whirl the mixture in a circular 
path, creating a centrifugal force many times greater 
than the force of gravity. 

Cyclone separators remove practically all of the 
steam from the water, producing the steam-free 
water needed for the boiler downcomer tubes. They 
also remove the major portion of the water from 
the steam. After leaving the cyclone separators, the 
steam is passed through the dryers, where the last 
traces of moisture are removed. Steam dryers remove 
small droplets of water from the steam by providing 
a series of changes in direction and a large surface 
area to intercept the droplets. 

Because of the efficient operation of steam sep- 
arators, steam is one of the purest mass-produced 
commodities made. See BOILER; BOILER FEEDWATER; 
STEAM; STEAM TURBINE. Earl E. Coulter 

Bibliography. American Society for Testing and 
Materials, Methods of Sampling Steam, ASTM 
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Designation D1066; ASTM, Methods of Test for 
Deposit-Forming Impurities in Steam, ASTM 
Designation D2186; ASTM, ASTM Standards, pt. 23: 
Water:Atmospheric Analysis, annually; P. M. 
Brister, E G. Raynor, and E. A. Pirsh, The Influence of 
Boiler Design and Operating Conditions on Steam 
Contamination, ASME Pap. 51-A-95; M. J. Moore 
and C. H. Sieverding, Two-Phase Steam Flow in Tur- 
bines and Separators: Theory, Instrumentation, 
Engineering, 1976. 
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Steam temperature control 


Means for regulating the operation of a steam- 
generating unit to produce steam at the required 
temperature. The temperature of steam is affected 
by the change in the relative heat absorption as load 
varies, by changes in ash or slag deposits on the heat- 
absorbing surface, by changes in fuel, by changes in 
the proportioning of fuel and combustion air, or by 
changes in feedwater temperature. Low steam tem- 
perature lowers the efficiency of the thermal cycle. 
However, high steam temperature, which increases 
thermal efficiency, is restricted by the strength and 
durability of materials used in superheaters. Con- 
trol of steam temperature is, therefore, a matter of 
primary concern in the design of modern steam- 
generating units. 

Steam temperature can be controlled by one or 
more of several methods (see illus.). These include 
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Methods of controlling steam temperature. (a) Bypass 
dampers. (b) Tilting burners. (c) Spray attemperation. 
(d) Gas recirculation. 


(1) the damper control of gases to the superheater, to 
the reheater, or to both, thus changing the heat input; 
(2) the recirculation of low-temperature flue gas to 
the furnace, thus changing the relative amounts of 
heat absorbed in the furnace and in the superheater, 
reheater, or both; (3) the selective use of burners at 
different elevations in the furnace or the use of tilting 
burners, thus changing the location of the combus- 
tion zone with respect to the furnace heat-absorbing 
surface; (4) the attemperation, or controlled cool- 
ing, of the steam by the injection of spray water or 
by the passage of a portion of the steam through 
a heat exchanger submerged in the boiler water; 
(5) the control of the firing rate in divided furnaces; 
and (6) the control of the firing rate relative to the 
pumping rate of the feedwater to forced-flow once- 
through boilers. Generally, these various controls are 
adjusted automatically. See STEAM-GENERATING UNIT. 

George W. Kessler 
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Steam turbine 


A machine for generating mechanical power in ro- 
tary motion from the energy of steam at temperature 
and pressure above that of an available sink. By far 
the most widely used and most powerful turbines 
are those driven by steam. In the United States well 
over 85% of the electrical energy consumed is pro- 
duced by steam-turbine-driven generators. Individ- 
ual turbine ratings historically have tended to follow 
the increasing capacity trend but have reached limits 
imposed by material and machine design considera- 
tions. The largest unit shipped during the 1950s was 
rated 500 MW. Units rated about 1100 MW were in 
service by the close of the 1960s, and ratings up to 
1300 MW saw frequent application in the 1970s and 
early 1980s. Units of all sizes, from a few horsepower 
to the largest, have their applications. 

Until the 1960s essentially all steam used in turbine 
cycles was produced in boilers burning fossil fuels 
(coal, oil, and gas) or, in minor quantities, certain 
waste products. The 1960s marked the beginning of 
the introduction of commercial nuclear power. See 
ELECTRIC POWER GENERATION; NUCLEAR POWER. 

Turbine parts. Figure 1 shows a small, simple 
mechanical-drive turbine of a few horsepower. It il- 
lustrates the essential parts for all steam turbines re- 
gardless of rating or complexity: (1) a casing, or shell, 
usually divided at the horizontal center line, with the 
halves bolted together for ease of assembly and disas- 
sembly; it contains the stationary blade system; (2) a 
rotor carrying the moving buckets (blades or vanes) 
either on wheels or drums, with bearing journals on 
the ends of the rotor; (3) a set of bearings attached 
to the casing to support the shaft; (4) a governor 
and valve system for regulating the speed and power 
of the turbine by controlling the steam flow, and an 
oil system for lubrication of the bearings and, on all 
but the smallest machines, for operating the control 
valves by a relay system connected with the gover- 
nor; (5) a coupling to connect with the driven ma- 
chine; and (6) pipe connections to the steam supply 
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Fig. 1. Cutaway of small single-stage steam turbine. 
(General Electric Co.) 


at the inlet and to an exhaust system at the outlet of 
the casing or shell. 

Applications. Steam turbines are ideal prime 
movers for applications requiring rotational mechan- 
ical input power. They can deliver constant or vari- 
able speed and are capable of close speed con- 
trol. Drive applications include centrifugal pumps, 
compressors, ship propellers, and, most importantly, 
electric generators. 

The turbine shown in Fig. 1 is a small mechanical- 
drive unit. Units of this general type provide 10- 
1000 hp (7.5-750 kW) with steam at 100-600 Ib/in.” 
gage (0.7-4.1 megapascals) inlet pressure and tem- 
peratures to 800°F (427°C). These and larger mul- 
tistage machines drive small electric generators, 
pumps, blowers, air and gas compressors, and paper 
machines. A useful feature is that the turbine can 
be equipped with an adjustable-speed governor and 
thus be made capable of producing power over a 
wide range of rotational speeds. In such applications 
efficiency varies with speed (Fig. 2), being 0 when 
the rotor stalls at maximum torque and also 0 at the 
runaway speed at which the output torque is 0. Maxi- 
mum efficiency and power occur where the product 
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Fig. 2. Illustrative stage performance versus speed. 


of speed and torque is the maximum. 

Many industries need steam at one or more pres- 
sures (and consequently temperatures) for heating 
and process purposes. Frequently it is more econom- 
ical to produce steam at high pressure, expand it par- 
tially through a turbine, and then extract it for fur- 
ther processing, rather than using a separate boiler 
at the process steam pressure. Figure 3 is a cross 
section through an industrial automatic extraction 
turbine. The left set of valves admits steam from the 
boiler at the flow rate to provide the desired electri- 
cal load. The steam flows through five stages to the 
controlled extraction point. The second set of valves 
acts to maintain the desired extraction pressure by 
varying the flow through the remaining 12 stages. 
Opening these internal valves increases the flow to 
the condenser and lowers the controlled extraction 
pressure. 

Industrial turbines are custom-built in a wide va- 
riety of ratings for steam pressures to 2000 Ib/in.? 
gage (14 MPa), for temperatures to 1000°F (538°C), 
and in various combinations of nonextracting, sin- 
gle and double automatic extraction, noncondensing 
and condensing. Turbines exhausting at or above at- 
mospheric pressure are classed as noncondensing re- 
gardless of what is done with the steam after it leaves 
the turbine. If the pressure at the exhaust flange is 
less than atmospheric, the turbine is classed as con- 
densing. 

Turbines in sizes to about 75,000 hp (56 MW) are 
used for ship propulsion. The drive is always through 
reduction gearing (either mechanical or electrical) 
because the turbine speed is in the range of 4000- 
10,000 rpm, while 50-200 rpm is desirable for ship 
propellers. Modern propulsion plants are designed 
for steam conditions to 1450 lb/in.? gage (10.0 MPa) 
and 950°F (510°C) with resuperheating to 950°F 
(510°C). Fuel consumption rates as low as 0.4 lb of oil 
per shaft-horsepower-hour (0.07 kg/megajoule) are 
achieved. See SHIP POWERING, MANEUVERING, AND 
SEAKEEPING. 

Central station generation of electric power pro- 
vides the largest and most important single applica- 
tion of steam turbines. Ratings smaller than 50 MW 
are seldom employed today; newer units are rated as 
large as 1300 MW. Large turbines for electric power 
production are designed for the efficient use of steam 
in a heat cycle that involves extraction of steam for 
feedwater heating, resuperheating of the main steam 
flow Cin fossil-fuel cycles), and exhausting at the low- 
est possible pressure economically consistent with 
the temperature of the available condenser cooling 
water. 

In fossil-fuel-fired cycles, steam pressures are usu- 
ally in the range of 1800-3500 Ib/in.” gage (12.4- 
24.1 MPa) and tend to increase with rating. Temper- 
atures of 950-1050°F (510-570°C) are used, with 
1000°F (540°C) the most common. Single resuper- 
heat of the steam to 950-1050°F (510-570°C) is 
almost universal. A second resuperheating is occa- 
sionally employed. Figure 4 shows a typical unit 
designed for fossil-fuel steam conditions. Tandem- 
compound double-flow machines of this general 
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Fig. 3. Cross-section view of single-automatic-extraction condensing steam turbine. (General Electric Co.) 


arrangement are applied over the rating range of 
100-400 MW. Initial steam flows through the steam 
admission valves and passes to the left through the 
high-pressure portion of the opposed flow rotor. 
After resuperheating in the boiler it is readmitted 
through the intercept valves and flows to the right 
through the reheat stages, then crosses over to the 
double-flow low-pressure rotor, and exhausts down- 
ward to the condenser. See SUPERHEATER. 

The water-cooled nuclear reactor systems com- 
mon in the United States provide steam at pres- 











sures of about 1000 Ib/in.? gage. (6.9 MPa), with 
little or no initial superheat. Temperatures higher 
than about 600°F (315°C) are not available. Further, 
reactor containment and heat-exchanger considera- 
tions preclude the practical use of resuperheating 
at the reactor. The boiling-water reactor, for exam- 
ple, provides steam to the turbine cycle at 950-Ib/in.? 
gage (6.55-MPa) pressure and the saturation temper- 
ature of 540°F (282°C). Such low steam conditions 
mean that each unit of steam flow through the tur- 
bine produces less power than in a fossil-fuel cycle. 
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Fig. 4. Partial cutaway view of 3600-rpm fossil-fuel turbine generator. (General Electric Co.) 


Fewer stages in series are needed but more total flow 
must be accommodated for a given output. Nuclear 
steam conditions often produce a turbine expan- 
sion with water of condensation present throughout 
the entire stream path. Provisions must be made to 
control the adverse effects of water: erosion, cor- 
rosion, and efficiency loss. In consequence of the 
differences in stream conditions, the design for a 
nuclear turbine differs considerably from that of a 
turbine for fossil fuel application. The former tend 
to be larger, heavier, and more costly. For example, 
at 800 MW, a typical fossil-fuel turbine can be built 
to run at 3600 rpm, is about 90 ft (27 m) long, and 
weighs about 1000 tons (900 metric tons). A compa- 
rable unit for a water-cooled nuclear reactor requires 
1800 rpm, is about 125 ft (48 m) long, and weighs 
about 2500 tons (2300 metric tons). See NUCLEAR 
REACTOR. 

Figure 5 represents a large nuclear turbine gener- 
ator suitable for ratings of 1000 to 1300 MW. Steam 
from the reactor is admitted to a double-flow high- 
pressure section, at the left, through four parallel 
pairs of stop and control valves, not shown. The stop 
valves are normally fully open and are tripped shut to 
prevent dangerous overspeed of the unit in the event 
of loss of electrical load combined with control mal- 
function. The control valves regulate output by vary- 
ing the steam flow rate. The steam exhausting from 
the high-pressure section is at 150 to 200 Ib/in.? abs. 
(1.0 to 1.4 MPa absolute pressure) and contains about 
13% liquid water by weight. The horizontal cylinder 
alongside the foundation is one of a pair of symmet- 
rically disposed vessels performing three functions. 
A moisture separator removes most of the water in 
the entering steam. Two steam-to-steam reheaters 
follow. Each is a U-tube bundle which condenses 
heating steam within the tubes and superheats the 
main steam flow on the shell side. The first stage uses 
heating steam extracted from the high-pressure tur- 
bine. The final stage employs reactor steam which 
permits reheating to near initial temperature. Alter- 
nate cycles employ reheat with initial steam only or 
moisture separation alone with no reheat. Reheat en- 
hances cycle efficiency at the expense of increased 
investment and complexity. The final choice is eco- 
nomic, with two-stage steam reheat, as shown, se- 
lected most frequently. 

Reheated steam is admitted to the three double- 
flow low-pressure turbine sections through six 
combined stop-and-intercept intermediate valves. 
The intermediate valves are normally wide open but 
provide two lines of overspeed defense in the event 
of load loss. Exhaust steam from the low-pressure 
sections passes downward to the condenser, not 
shown in Fig. 5. 

Turbine cycles and performance. Figures 6 and 
7 are representative fossil-fuel and nuclear turbine 
thermodynamic cycle diagrams, frequently called 
heat balances. Heat balance calculations establish 
turbine performance guarantees, provide data for siz- 
ing the steam supply and other cycle components, 
and are the basis for designing the turbine generator. 

The fossil-fuel cycle (Fig. 6) assumes a unit rated 
500 MW, employing the standard steam condi- 
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Fig. 5. An 1800-rpm nuclear turbine generator with combined moisture separator and 


two-stage steam reheater. (General Electric Co.) 


tions of 2400 Ib/in.2 gage (2415 lIb/in.? abs. or 
16.65 MPa absolute) and 1000°F (538°C), with re- 
superheat to 1000°F (538°C). As can be seen in 
the upper left corner, the inlet conditions corre- 
spond to a total heat content, or enthalpy, of 1461 
Btu/lb (3.398 MJ/kg) of steam flow. A flow rate of 
3,390,000 1b/h (427.1 kg/s) is needed for the desired 
output of 500 MW (500,000 kW). For efficiency con- 
siderations the regenerative feedwater heating cycle 
is used. Eight heaters in series are employed so that 
water is returned to the boiler at 475°F (246°C) and 
459 Btu/Ib (1.068 MJ/kg) enthalpy, rather than at the 
condenser temperature of 121°F (49°C). Because of 
the higher feedwater temperature, the boiler adds 
heat to the cycle at a higher average temperature, 
more closely approaching the ideal Carnot cycle, in 
which all heat is added at the highest cycle temper- 
ature. The high-pressure turbine section exhausts to 
the resuperheater at 530 psia (3.65 MPa absolute) 
pressure and 1306 Btu/lb (3.038 MJ/kg) enthalpy. 
The reheat flow of 3,031,000 Ib/h (581.9 kg/s) re- 
turns to the reheat or intermediate turbine section 
at 490 Ib/in.? abs. (3.38 MPa absolute) pressure and 
1520 Btu/lb G.536 MJ/kg) enthalpy. These data are 
sufficient to calculate the turbine heat rate, or unit 
heat charged against the turbine cycle. The units are 
Btu of heat added in the boiler per hour per kilowatt 
of generator output. Considering both the initial and 
reheat steam, the heat rate is given by Eq. (1). 


Turbine heat rate = 
3,390,000 (1461 — 459) + 3,031,000 (1520 — 1306) 
500,000 





= 8090 Btu/kWh 
= 2.37 kW of heat/kW of output a) 


The typical power plant net heat rate is poorer 
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2860 K# > 
475F 400F open 
feedwater 
heater heater 
10F DC 10F DC 10F DC 10F DC ™ DC 10F DC 
to boiler 354F 
3493 K# boiler | 417 K# 
feed pump 
3000P 
U.S. Cus- 
Symbol Quantity tomary Unit SI equivalent Abbreviations 
# Mass flow Ib/h 1.2600 x 10-4 kg/s DC = feedwater heater, 
K# Mass flow 103 Ib/h 0.12600 kg/s drain cooler 
F Temperature °F tC = (tF — 32)/1.8 temperature 
H Enthalpy,steam | Btu/Ib 1.05506 kJ approach 
h Enthalpy, water | Btu/lb 1.05506 kJ TD = feedwater heater, 
P Pressure lb/in.2 abs. | 6.8948 kPa terminal temperature 
Pressure in. Hga 3.3864 kPa difference 
Heat rate Btu/kWh 2.9307 x 10-4 kJ/(kW - s) 
Fig. 6. Typical fossil-fuel steam turbine cycle. 
than the turbine heat rate because of auxiliary power ciency of the example is given by Eq. (3). 
required throughout the plant and because of boiler 
losses. Assuming 3% auxiliary power (beyond the n: = (8412/9270) 100 = 37% 6) 














boiler-feed pump power given in the turbine cycle 
in Fig. 6) and 90% boiler efficiency, the net plant heat 
rate is given by Eq. (2). 


Net plant heat rate 
= 3,390,000 (1461 — 459) 
+3,031,000 (1520 — 1306) 
500,000 [(100 — 3)/100][90/100] 
= 9270 Btu/kWh 





= 2.717 kW of heat/kW of output (2) 


The heat rates of modern fossil-fuel plants fall in 
the range of 8600-10,000 Btu/kWh (2.52-2.93 kW of 
heat/kW of input). Considering that the heat-energy 
equivalent of 1 kWh is 3412 Btu, the thermal effi- 


Figure 6 shows 2,191,000 Ib/h (276.1 kg/s) of 
steam exhausting from the main unit to the con- 
denser at an exhaust pressure of 3.5 in. of mer- 
cury (11.9 kilopascals) absolute. The theoretical ex- 
haust enthalpy (ELEP), without considerations of 
velocity energy loss and friction loss between the 
last turbine stage and the condenser, is 1040 Btu/Ib 
(2.419 MJ/kg). The actual used energy end point 
(UEEP) is 1050 Btu/Ib (2.442 MJ/kg). The exhaust 
heat at the condenser pressure is thermodynamically 
unavailable and is rejected as waste heat to the plant’s 
surroundings. The exhaust steam is condensed at a 
constant 121°F (49°C) and leaves the condenser as 
water at 89 Btu/lb (0.207 MJ/kg) enthalpy. 

On a heat-rate basis, this cycle rejects heat to the 
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Fig. 7. Typical nuclear steam turbine cycle. 


condenser at the approximate rate which is given by 
Eq. (4). 
Net station condenser heat rejection rate 
= 2,191,000 (1050 — 89) + 144,000 (1097 — 89) 
+ 417,000 (100 — 89) 
500,000 (0.97) 





= 4650 Btu/kWh 


= 1.363 kW of heat/kW of output (4) 

If evaporating cooling towers are used, each 
pound of water provides about 1040 Btu (each kilo- 
gram provides about 2.419 MJ) cooling capacity, 
which is equivalent to a required minimum cooling- 
water flow rate of 4.5 lb/kWh (0.57 kg/MJ). The 
cooling-water needs of a large thermal plant are a 
most important consideration in plant site selection. 

The nuclear cycle (Fig. 7) assumes a unit rated 
1210 MW and the steam conditions of the boiling- 
water reactor. Many similarities can be seen to Fig. 6. 
The major differences include moisture separation 
and steam reheating and the lack of need for an in- 
termediate pressure element. The low steam condi- 





tions are apparent. The consequent turbine heat rate 
is given in Eq. (5). 

Turbine heat rate 
15,400,000 (1191 — 398) 
© 1,210,000 
10,090 Btu/kWh 
= 2.957 kW of heat/kW of output 


6) 


A typical nuclear plant also requires about 3% aux- 
iliary power beyond the reactor feed-pump power 
already included in Fig. 7. The equivalent boiler effi- 
ciency approaches 100%, however, and leads to the 
equivalent net plant heat rate given by Eq. (). 


Net plant heat rate 
15,400,000 (1191 — 398) 
~ 1,210,000 [100 — 3)/100} 
10,400 Btu/kWh 
= 3.048 kW of heat/kW of output 


(6) 


The corresponding thermal efficiency is given by 
Eq. O). 
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n: = G412/10,400) 100 = 33% O) 
Heat is rejected at the condenser at a rate given 
approximately by Eq. (8). 


Net station condenser heat rejection rate 


= [8,350,000 (1020 — 89) + 200,000 (1028 — 89) 
+ 2,350,000 (101 — 89)] 





1,210, 000 (0.97) 
= 6810 Btu/kWh 
= 1.996kW of heat/kW of output (8) 


Comparison of the heat rates shows that the nu- 
clear cycle requires about 12% more input heat than 
does the fossil-fuel cycle and rejects about 46% more 
heat to the condenser, thus requiring a correspond- 
ingly larger supply of cooling water. It consumes heat 
priced at about half that from coal or one quarter that 
from oil, and is essentially free of rejection to the at- 
mosphere of heat and combustion products from the 
steam supply. 

Turbine classification. Steam turbines are classified 
(1) by mechanical arrangement, as single-casing, 
cross-compound (more than one shaft side by side), 
or tandem-compound (more than one casing with 
a single shaft); (2) by steam flow direction (axial 
for most, but radial for a few); (3) by steam cycle, 
whether condensing, noncondensing, automatic ex- 
traction, reheat, fossil fuel, or nuclear; and (4) by 
number of exhaust flows of a condensing unit, as 
single, double, triple flow, and so on. Units with as 
many as eight exhaust flows are in use. 

Often a machine will be described by a combina- 
tion of several of these terms. 

The least demanding applications are satisfied by 
the simple single-stage turbine of Fig. 1. For large 
power output and for the high inlet pressures and 
temperatures and low exhaust pressures which are 
required for high thermal efficiency, a single stage is 
not adequate. Steam under such conditions has high 
available energy, and for its efficient utilization the 
turbine must have many stages in series, where each 
takes its share of the total energy and contributes its 
share of the total output. Also, under these condi- 
tions the exhaust volume flow becomes large, and it 
is necessary to have more than one exhaust stage to 
avoid a high velocity upon leaving and consequent 
high kinetic energy loss. Figure 5 is an example of 
a large nuclear turbine generator which has six ex- 
haust stages in parallel. 

Machine considerations. Steam turbines are high- 
speed machines whose rotating parts must be de- 
signed for high centrifugal stress. Difficult stress 
problems are found in long last-stage blading, hot 
inlet blading, wheels, and rotor bodies. 

Casing or shell stresses. The casings or shells at the 
high-pressure inlet end must be high-strength pres- 
sure vessels to contain the internal steam pressure. 
The design is made more difficult by the need for a 
casing split at the horizontal center line for assem- 
bly. The horizontal flange and bolt design must be 
leakproof. Shell design problems lead to the use of 


small-diameter, high-speed turbines at high pressure, 
and the use of double shell construction (Fig. 4). 

Rotor buckets or blades. Turbine buckets must be 
strong enough to withstand high centrifugal, steam 
bending, and vibration forces. Buckets must be de- 
signed so that their resonant natural frequencies 
avoid the vibration stimulus frequencies of the steam 
forces, or are strong enough to withstand the vibra- 
tions. 

Sealing against leakage. It is necessary to minimize to 
the greatest possible extent the wasteful leakage of 
steam along the shaft both at the ends and between 
stages. The high peripheral velocities between the 
shaft and stationary members preclude the use of 
direct-contact seals. Seals in the form of labyrinths 
with thin, sharp teeth on at least one of the members 
are utilized. In normal operation these seals do not 
touch one another, but run at close clearance. In the 
case of accidental contact, the sharp teeth can rub 
away without distorting the shaft. 

Vibration and alignment. Shaft and bearings should be 
free of critical speeds in the turbine operating range. 
The shaft must be stable and remain in balance. 

Governing. Turbines usually have two governors, 
one to control speed and a second, emergency 
governor to limit possible destructive overspeed. 
The speed signal is usually mechanical or electrical. 
A power relay control, usually hydraulic, converts 
speed signals to steam valve position. Great reliabil- 
ity is required. See GOVERNOR. 

Lubrication. The turbine shaft runs at high surface 
speed; consequently its bearings must be continu- 
ously supplied with oil. At least two oil pumps, a 
main pump anda standby driven by a separate power 
source, are usually provided on all but the smallest 
machines. A common supply of oil is often shared 
between the governing hydraulic system and the lu- 
brication system. 

Aerodynamic design. The vane design for highest ef- 
ficiency, especially for the larger sizes of turbines, 
draws upon modern aerodynamic theory. Classic 
forms of impulse and reaction buckets merge in the 
three-dimensional design that is required by modern 
concepts of loss-free fluid flow. To meet the theoret- 
ical steam flow requirements, vane sections change 
in shape along the bucket. To minimize centrifu- 
gal forces on the vanes and their attachments, long 
turbine buckets are tapered toward their tips. See 
CARNOT CYCLE; HEAT BALANCE; STEAM-GENERATING 
UNIT; TURBINE. Frederick G. Baily 

Bibliography. E. A. Avallone and T. Baumeister III 
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tion, 3(3):8-13, 1967; H. P. Bloch, A Practical Guide 
to Steam Turbine Technology, 1996; R. H. Kehlhofer 
et al., Combined-Cycle Gas and Steam Turbine 
Power Plants, 2d ed., 1999; J. K. Salisbury, Steam 
Turbines and Their Cycles, 1950, reprint 1974; 
B. G. A. Strotzki, Steam turbines, Power, 106(6):S1- 
$40, 1962. 


E 
Steel 


Any of a great number of alloys that contain the 
element iron as the major component and small 
amounts of carbon as the major alloying element. 
These alloys are more properly referred to as car- 
bon steels. Small amounts, generally on the order 
of a few percent, of other elements such as man- 
ganese, silicon, chromium, molybdenum, and nickel 
may also be present in carbon steels. However, when 
large amounts of alloying elements are added to 
iron to achieve special properties, other designations 
are used to describe the alloys. For example, large 
amounts of chromium, over 12%, are added to pro- 
duce the important groups of alloys known as stain- 
less steels. 

Phases. There are three unique thermodynami- 
cally stable arrangements of iron and carbon atoms in 
carbon steels. The arrangements are crystalline and 
referred to as phases. Ferrite is the phase with a body- 
centered cubic unit cell, austenite is the phase with 
a face-centered cubic unit cell, and cementite is the 
phase with an orthorhombic unit cell. The phase or 
combination of phases that actually exists in a given 
steel depends on temperature and composition. A 
phase diagram is a type of map that shows where the 
different phases exist under equilibrium conditions. 
Equilibrium assumes that a steel has been held long 
enough at a temperature to ensure that the most sta- 
ble arrangement of atoms and phases has developed 
(Fig. 1). 

The phase diagram for the Fe-C system shows that 
austenite is stable only at high temperatures and that 
it is replaced at low temperatures by other phases. 
This characteristic of iron-carbon alloys is the fun- 
damental basis for the processing and great versatil- 
ity of steels. When a steel is heated in the austenite 
phase field, it becomes quite ductile, and heavy sec- 
tions can be hot-rolled or forged to smaller sizes and 
more complex shapes. On cooling after hot work, 
the austenite transforms to structures of higher 
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Fig. 1. Critical temperatures in plain carbon steels and 


constituents which are present in iron-carbon alloys upon 
slow cooling. 


strength. Another important use of the austenite 
phase field is the heat treatment of steels. By heat- 
ing a finished steel part into the austenite phase, that 
is, by heating above the critical temperatures iden- 
tified by A; and Acm on the phase diagram (Fig. 1), 
the microstructure existing at room temperature is 
replaced by austenite. The resulting austenite may 
then be cooled at various rates to form new arrange- 
ments of the phases with a variety of mechanical 
properties. Cooling at very high rates or quenching 
produces the hardest structure in steels, and is dis- 
cussed in the sections on medium- and high-carbon 
steels. See ALLOY STRUCTURES. 

Microstructure and properties. When steels are 
cooled slowly from the austenite phase field, the 
austenite transforms to microstructures of ferrite and 
cementite. In steels with carbon content below 0.8%, 
the austenite first transforms to ferrite, as indicated 
by the ferrite-plus-austenite field of the phase dia- 
gram for the Fe-C system (Fig. 1). Steels with carbon 
content above 0.8% first form cementite. At or below 
the A, temperature, any austenite not transformed to 
ferrite or cementite transforms to a mixture of fer- 
rite and cementite referred to as pearlite. The ferrite 
and cementite form as parallel plates, or lamellae, in 
pearlite. Thus, when a steel reaches room tempera- 
ture, its microstructure consists of either ferrite and 
pearlite or cementite and pearlite, depending on its 
carbon content (Fig. 1). 

Typical microstructures exist for steels cooled 
very slowly in a furnace, a process referred to as full 
annealing, or cooled in air, a process referred to as 
normalizing. The 0.06% carbon steel consists almost 
entirely of ferrite in the form of fine crystals or grains. 
The higher-carbon steels contain increasingly greater 
amounts of pearlite which, at the low magnifications 
(Fig. 2), appears dark because the lamellae are too 
close together to be resolvable. 

The mechanical properties of annealed, normal- 
ized, or hot-rolled steels are directly dependent on 
the amounts of pearlite present in the microstruc- 
ture. Cementite is a hard carbide phase and, when 
present in the pearlite, significantly strengthens a 
steel. The yield and tensile strengths increase with 
increasing carbon content or increasing pearlite con- 
tent of steels containing up to 0.8% carbon. However, 
as strength increases, ductility falls and the steels be- 
come less formable, as shown by the decrease in 
reduction of area and elongation (Fig. 3). 

The change in mechanical properties with car- 
bon content separates steels into low-, medium-, 
and high-carbon classifications that are related to 
the need for formability in manufacturing and 
strength and toughness for good performance in 
service. 

Low-carbon steels. These steels, sometimes re- 
ferred to as mild steels, usually contain less than 
0.25% carbon, and therefore have microstructures 
that consist mostly of ferrite and small amounts of 
pearlite. These steels are easily hot-worked in the 
austenite condition and are produced in large ton- 
nages for beams and other structural applications. 
The relatively low strength and high ductility Fig. 3) 
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Fig. 2. Micrographs illustrating appearance of the constituents shown in Fig. 1. (a) 0.06% carbon steel: white is ferrite, grain 
boundaries appear as a dark network. (b) 0.18% carbon steel: white is ferrite, dark is pearlite. (c) 0.47% carbon steel: same as 
b, but larger amount of pearlite in this higher-carbon steel. (d) 1.2% carbon steel: dark is pearlite, white grain boundaries are 


cementite. 
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Fig. 3. Effect of carbon content on the tensile properties of 
hot-worked carbon steels. 10° psi = 6.9 MPa; 2 in. = 5 cm. 


of the low-carbon steels make it possible also to 
cold-work these steels in the ferritic condition. Cold- 
rolled low-carbon steels are extensively used for 
sheet applications in the appliance and automotive 
industries. Cold-rolled steels have excellent surface 
finishes, and both hot- and cold-worked mild steels 
are readily welded. 

The low-carbon ferritic steels have undergone in- 
teresting developments in order to meet the chal- 
lenges of the energy crisis. Although mild steels have 
excellent formability, their low strength requires rel- 
atively thick sections for a given load-bearing applica- 
tion. New compositions of ferritic steels combined 
with new processing have significantly raised the 
strength of low-carbon steels, and therefore made 
possible significant reductions in vehicle weight by 
permitting the use of thinner sections. 

One approach to improved low-carbon ferritic 
steels has led to the development of a class of steels 
referred to as high-strength low-alloy (HSLA). HSLA 
steels are alloyed with very small amounts of strong 
carbide-forming elements, such as vanadium and nio- 
bium. Very fine carbides are formed when the mi- 
croalloyed steels are finish-hot-worked low in the 


austenite phase field. The alloy carbides limit the 
growth of the austenite grains, and therefore the size 
of the ferrite grains that form from the austenite. 
The resulting very fine ferrite grain size, together 
with the fine-alloy carbides, significantly increases 
strength. Special attention is also paid to inclusion 
content and shape in HSLA steels, and as a result of 
inclusion control and the very fine ferritic grain sizes, 
HSLA steels have excellent toughness for critical ap- 
plications. 

Another approach to producing high strength and 
good formability in low-carbon steel is to heat a steel 
with a ferrite-pearlite microstructure into the ferrite- 
plus-austenite phase field (Fig. 1). Such intercritically 
annealed steels are referred to as dual-phase steels. 
The intercritical annealing treatment converts the 
pearlite areas into pools of austenite and, by control- 
ling cooling rate or alloying the austenite, or both, 
transforms to structures other than pearlite on cool- 
ing. Good formability and high strength are achieved 
by the elimination of the pearlite and by the intro- 
duction of dislocations into the ferrite matrix sur- 
rounding the tranformed austenite. The dislocations 
are crystal defects that make possible the shaping of 
metals. 

Medium-carbon steels. The medium-carbon steels 
contain between 0.25 and 0.70% carbon, and are 
most frequently used in the heat-treated condition 
for machine components that require high strength 
and good fatigue resistance (resistance to cyclic 
stressing). The heat treating consists of austenitiz- 
ing, cooling at a high enough rate to produce a new 
phase, martensite, and then tempering at a temper- 
ature below the A. The martensite is essentially a 
supersaturated ferrite with carbon atoms trapped be- 
tween the iron atoms. This structure is very hard and 
strong. The hardness and strength of martensite in- 
crease rapidly with the carbon content of the steel, 
and the greatest benefits of forming martensite occur 
above 0.3% carbon content (Fig. 4). 
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Fig. 4. Relation of hardness and tensile strength to the 


carbon content in steels which are rapidly quenched from 
above critical temperature. 10° psi = 6.9 MPa. 


However, the same structural factors that make 
martensite very hard also make it very brittle. There- 
fore, martensitic steels are tempered to increase their 
toughness. The tempering essentially makes it possi- 
ble for the carbon atoms to diffuse from the marten- 
sitic structure to form independent carbide particles. 
Thus the microstructure changes to a mixture of fer- 
rite and carbides on tempering. The extent of this 
process is controlled primarily by temperature, and 
a large range of hardness and toughness combina- 
tions can be produced. 

Martensite can be formed only when austenite 
transformation to ferrite or pearlite is suppressed. 
The ferrite and pearlite phases require the diffusion 
of carbon, a time-dependent process, for their forma- 
tion. In plain-carbon steels the diffusion can be sup- 
pressed only by quenching or cooling very quickly in 
brine or water. Even when martensite is formed on 
the surface of a bar of plain-carbon steel, however, 
the center of the bar may cool too slowly to form 
martensite. 

Small amounts of alloying elements such as 
chromium, nickel, and molybdenum reduce the rate 
at which ferrite or pearlite form, and therefore make 
it possible to form martensite in heavier sections or 
at slower cooling rates. Slower cooling reduces the 
tendency to distortion or cracking that sometimes 
accompanies the high residual stresses introduced 
by severe quenching. Hardenability is the term 
used to define the ease of martensite formation 
relative to cooling rates, section sizes, and steel com- 
position. 

High-carbon steels. Steels containing more than 
0.7% carbon are in a special category because of 
their high hardness and low toughness. This 
combination of properties makes the high-carbon 
steels ideal for bearing applications where wear re- 
sistance is important and the compressive loading 
minimizes brittle fracture that might develop on ten- 
sile loading. 

Two types of microstructure are produced in 
high-carbon steels. One consists entirely of a pearlite 
with a very fine lamellar spacing produced in steels 
with about 0.80% carbon. Figure 1 shows that no 
ferrite or cementite forms above A, at this carbon 
content. An important application of pearlitic 0.8% 
carbon steels is in railroad rail. Another application in 
which pearlitic microstructure is important is high- 
strength wire for cables and wire rope. In a process 
referred to as patenting, rods of 0.8% carbon steel 
are transformed to very fine pearlite. The rods are 
then drawn to wire to produce an aligned pearlite 
with strengths up to 350 ksi (2415 MPa). 

The other type of microstructure produced in 
high-carbon steels is tempered martensite. The most 
common bearing steel contains 1% carbon and about 
1.5% chromium. This steel is oil-quenched to marten- 
site and tempered to retain as high a hardness as 
possible. Austenitizing is performed in the austenite- 
cementite phase field. As a result, the microstructure 
contains very fine cementite particles which not 
only contribute to wear resistance but also help to 
maintain a fine austenite grain size. See CAST IRON; 
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FERROALLOY; HEAT ‘TREATMENT (METALLURGY); 

IRON; IRON ALLOYS; METAL, MECHANICAL PROPER- 

TIES OF; STAINLESS STEEL; STEEL MANUFACTURE. 
George Krauss 

Mechanical properties. The mechanical properties 
of steel are determined from a uniaxial tension test 
in which a cylindrical bar of measured diameter is 
subjected to tension along its length while the mag- 
nitude of the force applied and the resulting elonga- 
tion are measured. The force, divided by the original 
cross-sectional area of the bar, is referred to as the 
stress, and the elongation of a specific portion of the 
bar, divided by the original length of that portion, is 
the strain. The strain is clearly dimensionless, but the 
stress has the units of force per unit area. Since the 
forces involved are large (in the American system, 
forces are expressed in kips, where 1 kip is 1000 Ib), 
stresses are expressed in kips/in.? (ksi). The SI unit 
of force is the newton, and 1 N/m? is the pascal (Pa). 
For the large stresses in steel, the usual unit is the 
megapascal, (MPa); 1 ksi = 6.895 MPa. A diagram 
plotted with stress as the ordinate and strain as the 
abscissa is referred to as a stress-strain diagram. For 
all steels with carbon generally making up less than 
2% (structural steels), the initial portion of such a dia- 
gram is linear. The slope of this straight-line portion, 
the ratio of stress to strain, is the modulus of elastic- 
ity; it is about 29,000 ksi (200,000 MPa). At this time, 
the test specimen is also elastic, recovering its origi- 
nal dimensions if the load is removed, with unloading 
following down the same straight line plotted during 
loading. See ELASTICITY; STRESS AND STRAIN; STRUC- 
TURAL STEEL. 

In the low-carbon and medium-carbon steels, lin- 
earity ceases abruptly in the yield zone, with a large 
increase in strain with little or no increase in load. 
The magnitude of the strain occurring during yield 
is 10-20 times as large as the strain at the end of the 
elastic region. 

At the conclusion of the yield, strain-hardening oc- 
curs, and additional load is once again required to 
produce additional deformation, but at a much lower 
ratio of stress to strain. Eventually, the load reaches a 
maximum value, the ultimate tensile strength, where 
the specimen begins to neck down, developing a 
severely reduced cross section at some point, and 
quickly elongates a great deal and fractures. The total 
elongation is an order of magnitude larger than that 
during yield, two orders of magnitude above the elas- 
tic strain. 

In the higher-carbon steels, there is generally no 
abrupt yield. The linear zone makes a transition to 
a curve going up to the ultimate strength and down 
to fracture. It is still important to mark the end of 
elastic behavior, and therefore a yield is defined for 
these steels, either in terms of a measured amount of 
strain (typically 0.005) or as an amount of inelastic, 
nonlinear strain, found as the point where the stress- 
strain curve intersects a straight line having the same 
slope as the initial portion but with an offset (typi- 
cally 0.002). 

The behavior of steel in compression is similar, 
provided that buckling of the specimen is prevented. 


Of course, there is no neck-down or fracture, the 
specimen simply being squeezed down and short- 
ened, and flowing out. 

Commercially available steels have yield points 
ranging 33-100 ksi (230-700 MPa) and ultimate ten- 
sile strengths up to 130 ksi (900 MPa), although build- 
ing codes and specifications generally restrict oper- 
ating stress levels to values below the yield. 

G. Donald Brandt 
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E) 
Steel manufacture 


A sequence of operations in which pig iron and scrap 
steel are processed to remove impurities and are sep- 
arated into the refined metal and slag. 


Ironmaking 


Ironmaking, one of the earliest of all metallurgical 
operations, is the initial step in most steel-producing 
operations. The blast furnace process for ironmaking 
came into being with the stone-built furnaces of the 
Middle Ages. Since then, ironmaking furnaces have 
evolved into towering structures that dominate the 
integrated steel mill. 

Blast furnace. The iron blast furnace process con- 
sists of charging iron ore, coke, and limestone flux 
into the top of a shaft furnace and blowing heated 
air or blast into the bottom. This process yields a 
high-carbon iron containing silicon and manganese 
which is converted to steel in steelmaking furnaces. 

The reactions occurring in iron- and steelmak- 
ing primarily involve adjustment of the oxygen 
potential— first, to decrease the oxygen potential 
and reduce the iron ore to an impure pig iron, and 
then to increase the oxygen potential and oxidize 
impurities from that iron in the steelmaking opera- 
tion. In the iron blast furnace, the oxygen potential 
is extremely low in the hearth zone and continually 
increases as the reducing gases move up the furnace, 
reacting with iron oxide and coke. 

Direct reduced iron. Another source of iron units 
for production of steel, in addition to pig iron and 
scrap, is direct-reduced iron. Metallic iron is pro- 
duced from iron ores by removing most of the oxy- 
gen at temperatures below the melting points of ma- 
terials in the process. Direct-reduced iron can be 
refined to steel or used to increase productivity of 
the blast furnace or other melting processes. Sev- 
eral processes have been developed to contact re- 
ducing agents with iron oxides, including fluidized 
beds, moving-bed shaft furnaces, fixed-bed retorts, 
and rotary-kiln processes. 

Direct-reduction processes represent a small 


fraction of iron production, but are being adopted 
where suitable ores and reducing agents, particu- 
larly natural gas, are available. Direct-reduced iron is 
very suitable as a charge material in steelmaking pro- 
cesses, and has special advantages in electric furnace 
steelmaking. See FLUIDIZATION; IRON METALLURGY; 
PRESSURIZED BLAST FURNACE. 


Steelmaking Processes 


Reduction of iron ores by carbonaceous fuel directly 
toa steel composition was practiced in ancient times, 
but liquid processing was unknown until develop- 
ment of the crucible process, in which iron ore, 
coal, and flux materials were melted in a crucible 
to produce small quantities of liquid steel. Modern 
steelmaking processes began with the invention of 
the airblown converter by H. Bessemer in 1856. The 
Thomas process was developed in 1878; it modified 
the Bessemer process to permit treatment of high- 
phosphorus pig iron. The Siemens-Martin process, 
also known as the open-hearth process, was devel- 
oped at about the same time. The open-hearth pro- 
cess utilizes regenerative heat transfer to preheat air 
used with a burner; it can generate sufficient heat 
to refine solid steel scrap and pig iron in a reverber- 
atory furnace. After World War II, various oxygen 
steelmaking processes were developed. 

Steelmaking can be divided into acid and basic 
processes depending upon whether the slag is high 
in silica (acid) or high in lime (basic). The furnace 
lining in contact with the slag should be a compati- 
ble material. A silica or siliceous material is used in 
acid processes, and a basic material such as burned 
dolomite or magnesite is used in basic processes. 
Carbon, manganese, and silicon, the principal impu- 
rities in pig iron, are easily oxidized and separated; 
the manganese and silicon oxides go into the slag, 
and the carbon is removed as carbon monoxide and 
carbon dioxide in the off-gases. Phosphorus is also 
oxidized but does not separate from the metal unless 
the slag is basic. Removal of sulfur occurs to some 
extent by absorption in a basic slag. Thus, the basic 
steelmaking processes are more versatile in terms of 
the raw materials they can handle, and have become 
the predominant steelmaking processes. 

A typical pig iron charged to the steelmaking pro- 
cess might contain roughly 4% carbon, 1% man- 
ganese, and 1% silicon. The phosphorus and sulfur 
levels in the pig iron vary. The composition of the 
steel tapped from the steelmaking furnace generally 
ranges from 0.04 to 0.80% carbon, 0.06 to 0.30% man- 
ganese, 0.01 to 0.05% phosphorus, and 0.01 to 0.05% 
sulfur, with negligible amounts of silicon. The chem- 
istry of steelmaking can be represented as reactions 
(1)-G), which are treated in terms of oxidation by 
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liquid iron oxide in the slag. Several alternative mech- 
anisms are considered possible, such as direct oxida- 
tion by gaseous oxygen, reaction (6), or alternatively, 


C +1/200(g) = COlg) (6) 


the oxygen may dissolve in the liquid iron and react 
according to reaction (7). The removal of phospho- 


C+0=CO O) 


rus from the bath requires a basic oxidizing slag as 
illustrated by reaction (4). The actual form of the 
phosphorus in the slag is thought to be as tricalcium 
or tetracalcium phosphate, although evidence is not 
entirely clear in this regard. Sulfur is absorbed by a 
basic slag, although it may or may not form calcium 
sulfide as indicated by reaction (5). 

Bessemer converter processes. The original Besse- 
mer converter employed an acid lining. This acid 
process is no longer used extensively. The process 
requires a very low phosphorus- and sulfur-bearing 
pig iron since these elements are not removed in 
the process. A cross section of a Bessemer converter 
is shown in Fig. 1. Oxidation of the charge is ac- 
complished by blowing air through a wind box on 
the bottom of the vessel. Flements are eliminated 
essentially in the same order as the stability of the 
oxide, that is, silicon is eliminated first, along with 
manganese, and finally carbon. The Bessemer pro- 
cess is autogenous—that is, no additional heat is sup- 
plied to the process—and the oxidation of the met- 
alloids; particularly silicon, is sufficient to keep the 
charge molten and raise the temperature as purifica- 
tion proceeds so that a low-carbon steel at a temper- 
ature near 2900°F (1600°C) is produced at the end 
of the process. A small amount of cold scrap can be 
charged. 

The Thomas or basic Bessemer process, used more 
extensively in Europe, is carried out in a basic lined 
vessel, and utilizes lime or limestone as a charge 
material to form a basic slag. Pig iron charged to 
this process contains up to 1 wt % phosphorus, 
which is oxidized from the bath after most of the car- 
bon has been removed in a so-called afterblow. The 
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Fig. 1. Operating positions of a Bessemer converter. 

(a) Charging and pouring. (b) Blowing. (After R. D. Pehlke, 
Unit Processes of Extractive Metallurgy, p. 89, American 
Elsevier, 1973) 
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Fig. 2. Diagram of the principal parts of an open-hearth furnace. The arrows indicate the direction of the flow of preheated 
air, flame, and waste gases when a liquid fuel is fired through a burner in the trench at the right end of the furnace. The five 
charging doors shown are in the front wall of the furnace, and the checker chambers extend under the charging floor, which 
is not shown. When reversal of firing takes place, the function of the uptake is reversed, producing the downtake. 

(After H. E. McGannon, ed., The Making, Shaping and Treating of Steel, United States Steel Corporation, 8th ed., 


p. 471, 1964) 


oxidation of the phosphorus is exothermic and adds 
considerable thermal energy to the process. 

One of the disadvantages of Bessemer steelmaking 
is the resulting higher phosphorus and nitrogen con- 
tents in the steel. The nitrogen content of a Bessemer 
steel is approximately 0.015 wt %, although lower ni- 
trogen contents can be achieved by careful control of 
temperature and by carrying out the final oxidation 
of the bath with iron ore additions. 

An important aspect of converter steelmaking is 
control, because of the rapidity with which the end 
point of the process is approached, and the necessity 
for providing a careful thermal balance for the auto- 
genous process. End-point control has been greatly 
enhanced by the use of photocells to observe the 
character of the flame and emission from the vessel 
toward the end of the blow. 

Open-hearth process. From the turn of the century 
through the late 1950s, the basic open-hearth pro- 
cess was the workhorse of the American industry, 
accounting for approximately 85% of the steel pro- 
duced in the United States in 1955. Because of its 


melting capabilities, the basic open-hearth process 
can handle a wide range of pig iron and scrap ratios, 
the optimum operation being a metallic charge of 
40 to 60% molten pig iron with the balance as steel 
scrap. 

A typical open-hearth furnace is shown in Fig. 2. 
In an effort to obtain a high flame temperature and 
good thermal efficiency, the open hearth employs a 
regenerative heat transfer system. By reversing the di- 
rection of firing in the furnace, hot combustion gases 
pass out of the furnace through a set of brick check- 
ers which recover heat from the expanded combus- 
tion gases. The incoming air is passed through the 
hot checker work on the opposite end of the fur- 
nace before being reacted with fuel at the burner. 
This technique for preheating the combustion air 
provides fuel economy. 

In a basic operation, a wide range of raw mate- 
rials can be treated, and the phosphorus and sul- 
fur concentrations are kept under suitable control. 
An advantage of the open hearth over the Bessemer 
process is that the steels produced contain about 


0.005 wt % nitrogen, which is low enough to provide 
good properties in deep-drawing sheet material. 

The solid constituents—steel scrap, limestone, 
and iron ore—are charged to the furnace, and the 
burner turned on. Burners are built into both ends of 
the furnace, and the direction of the flame is changed 
from time to time to take advantage of the regenera- 
tive heat transfer system for the air which is located 
on either side of the furnace at a lower level. As soon 
as the charge starts to melt down, pig iron is added 
and the iron oxide reacts with the molten pig iron 
to produce carbon monoxide in a so-called ore boil. 
As the temperature increases, the decomposition of 
limestone at the bottom of the furnace occurs and a 
lime boil starts, with carbon dioxide being evolved 
at a relatively rapid rate. As the calcined lime floats 
up through the charge, phosphorus is removed as 
well as some sulfur. The productivity of the open- 
hearth furnace had been limited by both charging 
time and the extensive melting and refining periods, 
requiring approximately 6 to 8 h for melting and re- 
fining alone. The availability of low-cost oxygen has 
permitted oxygen lances to be inserted through the 
roof of open-hearth furnaces, which decreases the 
time required to process a heat (single batch of steel) 
from about 12-14 h to approximately 6 h. Oxygen 
blowing, however, creates fumes and requires gas- 
cleaning equipment. This limitation on productivity 
and the added expense of gas-cleaning equipment in 
an older installation have led to rapid displacement of 
the open-hearth furnace by the oxygen top-blowing 
process. 

Oxygen steelmaking. The high dissolved nitrogen 
content in Bessemer steel limits its application, and 
a nitrogen-free gas for blowing in the converter is 
needed. The use of pure oxygen in a bottom-blown 
vessel will result in extremely severe erosion of the 
bottom lining and tuyere area. An alternative is to 
blow a Bessemer converter with oxygen and steam 
to produce low-nitrogen steel. This process has re- 
ceived considerable attention in Europe. 

In the early 1950s, commercial development of 
the oxygen steelmaking process was achieved at 
Linz, Austria. Since Austrian ores are low in phos- 
phorus, they cannot support the basic Bessemer pro- 
cess thermally; furthermore, a shortage of scrap in 
Austria following World War II eliminated economic 
application of the open hearth. These factors along 
with availability of oxygen at much lower cost in ton- 
nage quantities led to development of the top-blown 
oxygen process, which is based on experience with 
a small vessel at Gerlafingen, Switzerland. The first 
large-scale oxygen steelmaking converters were 30- 
ton vessels. Further development of the process, par- 
ticularly in the United States, has increased the size 
of the vessel until most oxygen steelmaking shops 
employ 250-ton vessels, with some vessels as large 
as 400 tons in operation. 

A schematic cross section of an oxygen steelmak- 
ing vessel is shown in Fig. 3. Figure 4 shows the 
oxygen jetting of a 300-ton heat of steel. The process 
is autogenous and uses about 25 to 35% scrap, the re- 
mainder being basic pig iron which is relatively low 
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Fig. 3. Schematic section of oxygen steelmaking vessel. 


in sulfur. Burned lime is used as a flux, with enough 
added to produce a lime-silica ratio of around 3 to 1 in 
the final slag. Oxygen is blown down vertically into 
the bath through a Laval nozzle, which produces su- 
personic flow. With larger vessels, the use of a three- 
hole lance has improved operation. 

Copious fumes in the form of iron oxide particles 
around a micrometer in size are evolved during the 
oxygen blow. This fume loss amounts to 2 to 3% of 
the metallic charge, and gas-cleaning equipment rep- 
resents a large portion of the capital expenditure for 
a basic oxygen furnace plant. Significant operating 
variables include the lance height and oxygen flow 
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Fig. 4. Large oxygen steelmaking furnace in operation. (Bethlehem Steel Corporation) 
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rate. The required blowing time for a heat, includ- 
ing those approaching 250 tons, is approximately 
20 min. By suitable adjustment of the lance height, 
some iron can be oxidized early in the blow to dis- 
solve the lime and produce an early basic oxidizing 
slag, which assists in phosphorus removal. Later in 
the heat, lowering the lance or increasing the oxygen 
flow rate provides the deep penetration necessary to 
agitate metal and slag and provide for relatively low 
iron losses in the slag. 

The processing of high phosphorus irons in the 
top-blown oxygen steelmaking process can be ac- 
complished by injecting powdered lime into the oxy- 
gen stream. This technique, known as the LD-AC pro- 
cess, is used in Europe to process pig iron containing 
more than 0.5 wt % phosphorus. 

Oxygen converter gas recovery. Many oxygen steel- 
making vessels have open hoods into which air is 
ingested to burn the hot evolved converter gases, 
which contain about 90% carbon monoxide during 
much of the oxygen-blowing period. Considerable 
effort has been made, particularly in Japan, to re- 
cover the converter gas, which averages over 80% 
carbon monoxide, for use as a fuel in the steel plant. 
The use of a closed hood and special precautions to 
prevent explosion of the carbon monoxide has re- 
sulted in an economical process for recovering the 
thermal energy in the carbon monoxide. In most 
hoods, a waste heat boiler is employed to recover the 
sensible heat of the stack gas; but a marked improve- 
ment in thermal efficiency is achieved by collecting 
the carbon monoxide gas for use as a supplementary 
fuel in the steel plant. Furthermore, the gas-cleaning 
problems are decreased because the volume of gas 
is only approximately one-sixth as great as when air 
is ingested in an open-hood system. 

Submerged injection processes. Submerged injec- 
tion processes were developed first for several Euro- 
pean plants, in which bottom-blown Thomas (basic 
Bessemer) vessels were converted to use oxygen 
and hydrogen-bearing gases such as hydrocarbons, 
steam, or natural gas. In France, G. Savard and R. 
Lee in 1966 patented a dual tuyere with which oxy- 
gen could be injected through an inner tuyere and a 
gaseous hydrocarbon injected through the surround- 
ing annular space formed by the central pipe and the 
surrounding outer pipe. This arrangement, utilizing 
the effect of cracking the hydrocarbon to cool the 
surrounding refractory, permitted bottom blowing 
of oxygen in steelmaking converters. 

Several conversions of Thomas converter plants 
to the bottom-blown oxygen process were made by 
using natural gas in the outer surrounding tuyere. 
These steelmaking vessels varied in size up to 
50 tons capacity; in 1973 the scale-up to 200-ton 
vessels was accomplished by the United States Steel 
Corporation. 

The bottom-blown oxygen steelmaking process 
utilizes a converter with a replaceable bottom con- 
taining dual tuyeres with oxygen introduced through 
the central outlet. The number of tuyeres increases 
with the size of the furnace. Tuyere placement is 
critical in order to take advantage of the stirring 


capability of bottom injection, particularly under 
conditions where oxygen flow rates are decreased. 
Replacement of the bottom refractory plug for 
the vessel, which contains the tuyeres, is required 
only as often as the replacement of the remaining 
vessel refractory lining, typically after more than 
1000 heats. 

The absence of a lance in the bottom-blowing pro- 
cess has offered opportunities to decrease total plant 
height and has allowed relatively easy conversion of 
older open-hearth buildings to oxygen steelmaking 
shops. 

Simultaneous top and bottom blowing has been 
explored to combine the advantages of each type 
of oxygen steelmaking. The joint effects of simul- 
taneous top and bottom blowing of oxygen and 
mixed gases have resulted in higher metallic yield, 
improved manganese recovery, better control of oxy- 
gen content, and lower oxygen concentrations, as 
well as improved dephosphorization. 

Electric furnace process. Electric arc furnace tech- 
nology began late in the nineteenth century with the 
original design of P. L. T. Heroult. The three-graphite 
electrode furnace with a swinging roof for top charg- 
ing and a rocker base for tilting to tap the finished 
molten steel has been continuously improved and 
developed further. See ELECTRIC FURNACE. 

In its early form, the electric arc furnace had a 
small capacity and was used only for high-alloy and 
stainless steels, and for 5- or 10-ton heats of various 
steels cast into sand molds to produce shaped cast- 
ings in foundries. Modern direct-arc furnaces rang- 
ing in size up to 400 tons are used to produce all 
types of steel. The electric furnace may be either 
acid- or basic-lined (Fig. 5). An advantage over the 
open-hearth and oxygen converter processes is that 
very high temperatures can be produced. The high 
temperature and the flexibility of using a 100% solid 
charge, or up to 60% molten pig iron, provides con- 
siderable versatility for this melting unit. The very 
high temperatures are particularly useful in produc- 
ing alloy steels. If large amounts of alloying elements 
are added to a ladle from an oxygen converter or 
an open-hearth furnace, the steel is often cooled to 
the extent that good-quality steel cannot be poured, 
or if the alloying elements are added in the furnace, 
they may be oxidized into the slag. This is particu- 
larly true for elements such as niobium (columbium), 
vanadium, and chromium. Furthermore, the require- 
ment in certain types of stainless steel to provide 
for very low carbon contents requires that the car- 
bon be removed without excessive chromium losses. 
This is possible at high temperatures such as can be 
achieved in the electric arc furnace. 

The rapid development of steelmaking technology 
using the electric arc furnace, not only for alloy and 
stainless steels but especially for carbon steel pro- 
duction, has increased its share of production capac- 
ity to about 20% of the steel industry. This increased 
capacity reflects the ascension of the “mini-mill” as 
a major steel supplier. The size of the mini-mill also 
has dramatically increased to the point where this 
label is not descriptive of this type of steel plant, 
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Fig. 5. Schematic cross section of a Heroult electric arc furnace with a flat-bottomed shell and stadium-type subhearth 
construction, indicating typical refractories employed in (a) an acid lining and (b) a basic lining. Only two of the three 
electrodes are shown in this section. (After H. E. McGannon, ed., The Making, Shaping and Treating of Steel, 8th ed., United 


States Steel Corporation, 1964) 


several such mills being located within large inte- 
grated plants and having two or more 200-ton elec- 
tric arc furnaces. 

The modern mini-mill includes one or more elec- 
tric furnaces with high-power energy sources; a 100- 
ton electric furnace may have a 75-MW transformer. 
These furnaces are water-cooled, at least in the upper 
wall where cooled panels substitute for refractory 
above the slag line. Water-cooled roofs are in use, 
and water-cooled electrode holders are being devel- 
oped to provide reduced electrode consumption. 

The product line of the mini-mill includes special- 
quality bars and rods, as well as reinforcing bars 
and small structural shapes. Some mills are pro- 
ducing plate from electric furnace remelting op- 
erations. This technology, which may depend on 
direct-reduced iron in the future, is important in steel 
production. 

Special processes. Remelting and refining of spe- 
cial alloys are carried out in duplex or secondary pro- 
cesses; the principal ones are argon-oxygen decar- 
burization, electroslag refining, vacuum arc remelt- 
ing, and vacuum induction melting. 

In the argon-oxygen decarburization process, a liq- 
uid metal is refined in a refractory-lined vessel by 
injection of gas mixtures, in particular of varying 
argon-oxygen mixtures, through tuyeres in the side 
or bottom of the unit. Usually the liquid metal is 
provided from an electric arc furnace, or in smaller 
installations from an induction furnace. The argon- 
oxygen decarburization process was developed orig- 
inally for the production of stainless steels, but has 
been modified for the production of many other 


grades. The argon dilution of the gases in the process 
provides excellent degassing characteristics which 
allow achievement of low carbon contents in the liq- 
uid metal without chromium losses by oxidation to 
Cr20; into the slag; also hydrogen is removed from 
crack-sensitive grades of steel. Argon-oxygen decar- 
burization provides good slag-metal mixing for re- 
moval of sulfur and separation of nonmetallic inclu- 
sions. 

In the electroslag refining process, an impure elec- 
trode prepared from the material to be refined is 
remelted in a slag which is resistively heated. An 
electric current passed through the slag melts the 
electrode. Molten droplets of metal form on the tip 
of the electrode and are refined as they fall through 
the slag, forming an ingot in a water-cooled mold. 
See ELECTROMETALLURGY. 

In the vacuum arc remelting process, a prepared 
electrode is arc-melted in a vacuum. The droplets 
of metal from the electrode are exposed to the vac- 
uum and are solidified in a water-cooled mold. The 
process provides refining of volatile constituents and 
removal of dissolved gases, and promotes chemical 
homogeneity and improved structure. 

Control of temperature, pressure, and stirring can 
be exercised over wide ranges and for extended peri- 
ods of time in the vacuum induction melting process, 
allowing considerable flexibility for refining of an 
alloy. Vacuum induction melting is applied to small 
melts, as well as large ones ranging up to 60 tons 
(54 metric tons). The molds are usually enclosed in 
an adjoining vacuum chamber connected by large- 
scale access doors and valves; this minimizes the time 
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required to melt and prepare molds, since they can 
be done independently. Vacuum induction melting 
systems, essentially the same as the conventional in- 
duction melting systems, utilize high-frequency cur- 
rent in a water-cooled coil surrounding the crucible 
containing the melt; however, the processing is con- 
ducted under low vacuum pressures. See VACUUM 
METALLURGY. 

Ladle metallurgy. Ladle metallurgy was used first to 
produce high-quality steels, but has been extended 
to producing many grades of steel because of the eco- 
nomic advantages of higher productivity. The pur- 
pose of these ladle treatments is to produce clean 
steel; introduce reactive additions, such as calcium 
or rare earths; add alloying additions, as for microal- 
loyed steels, with high recovery; and increase fur- 
nace utilization, allowing higher-productivity smelt- 
ing operations of the blast furnace, and melting and 
refining operations in steelmaking, followed by final 
ladle treatment of the steel to achieve controlled low 
levels of contained inclusions. Ladle treatments in 
steel production generally are classified as synthetic 
slag systems; gas stirring or purging; direct immer- 
sion of reactants, such as rare earths; lance injection 
of reactants; and wire feeding of reactants. These are 
often used in combination to produce synergistic ef- 
fects, for example, synthetic slag and gas stirring for 
desulfurization followed by direct immersion, injec- 
tion, or wire feeding for inclusion shape control. 

A major aspect of steel development since the 
mid-1970s has been the effort directed at the im- 
provement of physical and mechanical properties 
through steel cleanliness and reduced anisotropy, 
which are largely dependent upon the amount and 
form of oxygen and sulfur in the steel. Such im- 
proved properties are required by steels used for 
thin-sheet materials, heavy plate, high-strength low- 
alloy CHSLA), and for linepipe. Thin sheet steels re- 
quire good formability; heavy plate requires guar- 
anteed through-thickness properties; and linepipe 
steels require high-transverse-impact energy values. 
Removal of reaction products is most critical during 
the course of deoxidation of molten steel. Nonmetal- 
lic inclusions in the form of oxides of silicon, man- 
ganese, and aluminum may adversely influence final 
properties. 

Sulfur in steel has always presented a problem 
in steelmaking. Improved surface quality can be 
achieved by lowering sulfur levels to 0.015% or 
below. For example, the achievement of low sul- 
fur contents with two-slag practices in electric fur- 
naces presents operating difficulties and production 
delays. 

In the 1950s steel was produced primarily in the 
open-hearth furnace, where sulfur levels were nor- 
mally in the range from 0.03 to 0.04%. Sulfur contents 
of 0.02 to 0.03% could be achieved with lower sulfur 
fuels and oxygen roof lances, permitting shorter heat 
times and improved mixing of metal and slag. The 
basic oxygen steelmaking process brought about 
an even further improvement within the range of 
0.015-0.025% sulfur, and with the bottom-blown 
oxygen converter Q-BOP (Quelle-Basic oxygen pro- 


cess), sulfur contents in the 0.01-0.02% range can be 
achieved. However, even lower sulfur contents are 
required. 

In killed steels with low oxygen content, such as 
when aluminum is used for deoxidation and grain 
size control, sulfur combines with manganese as 
highly deformable manganese sulfides. These man- 
ganese sulfides have a low melting point and, as the 
last liquid to solidify in the steel, collect as films 
at grain boundaries. During hot rolling, the man- 
ganese sulfides are plastically deformed into elon- 
gated stringers extending parallel to the rolling di- 
rection. This shape and distribution of sulfides can 
have a marked effect on the directional properties of 
steel products. 

Two methods can be utilized to minimize the di- 
rectionality problem. One is to reduce the sulfur con- 
tent in the steel product to a minimum, lowering the 
total volume of manganese sulfide inclusions. This 
can be accomplished by decreasing the total sulfur 
input, treating the hot metal to remove sulfur, and 
controlling the scrap charge; by adjusting melting 
practices, including the use of additional quantities 
of flux; and by direct treatment of the liquid steel 
with desulfurizing agents, such as calcium or mag- 
nesium. A second approach involves modifying the 
shape or morphology of sulfide inclusions by pro- 
ducing a relatively nondeformable complex sulfide 
or oxysulfide by the addition of calcium or rare-earth 
metals. 

Reoxidation and inclusion minimization. Since there 
is a strong affinity between oxygen and liquid steel, 
and the necessity to restrict pickup of oxygen during 
teeming operations, several steps to control reoxida- 
tion are advisable. The first is the protection of the 
steel pouring stream, whether from ladle to ingot 
mold or tundish, or from tundish to continuous cast- 
ing mold. The separation of reoxidation products, or 
deoxidation products formed during cooling of the 
steel in the teeming operation, can be promoted. 
In particular, in continuous casting the appropriate 
design of tundish in terms of residence time of the 
steel in the tundish and the appropriate fluid flow 
patterns can enhance separation of oxides from the 
liquid steel. See METAL CASTING; PYROMETALLURGY; 
REFRACTORY; STAINLESS STEEL; STEEL. 

Robert D. Pehlke 
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E) 
Stellar evolution 
The large-scale, systematic, and irreversible changes 
with time of the structure and composition of a 
star. 

Types of stars. Dozens of different kinds of stars 
populate the Milky Way Galaxy. The most common 


are main-sequence dwarfs like the Sun that fuse hy- 
drogen into helium within their cores (the core of the 
Sun occupies about half the mass). Dwarfs run the 
full gamut of stellar masses, from perhaps as much 
as 120 solar masses (120 M.) down to the minimum 
of 0.075 solar mass (beneath which the full proton- 
proton chain does not operate). They occupy the 
spectral sequence from class O (maximum effective 
temperature 50,000 K or 90,000°E maximum lumi- 
nosity 5 x 10° solar), through classes B, A, E G, 
K, and M, to the new class L (2000 K or 3100°F 
and under, minimum luminosity 4 x 10~> solar). 
Within the main sequence, they break into two 
broad groups, those under 1.3 solar masses (class F5) 
whose luminosities derive from the proton-proton 
chain, and higher-mass stars that are supported 
principally by the carbon cycle. Below the end of 
the main sequence (masses less than 0.075 Mo) lie, 
the brown dwarfs that occupy half of class L and all 
of class T (the latter around 1000 K or 1300°F). These 
shine both from gravitational energy and from fusion 
of their natural deuterium. Their low-mass limit is 
unknown. See CARBON-NITROGEN-OXYGEN CYCLES; 
PROTON-PROTON CHAIN; SPECTRAL TYPE; STAR. 

Scattered among the dwarfs are different kinds 
of stars distinguishable (through comparison with 
the dwarfs) by their sizes and luminosities: bright gi- 
ants (with maximum radii equaling that of the inner 
solar system), brilliant supergiants (maximum radii 
comparable to the orbits of Jupiter and Saturn), sub- 
giants (which fall between giants and dwarfs), and 
dim white dwarfs (the size of Earth). All these clas- 
sifications may be broken into subclasses. The list 
also includes the central stars of planetary nebulae, 
double stars, novae, supernovae, neutron stars, and 
black holes. The study of stellar evolution seeks to 
determine where they all came from. Do they stand 
alone, or are they somehow connected? If the latter, 
what kinds of stars precede and succeed what other 
kinds? 

Stellar models. Stars age so slowly that it is 
impossible—except under rare circumstances—to 
see the transformations take place. Instead, stellar 
theorists, using the laws of physics, build numerical 
models of stars and then age them. The procedure 
is in principle simple, in practice difficult. At the 
start, a star is in hydrostatic equilibrium in which 
each layer supports the layers above it (the pres- 
sure difference across a layer equaling the weight of 
the layer), and is internally supported by thermonu- 
clear fusion. The relation between pressure, density, 
and temperature is given by the perfect gas law, P = 
NkT, where P is pressure, N is the number density 
of atomic particles, k is Boltzmann’s constant, and 
T is temperature. As fusion proceeds, and the rate 
of energy generation and the internal chemical com- 
position change, the structure of the star changes as 
well. Stepwise calculations that predict luminosity, 
radius, and effective temperature then allow the star 
to be followed as it “moves” (changes effective tem- 
perature and luminosity) on the Hertzsprung-Russell 
(HR) diagram. When appropriate mass loss is taken 
into account, most of the different kinds of stars fall 
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into place. See GAS; HERTZSPRUNG-RUSSELL DIAGRAM; 
HYDROSTATICS; THERMODYNAMIC PROCESSES. 

Star formation. Stars are born from compact knots 
within dark molecular clouds that are refrigerated 
by dust that blocks heating starlight. If the random 
knots, compressed by supernovae or other means, 
are dense enough, they can contract under their 
own gravity. Conservation of angular momentum de- 
mands that as they collapse they must spin faster. 
Star formation requires that angular momentum be 
removed such that the new “young stellar objects” 
do not completely tear themselves apart before they 
can become fully developing protostars. See ANGU- 
LAR MOMENTUM; MOLECULAR CLOUD. 

High-speed particles (cosmic rays) from exploding 
stars partially ionize the dusty knots. The ions grab 
onto the weak magnetic field of the Galaxy and, as a 
result of their physical interaction with neutral atoms 
and molecules, provide the initial means to slow the 
rotation. If the rotation is still too fast, the contract- 
ing body may split into a double (or more complex) 
star, though the origins of doubles are not clearly 
solved. A contracting protostar still indeed rotates 
progressively faster until the part of its mass not ac- 
creted by the star itself is spun out into a dusty disk, 
from which planets might later accumulate. From 
the disk shoot powerful molecular flows that slow 
the star still more (Fig. 1). See PROTOSTAR. 

When the protostar’s interior reaches about 10° K 
(1.8 x 10° °F), it can fuse its internal deuterium. That 
and convection, which brings in fresh deuterium 
from outside the nuclear-burning zone, bring some 
stability, and a star can now be said to be born. 
Stars like the Sun shrink at constant temperature 
until deuterium fusion dies down. Then they heat 
at roughly constant luminosity until the full proton- 
proton chain begins, which provides the stars’ lu- 
minosity and stops the contraction. The stars settle 
onto the zero-age main sequence (from which they 





Fig. 1. Focused, even created, by a circumstellar disk, a jet 
pours from the youthful star Herbig-Haro 30 (HH 30). 

(C. Burrows, Space Telescope Science Institute; WPFC2 
Science Team; NASA) 
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Fig. 2. Hertzsprung-Russell diagram of star formation. Stars begin to stabilize at the 
birthline and then descend on evolutionary tracks as T Tauri stars (dots) toward the 
zero-age main sequence. (After J. B. Kaler, Cosmic Clouds, W. H. Freeman, 1997, from 
work of S. W. Stahler and I. Iben, Jr.) 


will later evolve). At the same time, the surrounding 
dusty cloud is clearing, allowing new, accreting, and 
highly active T Tauri stars to be seen flocking around 
their birthclouds (Fig. 2). The whole process takes 
only 10 or so million years, the mature stars then 
leaving their birthplaces, destined both to age and to 
orbit the Galaxy. High-mass stars proceed similarly, 
but at such a great pace that the death process be- 
gins even as the birth process is ending. See T TAURI 
STAR. 

Early evolution. The main sequence is that zone on 
the HR diagram in which stars are stabilized against 
gravitational contraction by fusion. The higher the 
stellar mass, the greater the internal compression 
and temperature, and the more luminous the star. 
Hydrogen fusion is highly sensitive to temperature, a 
small increase in stellar mass meaning a much higher 
fusion rate. Although greater mass means a greater 
nuclear-burning core mass and therefore a larger fuel 
supply, the increased fusion rate is more than offset- 
ting and thereby shortens stellar life. A star’s fuel 
supply is proportional to mass (M), while the rate 
of fusion is expressible by the luminosity (Z). The 
average mass-luminosity relation found from binary 
stars is L x M>°. Ifa star’s lifetime tis proportional to 
(fuel supply)/(rate of use), then t «x M/L « M/M>° = 
1/M?2>, While the new Sun was destined to survive 


on the main sequence for 10!° years, a 0.1-solar- 
mass star will live there for 10'* years, while a 
100-solar-mass star will exhaust its core hydrogen in 
only 2.5 x 10° years. 

Main-sequence life, while stable, is not altogether 
quiet. Even there stars change and evolve. As hydro- 
gen “burns” to helium, four particles (protons) are 
converted to one (a helium nucleus). The pressure 
of a gas depends on the number of particles per unit 
volume, not on their kind. The result is a slow shrink- 
age of the core, which increases the temperature, 
raises the fusion rate, and causes the core to eat into 
the surrounding hydrogen envelope (incorporating 
fresh fuel). As a result, main-sequence dwarfs slowly 
brighten and eventually expand and cool some at 
their surfaces. (The Sun will more than double in 
brightness, dooming life on Earth long before core 
hydrogen is gone.) As a result, the main sequence 
spreads itself into a band toward the cool side (to 
the right) of the zero-age main sequence, the band 
widening toward the top. 

Clusters of stars are born together in both space 
and time, most (presumably) with fully intact main 
sequences. As a cluster ages, stars peel off the main 
sequence (to become giants and supergiants) from 
the top down. A star cluster can be dated by where 
its main sequence ends. Open clusters, which oc- 
cupy the Galaxy’s disk, range from just-forming to 
about 10!° years old, which gives the disk’s age. 
The globular clusters of the Galaxy’s halo, how- 
ever, have burned their main sequences down to 
around 0.8 solar. This process takes roughly 12 x 10? 
years, which must be close to the age of the Galaxy. 
See MILKY WAY GALAXY; STAR CLUSTERS. 

The main sequence is divided into three parts. 
Below 0.8 solar mass (roughly class G8), no star has 
ever had time to evolve. Low-mass (K and M dwarfs) 
evolution is therefore of only academic interest. Stel- 
lar evolution deals with less than 15% of the stars. 
Their fates depend again on mass. Between 0.8 solar 
mass and around 10 solar masses (classes G8 to B1), 
the stars die as white dwarfs. Above it (classes O and 
BO), they explode. Binary stars contribute further to 
the richness of stellar phenomena. 

Intermediate-mass evolution. Main-sequence life 
lasts until the core hydrogen is almost gone, at which 
time hydrogen fusion rather suddenly shuts down. 
With no support, the now-quiet helium core can 
contract more rapidly under gravity’s force. It heats, 
causing hydrogen fusion to spread into a thick en- 
closing shell that runs on the carbon cycle. With 
a new (though temporary) energy source, the star 
first dims some while it expands and cools at the 
surface, changing its spectrum to class K. The tran- 
sition takes only a few hundred million years or less, 
leaving few stars in the middle of the HR diagram, 
the lower masses appearing as E G, and K subgiants. 

The consequences again depend on mass. As core 
contraction proceeds beyond the rightward transi- 
tion in the HR diagram, stars from about 1 to 5 solar 
masses (still fusing hydrogen in a shell) suddenly 
and dramatically increase their luminosities. The fu- 
ture Sun will eventually grow 1000 times brighter 


than it is today, and a 5-solar-mass star (which be- 
gins at about 600 solar luminosities) will reach nearly 
3000 solar. At the same time, the stars swell to be- 
come red giants. While the core (roughly half a solar 
mass) of the future Sun shrinks to the size of Earth, 
the radius will expand to that of Mercury’s orbit, or 
even beyond. When the core temperature climbs to 
108 K, helium nuclei (alpha particles) begin fusing to 
unstable beryllium @Be), which quickly decays back 
to alpha particles, setting up an equilibrium. The tiny 
amount of Be present reacts with additional alpha 
particles to create helium via the triple alpha pro- 
cess: 


4He + ‘He < *Be 
®Be + ‘He > PC + gamma ray 


(The “skip” over lithium, beryllium, and boron ren- 
ders these elements rare.) The initiation of the triple- 
alpha process is explosive in stars like the Sun, since 
the core electrons become degenerate, like those 
that support white dwarfs. (The sudden burst of en- 
ergy is absorbed and does not reach the stellar sur- 
face.) Fusion with additional helium nuclei creates 
oxygen and even neon. Above 2 solar masses, helium 
burning starts more quietly (Fig. 3). See NUCLEOSYN- 
THESIS. 

The star, now stabilized by a helium-burning core 
that is surrounded by a hydrogen-fusing shell, re- 
treats about halfway down the red giant branch. 
The numerous lower-mass stars reside in the class K 
“red giant clump.” Low-mass metal-deficient globular 
cluster stars that have suffered different rates of mass 
loss spread out from the clump toward higher tem- 
peratures to create the distinctive horizontal branch. 
Energy-generating fusion reactions will try to pro- 
ceed toward iron (the most stable of all nuclei). 
Some 80% of the energy is generated in hydrogen 
burning, so (discounting further burning modes) 
the helium-burning stage lasts only around 20% of 
the main-sequence lifetime. From 5 solar masses 
up, evolution proceeds similarly, but instead of set- 
tling into a distinct location on the HR diagram, 
the stars loop to the blue (higher temperatures) 
where they fuse helium as class G, F, and A giants. 
This change in evolutionary style is not abrupt, but 
gradual with increasing mass. See GIANT STAR. 

Asymptotic giant branch (AGB). When the helium 
has fused to carbon and oxygen, the core again con- 
tracts. Helium fusion spreads outward into a shell, 
and for the second time the star climbs the HR 
diagram’s giant branch. Since the second climb is 
roughly asymptotic to the first, the second climb 
creates the asymptotic giant branch. The shrinking 
carbon-oxygen core is now surrounded by a shell of 
fusing helium, while the old hydrogen-fusing shell 
expands, cools, and shuts down. Eventually, how- 
ever, the helium shell runs out of fuel, and the hy- 
drogen shell reignites. Hydrogen burning feeds fresh 
helium into the space between it and the carbon 
core, and when there is enough of it, helium burn- 
ing reignites explosively in a helium flash (or ther- 
mal pulse) that can affect the star’s surface. The 
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Fig. 3. Hertzsprung-Russell diagram of evolution of intermediate-mass stars. These stars 


evolve from the main sequence to become giants that stabilize during helium fusion in a 


“clump.” When helium fusion is done, they brighten again as AGB stars, lose their 
envelopes, and evolve to the left to produce planetary nebulae, finally at lower left 
becoming white dwarfs. (After J. B. Kaler, Stars, W. H. Freeman, 1992, 1993, from 


work of I. Iben, Jr.) 


flash squelches hydrogen fusion, and the whole pro- 
cess starts again, helium flashes coming at progres- 
sively shorter intervals. AGB stars become larger and 
brighter than before, passing into the cool end of 
class M, where they eventually become unstable 
enough to pulsate as long-period variables (Miras). 
The Sun will become 5000 times brighter than now 
and will reach out to the Earth’s orbit, perhaps de- 
stroying the planet. More massive Miras can exceed 
the size of the Martian orbit. See MIRA; VARIABLE 
STAR. 

During this activity, stars go through various stages 
of convective dredge-up, in which they can raise ele- 
ments created in their nuclear-burning zones to the 
stellar surfaces. The first of these dredge-ups alters 
the nitrogen and carbon isotope ratios while the star 
is on the first ascent of the red giant branch. Above 
about 3 solar masses, a second dredge-up in the early 
AGB stage can increase both surface nitrogen and he- 
lium. In the helium-flashing state, even carbon from 
helium fusion can be brought upward. Moreover, a 
host of elements created by slow neutron capture are 
elevated as well, including zirconium, strontium, bar- 
ium, and many others. When the carbon abundance 
equals that of oxygen, an S star is seen (loaded with 
fresh zirconium); when carbon exceeds oxygen, a 
genuine carbon star is seen. See CARBON STAR. 

Mass loss and planetary nebulae. During the giant 
stages, stellar winds greatly increase. Mira pulsations 
cause shock waves that help drive mass from the 
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stellar surfaces, where the cooled gas becomes ever 
richer in molecules, some even condensing into dust 
grains. High luminosity pushes the dust outward, 
and the dust couples with the gas, resulting in slow 
(10 km/s or 6 mi/s) thick winds tens of millions 
of times stronger than the solar wind (up to 10~> 
solar mass per year). Because stars are in this state 
for hundreds of thousands of years, they will lose 
much of themselves back into space—the Sun about 
half of its mass, a 10-solar-mass star over 80%. Ad- 
vanced giants and Miras lose so much that they can 
visually disappear within dust clouds. Carbon star 
winds contain various kinds of carbon dust as well as 
organic molecules, while oxygen-rich stars produce 
silicate dust. The dust blown into interstellar space 
helps make new generations of stars. The elements 
created in the nuclear-burning shells are added to 
the interstellar inventory, where they too find their 
way into new generations of both stars and planets. 








Fig. 4. Thick molecule-filled cloud surrounding the carbon 
star IRC+10 216. (a) Image made in radiation from 
hydrocyanic acid (HCN). (b) Image in radiation from 
cyanoacetylene (HC3N). Each map is 6000 astronomical 
units across. (J. Bieging; Berkeley-Illinois-Maryland 
Association) 





Fig. 5. Hubble Space Telescope image of planetary nebula 
NGC 7662. Planetary nebulae are formed when giant stars 
slough their outer envelopes, leaving a white dwarf behind. 
(B. Balick et al.; Space Telescope Science Institute; NASA) 


A good portion of the carbon, most of the nitrogen, 
and significant fractions of the Galaxy’s other ele- 
ments come from such winds (Fig. 4). 

So much mass is lost that an evolving star becomes 
stripped nearly to its fusion zone, which is protected 
from the outside by a low-mass hydrogen envelope. 
As the inner region becomes exposed, the wind 
diminishes in mass but increases in speed and 
temperature. Hammering at the surrounding dusty, 
molecule-filled shroud of lost mass, the high-speed 
wind compresses the inner edge into a thick ring. 
Eaten away from the top by the wind and from below 
by fusion, the stellar envelope shrinks, slowly expos- 
ing the hot shell-core structure beneath. When the 
stripped star’s surface reaches 25,000 K (45,000°F), 
the dense ring that its high-speed wind had previ- 
ously created is ionized. Subsequent recapture of 
electrons by ions, along with collisional excitation of 
heavy atoms, causes the shell to glow, anda planetary 
nebula is born. The first of these was announced 
by William Herschel (who named the class on the 
basis of their disklike shapes) in 1785. Over a 
thousand are now known (Fig. 5). See PLANETARY 
NEBULA. 

As the planetary nebula, expanding at about 
20 km/s (12 mi/s), grows in size, it eventually be- 
comes so tenuous that the ionizing radiation bursts 
through, allowing much of the full structure to be 
seen. After 50,000 years, the nebula invisibly merges 
with the interstellar medium. During this time, the 
star inside first heats at constant luminosity to over 
100,000 K (180,000°F), the luminosity and final tem- 
perature depending on the old core’s mass (which 
ranges from around 0.5 Mo to nearly 1.4 Mo, the 
Chandrasekhar limit, above which white dwarfs can- 
not exist). As residual nuclear fusion shuts down, 
the star cools and dims at constant radius to become 
a white dwarf. The luminosity-mass relation is now 
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Fig. 6. Hertzsprung-Russell diagram for evolution of high-mass stars. These stars evolve to red supergiants, some evolving 
back to blue supergiants. Above about 60 solar masses, evolution is stalled by extreme mass loss. (From work of A. Maeder 
and G. Meynet; and R. Humphreys and K. Davidson) 


reversed, as higher mass means higher gravitational extent, the most massive become unstable and un- 
compression and smaller radius. dergo huge eruptions. In 1846, Eta Carinae, near 

Made of carbon and oxygen surrounded by a thin 100 solar masses (and a probable double star), bright- 
hydrogen or helium atmosphere, the white dwarf is ened to match Sirius and Canopus, lost a solar mass, 
supported forever by degenerate electron pressure. and then faded to near the edge of naked-eye vision 
Since the evolutionary and cooling times of white (Fig. 7). Such luminous blue variables (LBVs), plus 
dwarfs can be calculated theoretically, the oldest of similar lesser lights, are losing their hydrogen en- 
them provides an age measure for the galactic disk, velopes and may be turning into helium-rich Wolf- 
which accords well with that found from open clus- Rayet stars, highly luminous stars that have been 


ters. See WHITE DWARF STAR. 

High-mass evolution. As the mass of a star in- 
creases, so does the mass of the core. The Sun will 
turn into a white dwarf of around 0.6 solar mass. At 
10 solar masses, the core reaches the Chandrasekhar 
limit, and the star cannot become a white dwarf. 
At first, high-mass evolution proceeds similarly to 
that of stars of lower mass. As high-mass stars use 
their core hydrogen, they too migrate to the right 
on the HR diagram, becoming not so much brighter 
but larger, cooling at their surfaces and turning into 
supergiants. Below about 40 solar masses, they be- 
come class M red supergiants, losing huge amounts 
of mass through immense winds. Some (depending 
on mass) stabilize there by the start of helium fu- 
sion; others loop back to become blue supergiants. 
Above 60 solar masses, so much mass is lost through 
winds that the stars do not make it much past class 
B, stalling there as helium fusion begins (Fig. 6). As 
the stars—as well as bright giants—loop through the Fig. 7. Hubble Space Telescope image of a luminous blue 
middle of the diagram, some become Cepheid vari- variable, the supergiant Eta Carinae, buried in the middle of 
ables. See CEPHEIDS: SUPERGIANT STAR. a giant expanding cloud of its own making. (J. Morse, 


University of Colorado; Space Telescope Science Institute; 
Though almost all these supergiants vary to some NASA) 
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Fig. 8. Field around the type II core-collapse supernova SN 1987A. (a) Two weeks after the 
supernova exploded in the Large Magellanic Cloud in 1987. The supernova far outshines 
its neighboring stars, and was easily visible 150,000 light-years away. (b) Before the 
explosion it was a rather ordinary B1 blue supergiant (indicated by arrow). (© Anglo- 
Australian Observatory, photograph by David Malin) 


almost completely stripped of their outer hydrogen. 
Though the sequence is not clear, the luminous blue 
variables probably first turn into the nitrogen-rich 
WN variety (whose nitrogen has been enriched from 
hydrogen fusion by the carbon cycle) and then into 
the carbon-rich WC species (the carbon from more 
advanced helium fusion). 

While intermediate-star fusion stops at carbon and 
oxygen, supergiants continue onward. The carbon- 
oxygen core shrinks and heats to the point that car- 
bon fusion can begin, and then carbon and oxy- 
gen convert to a more complex mix dominated by 
oxygen, neon, and magnesium. Helium fusion now 
continues in a surrounding shell that is nested in 
one that is fusing hydrogen. Once carbon burning 
has run its course, the unsupported oxygen-neon- 
magnesium core shrinks and heats, and now it is 
carbon burning’s turn to move outward into a shell. 
When hot enough, the oxygen-neon-magnesium mix 
fires up to burn to one dominated by silicon and sul- 
fur. Continuing the process, the developed silicon 
and sulfur core finally becomes hot enough to fuse 
to iron, the silicon-burning core wrapped in oxygen- 
neon-magnesium, helium, and hydrogen-burning 
shells. 

Supernova. Each nuclear fusion stage generates 
less energy, and since each takes on the role of sup- 
porting the star, each lasts a shorter period of time. 
While hydrogen fusion takes millions of years, the 
iron core is created from silicon fusion in a matter 
of weeks. Iron cannot fuse and produce energy. The 
core, about 1.5 solar masses and the size of Earth, 
suddenly collapses at a speed a good fraction that 
of light. The iron atoms are broken back to neu- 
trons and protons. Under crushing densities, free 





electrons merge with protons to make yet more neu- 
trons. When the collapsing neutron core hits nuclear 
densities of 10'* g/cm? (10'* times the density of 
water), it violently bounces, and the rebound tears 
away all the outer layers. 

From the outside, the observer sees the explosion 
as a type II supernova that can reach an absolute 
magnitude of — 18 (Fig. 8). Temperatures within the 
exploding layers are high enough that a great vari- 
ety of nuclear reactions, from equilibrium fusion of 
one element into another to rapid neutron capture, 
create all the chemical elements up to and beyond 
uranium. The optical glow eventually comes mainly 
from the manufacture of a tenth or so of a solar mass 
of radioactive nickel, °°Ni, which quickly decays to 
radioactive cobalt and then to stable iron, >°Fe, which 
gets blasted into the cosmos along with everything 
else. See RADIOACTIVITY. 

The nature of the rebound is far from under- 
stood. By itself, it does not account for the explo- 
sion of a supernova. However, the merger of core 
protons and electrons generates vast numbers of 
near-massless neutrinos. Nearly everything is trans- 
parent to them. Those created in the solar core by 
the proton-proton chain go right through the Earth. 
The densities around the rebounding neutron core, 
however, are so high that the neutrinos are trapped, 
and may aid in shoving the surrounding shells and 
envelope outward. Other ideas invoke violent con- 
vection within the neutron core to provide the en- 
ergy. However it is done, the outflowing neutrinos 
carry away 100 times more energy than does the blast 
itself. The power output of a single supernova for a 
brief moment equals that of the rest of the stars in the 
visible universe. At the highest progenitor masses, su- 
pernovae are believed to eject their energy in bipo- 
lar jets, leading to longer-duration gamma-ray bursts 
(GRBs). The progeny of O stars, supernovae are rare, 
occurring in the Milky Way Galaxy at a rate of only 
one or two per century, so the odds of one being 
close to Earth are fortunately small. See GAMMA-RAY 
BURSTS; NEUTRINO; SUPERNOVA. 

Supernova remnants and compact remains. The de- 
bris of the explosion, the supernova remnant, highly 
enriched in iron and other heavy elements, expands 
for centuries into space. Its shock wave energizes 
interstellar space, heats vast bubbles of gas to hun- 
dreds of thousands of degrees, and helps compress 
interstellar matter and create new stars. Its heavy el- 
ements become incorporated into molecular clouds. 
Most of the Earth came from previous generations of 
supernovae, each one contributing perhaps a moun- 
tain mass. Accelerated by the shock, ambient in- 
terstellar electrons, protons, and heavier nuclei ap- 
proach the speed of light and speed through the 
Galaxy as cosmic rays, which in their turn also help 
initiate star formation (Fig. 9). See CRAB NEBULA; IN- 
TERSTELLAR MATTER; NEBULA; SHOCK WAVE. 

Exposed by the explosion is the compact hot neu- 
tron star. Only 25 km (15 mi) across, it is stabilized by 
the pressure of neutron degeneracy. Conservation of 
angular momentum makes the little star spin dozens 
of times per second, its magnetic field compacted to 





Fig. 9. Crab Nebula, the remnant of the supernova of 1054, 
as seen in radio radiation. (National Radio Astronomy 
Observatories) 


102 or more times stronger than Earth’s. Radiation 
beams out along a tilted, wobbling magnetic axis, 
and if Earth is in its path, a “pulse” of radiation is 
observed, the neutron star now a “pulsar.” As the 
neutron star radiates away its energy, it spins more 
slowly and finally disappears from view. The most 
highly magnetized neutron stars, the magnetars, can 
have magnetic field strengths greater than 10'‘ times 
that of Earth, and are related to anomalous x-ray pul- 
sars (AXPs) and to soft gamma-ray repeaters (SGRs), 
the latter occasionally releasing bursts so powerful 
that they can affect the Earth. The Galaxy must con- 
tain over 108 quiet, near-invisible neutron stars. Off- 
center detonation in the supernovae that created 
them may give them kicks that can send them off at 
speeds far higher than those of most of the Galaxy’s 
stars. See NEUTRON STAR; PULSAR. 

Beyond the Chandrasekhar limit of 1.4 Mo, elec- 
tron degeneracy pressure cannot stabilize a white 
dwarf, and it must collapse. Neutron stars are close 
to this limit, and are themselves similarly limited to 
around 3 solar masses. Stars at the upper end of the 
main sequence are expected to create iron cores that 
exceed even this limiting mass. The dense remains 
cannot stabilize, and they too must therefore col- 
lapse. When such a collapsing star passes a critical 
radius at which the escape velocity is about that of 
light, it become invisible, and a black hole is born. 
About half a dozen black hole candidates are rec- 
ognized in binary systems in which the black hole 
affects the companion. See BLACK HOLE. 

Double-star evolution. Double (binary) stars are a 
common fact of stellar life. Within wide margins, 
about half the Galaxy’s stars have companions. If the 
components of a binary are well separated, by tens 
or hundreds of astronomical units, the stars evolve 
separately as if each were single. If the stars are close 
enough together, however (the critical limits depen- 
dent on mass), they can profoundly affect each other 
and their courses of evolution. 

A binary is first made of a pair of dwarf stars. The 
more massive evolves first into a giant or even a su- 
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pergiant. If the two are close enough to start with, 
the growing giant can be affected by the smaller star. 
As it swells, it approaches a teardrop-shaped tidal 
surface in which the gravitational pulls from the two 
stars (including the forces resulting from orbital mo- 
tion) are effectively equal. If the giant or supergiant 
actually reaches the zero-gravity surface, mass can 
flow through the point of the teardrop toward the 
smaller star. If the two still have sufficient separation, 
the incoming matter will first flow into a disk around 
the dwarf, from which the dwarf will accrete mass. 
If the two are close enough, matter will impact the 
dwarf directly without forming a disk. In either case, 
the result is that mass will be transferred, resulting 
both in interesting spectral activity and in the more 
massive star eventually becoming the less massive. 
Among the most dramatic examples of this kind of 
behavior is the eclipsing binary Algol, in which a 
class B dwarf accretes mass from a K giant. Mass 
loss can also wrap the pair in a common envelope. 
That and angular momentum losses resulting from 
the close interaction as well as from interacting mag- 
netic fields can draw the two closer together and 
perhaps even make them merge. 

The giant eventually produces a planetary neb- 
ula and then becomes a white dwarf. Mass loss has 
left the white dwarf with almost no hydrogen, per- 
haps only a thin skin on its surface. If the two stars 
have been drawn close enough, the white dwarf can 
tidally stretch the remaining dwarf (usually a G, K, or 
M star) to the zero-gravity surface, which can then 
transfer mass, mostly hydrogen, back to the white 
dwarf (Fig. 10). Instabilities in the accretion disk, 
as well as ionization and illumination of the flowing 
stream by a hot spot created by interacting gases, 
make the pair a flickering cataclysmic variable (CV). 
When the fresh hydrogen layer is thick, compressed, 
and hot enough, it explosively burns via the carbon 
cycle, the thermonuclear runaway creating a nova 
that brightens to absolute visual magnitude My = 
—10 or so. Several are seen each year, one of first 
magnitude every few decades. Since a nova is only a 
surface phenomenon and does not much affect the 
stars of the binary, the system settles back into its 
prior CV behavior, and after 10° years may produce 
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Fig. 10. A star fills its tidal surface and passes mass onto a white dwarf in preparation for 
a nova or even a type la supernova. (From J. B. Kaler, The Little Book of Stars, Copernicus 
Books, New York, 2000) 
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a nova explosion again. See CATACLYSMIC VARIABLE; 
NOVA. 

In wider dwarf-white dwarf combinations, the or- 
dinary dwarf will eventually begin its own evolution 
and become a giant. If it had received mass from its 
companion when it was a dwarf (perhaps through a 
wind), the giant may have been contaminated with 
by-products of nuclear fusion and appear as a barium 
star, anomalously rich in barium and other elements. 
As the giant grows, it passes its mass to the white 
dwarf through its vigorous wind or via an accretion 
disk. The hot white dwarf, or a hot boundary layer 
between an accretion disk and the white dwarf’s sur- 
face, now gives the binary dual characteristics, those 
of both a hot and a cool star at the same time. These 
symbiotic stars can vary dramatically, and the trans- 
ferred hydrogen can erupt into long-term fusion re- 
actions. See SYMBIOTIC STAR. 

White dwarf supernovae. There are two quite differ- 
ent kinds of supernovae. Type II supernovae have hy- 
drogen in their spectra. They occur in galactic disks, 
where young high-mass stars reside, and are pro- 
duced by iron-core collapse. Type I events, however, 
do not exhibit spectral hydrogen. They are divided 
into three categories: Type Ia supernovae display an 
absorption line of ionized silicon, whereas type Ib 
exhibits helium instead of silicon, and Ic has neither 
(or weak helium). While Ib and Ic supernovae are 
also confined to the disks of host galaxies, Ia super- 
novae occur in both disks and galaxy haloes where 
there are no high-mass stars. Tycho’s Star of 1572 and 
possibly Kepler’s Star of 1604 were type Ia (though 
some argue that the latter was type ID. 

The Ib variety is probably produced by core col- 
lapse in massive hydrogen-poor Wolf-Rayet stars. 
White dwarfs in double systems are obvious candi- 
dates for type Ia. The more massive the receiving 
white dwarf, the more quickly the infalling fresh 
hydrogen reaches the nova flash point. If massive 
enough, the interval between successive novae be- 
comes less than the time over which they have been 
observed, and the novae become recurrent. For ex- 
ample, RS Ophiuchi and its cohort produce novae 
every 20 years or so. Each nova leaves a little resid- 
ual matter, so the white dwarf increases in mass. If it 
is pushed over the Chandrasekhar limit, the star must 
catastrophically collapse and burn, producing a type 
Ia supernova. A second scenario that might produce 
type Ia supernovae is the merger of the stars in a 
double white dwarf system in which the sum of the 
masses exceeds the Chandrasekhar limit. Type Ia su- 
pernovae are even brighter than core-collapse type 
II events, and can reach My = —19. They also de- 
posit more iron into the cosmos than do the type II, 
typically three-tenths of a solar mass. All the iron in 
the universe is made by supernovae. See WOLF-RAYET 
STAR. 

Though the origins of type Ia explosions are not 
very well understood, the supernovae have such pre- 
dictable absolute magnitudes that they can be used 
to measure distances to distant galaxies and deter- 
mine not just the Hubble constant but also more 
subtle expansion characteristics of the universe. 


Heavy-element enrichment. Many of the above sce- 
narios are still shrouded in mystery. Not all the dou- 
bles have all the behavior patterns described, and 
there are great numbers of categories of each kind 
of phenomenon. The overall result of stellar evolu- 
tion is clear, however. As stars become giants that 
turn into white dwarfs, or as they become super- 
giants that create neutron stars or black holes, they 
feed huge quantities of enriched matter back into 
the star-generating clouds of interstellar space. Later 
generations of stars therefore have more heavy ele- 
ments than do earlier generations. Computer model- 
ing of the chemical composition of the Galaxy, start- 
ing with the hydrogen and helium of the big bang, 
can closely replicate the chemical composition of 
the Sun, providing powerful evidence that the theo- 
ries are correct. The Earth, made almost entirely of 
heavy elements, is a distillate of solar gases, and ex- 
ists only as a result of the combined action of the 
stars. James B. Kaler 
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A magnetic field, far stronger than the Earth’s mag- 
netic field, which is possessed by many stars. Mag- 
netic fields are important throughout the life cycle of 
a star. Initially, magnetic fields regulate how quickly 
interstellar clouds collapse into protostars. Later in 
the star formation process, circumstellar disk mate- 
rial flows along magnetic field lines, either accreting 
onto the star or flowing rapidly out along the rotation 
axis. Outflowing material (stellar winds) carries away 
angular momentum, slowing rotation at a rate that 
depends on stellar magnetic field strength. On the 
Sun, dark sunspots, prominences, flares, and other 
forms of surface activity are seen in regions where 
there are strong magnetic fields. There is some evi- 
dence that long-term variations in solar activity may 
affect the Earth’s climate. Turbulence in the solar at- 
mosphere drives magnetic waves which heat a ten- 
uous corona (seen during eclipses) to millions of 
degrees. About 10% of hotter stars (with tempera- 
tures of about 10,000 K) with stable atmospheres 
are Ap stars, which have stronger magnetic fields that 
control the surface distribution of exotic elements. 
Even after stars end their internal fusion cycle and 
become compact remnants, magnetic fields channel 
accreting material from binary companions, occa- 
sionally producing spectacular novae. Despite the 
enormous gravity around pulsars, magnetic forces 
far exceed gravitational forces, creating intense elec- 
tromagnetic beams that spin down (slow the rota- 
tion of) the pulsar. See BINARY STAR; CATACLYSMIC 


VARIABLE; NOVA; PROTOSTAR; PULSAR; STELLAR EVO- 
LUTION. 

Measurement techniques. Stellar magnetic fields 
are measured remotely using optical, infrared, and 
radio spectra. Magnetic fields alter atomic and molec- 
ular energy levels via the Zeeman effect, causing 
spectral lines to split and shift relative to their non- 
magnetic locations. The size of the shift depends di- 
rectly on wavelength and magnetic field strength. 
When the splitting is large compared to line width 
(as in sunspots, some Ap stars, and magnetic white 
dwarfs), the location and relative strengths of individ- 
ual components yield both field strength and orienta- 
tion. When the splitting is too small to resolve, mea- 
surements of spectral line polarization (polarimetry) 
can sometimes put limits on magnetic field strength, 
but only if the field geometry is simple (as in Ap 
stars). For distant stars that resemble the Sun, the 
polarization signal is small because the surface as a 
whole contains nearly equal amounts of oppositely 
directed magnetic field. In this case, the only re- 
course is to look for subtle enhancements of spectral 
line width or strength due to the Zeeman effect. See 
ASTRONOMICAL SPECTROSCOPY; POLARIMETRY; ZEE- 
MAN EFFECT. 

Observations. Broadening of spectral lines in 
sunspots was first observed by J. Norman Lockyer 
in 1866, but the magnetic origin of this broadening 
was not understood until after the theoretical work 
of Pieter Zeeman in 1897 and the first polarimetry 
by George Ellery Hale in 1908. Field strengths are 
typically 0.3 tesla (3 kilogauss) in sunspots. Zeeman 
broadening implies similar field strengths for sev- 
eral other cool stars, but current stellar measurement 
techniques are effective only if at least 5% of the sur- 
face contains strong magnetic fields (compared to 
0.5% for the Sun). For cool stars, polarimetry is use- 
ful only if specific areas with uniform magnetic field 
orientation can be localized (through spatial maps 
of the Sun, Doppler images of rapidly rotating stars, 
and emission from flares or accretion sites). Ap stars 
have magnetic fields as strong as 3.3 T (33 kG), or- 
ganized into global dipoles that are often offset from 
the center of the star and tilted with respect to the 
rotation axis. For reference, the Earth has a dipole 
magnetic field that is only 6 x 107% T (0.6 G) at the 
pole. Finally, dense stellar remnants can have tremen- 
dous fields, reaching up to 104 T (10° G) for white 
dwarfs and 10° T (10!? G) for neutron stars. See GEO- 
MAGNETISM; NEUTRON STAR; SOLAR MAGNETIC FIELD; 
WHITE DWARF STAR. 

Origin and dynamics. Large magnetic structures on 
the Sun vary in location, number, and magnetic po- 
larity with an average cycle period of 22 years. Sim- 
ilar cycles have been detected on several other cool 
stars with turbulent (convective) atmospheres. In 
these stars, magnetic heating of coronae increases 
with rotation rate. Such behavior is expected of hy- 
dromagnetic dynamos, but details are sketchy and 
more than one type of dynamo may operate in the 
Sun. In warmer or more compact stars with convec- 
tively stable envelopes, magnetic fields are assumed 
to be concentrated fossils from earlier stages of evo- 
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lution. Ultimately, the magnetic fields in these stable 
stars are attributed to the original interstellar gas out 
of which the stars formed. See MAGNETOHYDRODY- 
NAMICS; STAR; SUN. Jeff A. Valenti 
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One of the categories into which stars may be clas- 
sified, based on their based on their spatial distribu- 
tion and dynamics within a galaxy and their epoch 
of formation. The stellar component of the Milky 
Way Galaxy consists of three populations: the thin 
disk, the thick disk, and the halo. The thin disk, orig- 
inally referred to as population I, is the youngest, 
and includes stars currently forming as well as older 
stars such as the Sun. It is located amid most of the 
molecular and cold atomic gas, and confined to a 
height of order 1 kiloparsec (3.3 x 10% light-years 
or 3.1 x 10!°km or 1.9 x 10!° mi) from the galactic 
plane. The bulge and halo, roughly corresponding to 
the original population II, is a much older, far more 
extended structure, having an approximately spher- 
ical distribution with a scale length of order 3.5 kpc. 

Development of concept. The existence of differ- 
ent populations in the stellar component of external 
galaxies was announced by W. Baade in 1944. During 
the wartime blackouts of Los Angeles, he was able to 
take the first deep multiwavelength photographs of 
the Andromeda Galaxy, M31. He found that the blue 
stars are confined to the disk of that galaxy, while the 
redder stars whose colors are more like the Sun or 
the globular clusters are more concentrated toward 
the galactic center and nuclear bulge. Baade referred 
to these stellar components as populations and la- 
beled the younger blue disk stars population I. Those 
in the halo and nuclear spheroid, identified with the 
RR Lyrae variables (also known as cluster variables 
because of their preponderance in globular clusters), 
were called population II. A hypothetical popula- 
tion III has also been proposed to represent the first 
stars formed during the collapse of a galaxy. Some 
evidence for these stars is found in the extremely 
high abundances of r-process elements found in stars 
in which the overall abundance of heavy elements 
(elements other than hydrogen and helium) is less 
than 10~ that of the Sun. Further kinematic stud- 
ies of space motions demonstrated the distinctness 
of the two principal populations and accounted for 
their different distributions but left open the inter- 
pretation of these effects. See ANDROMEDA GALAXY; 
NUCLEOSYNTHESIS; VARIABLE STAR. 

The population concept is now central to the un- 
derstanding of chemical evolution of the Milky Way 
Galaxy. The globular clusters, the prototypical old 
stellar objects, have main-sequence turn-off ages of 
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The metallicity distribution of stars as a function of velocity 
V perpendicular to the galactic plane. (Negative values of V 
indicate motion toward the plane.) (a) —30 km/s < V < —10 
km/s. (b) —60 km/s < V < —30 km/s. (c) —100 km/s < V< 
—60 km/s. (d) —150 km/s < V < —100 km/s. (e) V < —150 
km/s. (After B.W. Carney, D. W. Latham, and J. R. Laird, A 
survey of proper-motion stars, VIII. On the galaxy’s third 
population, Astron. J., 97:423-430, 1989) 


at least 10!° years and are among the most metal- 
deficient stars in the Milky Way Galaxy. These are 
widely distributed throughout the system and serve 
as markers in the halo to the mass distribution. On 
the other hand, the OB associations contain many 
massive stars with ages not greater than 10” years and 
have the highest metal abundances, often as much as 
a factor of 2 higher than the Sun. (At an age of about 
4.6 x 10° years, the Sun was formed asa thin disk star 
as well.) The subdwarfs, the metal-deficient main- 
sequence stars, are a systematically older group of 
stars than those found confined to the thick galac- 
tic disk. The illustration shows how the metallicity 
distribution of stars decreases with increasing magni- 
tude of velocity perpendicular to the galactic plane. 
Here, the metallicity [m/H] is the logarithmic dif- 
ference in the overall abundance of heavy elements 
with respect to the Sun. For example, in a star with 
[m/H] = 0, this abundance is equal to that of the Sun, 
while in a star with [m/H] = —3 it is 10~ that of the 
Sun. See STAR CLUSTERS. 

Populations in galaxies. Fundamentally, the pres- 
ence of these distinct stellar populations demon- 
strates that the Milky Way Galaxy has undergone 
an extended period of active star formation. The 
spheroid population is quite similar to elliptical 
galaxies, which are assumed to have ended active star 
formation many billions of years ago. Starburst galax- 


ies give a counterexample of the fixity of the popula- 
tion distinction, however, since they may have com- 
pleted their primary star-forming phase long ago, 
but the injection of a new mass of gas through colli- 
sions or accretion in a large-scale flow can produce 
a dramatic turn-on of the star formation. There is 
also increasing evidence that the Galactic popula- 
tions may be contaminated by accretion of other, 
lower-mass satellite galaxies within the Local Group 
whose star-forming histories differed from that of the 
Milky Way. See GALAXY, EXTERNAL; GALAXY FORMA- 
TION AND EVOLUTION; STARBURST GALAXY. 

Characteristics of populations. The thin disk is the 
youngest component of the Galactic stellar popula- 
tion. Still actively forming massive stars from molec- 
ular clouds, it is confined to within about 0.35 kpc of 
the plane. All of the stars have metallicities lying be- 
tween about one-fifth and twice the solar value (illus. 
a), similar to that of the interstellar medium diffuse 
gas, and star formation appears to have remained rel- 
atively constant in this population for about the past 
8 x 10° years, although there have been some ex- 
cursions of a factor of order 50% during that time. 
One reason for the relatively small thickness of the 
disk is the low velocity dispersion of the component 
stars, about 22 km/s (14 mi/s); their systematic mo- 
tion is completely dominated by the differential rota- 
tion of the disk. Since the most massive stars are also 
the youngest, having the shortest nuclear lifetimes, 
these hot stars are found associated with H II regions 
and OB associations as well as in open clusters. Mas- 
sive stars are uniquely formed in the thin disk, sup- 
porting the attribution of youth to the population. 
Finally, type II supernovae, the death throes of mas- 
sive stars, show a thin-disk distribution throughout 
the Milky Way Galaxy, and there is an indication that 
the remnants of massive stars, the pulsars (neutron 
stars with strong magnetic fields), are also confined 
to the thin disk. See INTERSTELLAR MATTER; MOLECU- 
LAR CLOUD; PULSAR; SUPERNOVA. 

The thick disk is an older population, approxi- 
mately 9-10 x 10° years, roughly corresponding to 
the range between what was once called popula- 
tion II and population I. Its metallicity lies between 
about one-tenth and one-third of the solar value. The 
stars in this population are distributed over greater 
distances from the plane, up to 1.5 kpc, and have 
correspondingly larger velocity dispersion, about 45 
km/s (28 mi/s). This population also includes globu- 
lar clusters and subdwarfs that overlap at the lowest 
end of the abundances with the properties of the 
halo globulars, although the system of old disk glob- 
ulars is distributed differently than those of the halo. 
Type I supernovae are associated with this popula- 
tion. In the Milky Way Galaxy the spatial structure 
of this population roughly resembles that of an E7 
elliptical galaxy. 

Finally, enclosing the disk and the nuclear 
spheroidal bulge, there is a halo that extends to con- 
siderable distances from the plane, with some stars 
as distant as 30 kpc. This population has an age of 
order 10-15 x 10° years and a scale height of order 
3.5 kpc or greater. The stars in this region have very 
large velocity dispersions, about 130 km/s (80 mi/s), 


and do not appear to participate in the differential 
rotation as much as other stars. Their metallicities 
are all lower than about one-twentieth that of the 
Sun and may extend down to 10~? of the solar value 
(illus. e). The most metal-poor globular clusters be- 
long to this population. This stellar halo is not the 
same as the so-called dark matter halo, but is proba- 
bly embedded within it. See DARK MATTER. 

Delineation of populations. The population para- 
digm has become progressively more blurred with 
time, as the complex history of Galactic star forma- 
tion has become clearer, and there has been consider- 
able debate over whether any of these populations 
are truly distinct. Indeed, the separation based on 
elemental abundances renders this question some- 
what circular since these abundances are already em- 
bedded in the definition of the membership in the 
different populations. Nonetheless, there are some 
distinct differences in these populations that permit 
their delineation. For instance, the globular clusters, 
although separated into halo and disk systems, are 
distinctly older than any of the clusters found in the 
disk. The oldest disk open clusters, NGC 188 and 
M69, show abundances nearly as low as the most 
metal-rich globulars like 47 Tucanae, but are younger 
by nearly 3 x 10° years than the youngest of the 
disk globulars. The most metal-deficient globulars 
are considerably older than any of the stars in the 
disk and much lower in abundance. See MILKY WAY 
GALAXY; STAR. Steven N. Shore 
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The spinning of stars, due to their angular momen- 
tum. Stars do not necessarily rotate as solid bod- 
ies, and their angular momentum may be distributed 
nonuniformly, depending on radius or latitude. How- 
ever, it is nearly impossible to resolve the surfaces of 
stars, or see their interiors, and the limited ability to 
observe them means that their rotation is generally 
expressed as a single number, veg sin 7. In this mea- 
sured quantity, Veq is the star’s equatorial rotational 
velocity (in kilometers per second), and 7 is the angle 
between the star’s rotation axis and the line of sight 
to the star. In other words, Veg sin 7 is the compo- 
nent of a star’s rotation that is projected onto the 
line of sight between the star and the observer. Mea- 
surements Of Veg sin 7 in stars range from as little as 


1 km s~! (0.6 mi s7» up to 400 km s~! (250 mi s75 
or more. A more physically useful measure of rota- 
tion is Q, a star’s angular velocity, or P,.,, the rotation 
period (the inverse of Q). In some cases it is possible 
to measure P,ot directly. See ANGULAR MOMENTUM; 
STAR. 

A Veq Sin 7 value is determined from the breadth of 
absorption lines in the star’s spectrum. The Doppler 
effect causes the lines to be broadened because one 
limb of the star is receding and the other approach- 
ing. As a reference, the Sun is a very slowly rotating 
star (Veq Sin ¢ = 1.8 km s“! or 1.1 mi s~'), and solar 
absorption lines have a full width at half maximum 
(FWHM) of about 7 km s~! (4 mi s~4), which arises 
from thermal motions, the physics of line formation, 
and turbulent motions in the Sun’s atmosphere. This 
means that if a star’s rotation rate exceeds 15-20 km 
s7! (9-12 mi s7», then rotation dominates the broad- 
ening of the lines and other effects need not be con- 
sidered, in general. To determine smaller values of 
Veq Sin 7 reliably, it is necessary to observe absorption 
lines carefully and to then fit them with models that 
take into account all the broadening mechanisms. 
By doing this it is possible to measure veq sin i down 
to the Sun’s level and below. See DOPPLER EFFECT; 
SUN. 

Late-type stars (cooler than about 6500 K or 
11,200°F) exhibit spots on their surfaces analogous 
to sunspots, and these can be large enough to pro- 
duce observable changes in the light of the star. In 
such cases it is possible to measure P,,, directly and, 
in a few instances, changes in P,.; have led to esti- 
mates of differential rotation (the dependence of Q 
on latitude). Differential rotation is critical for the 
mechanisms that generate magnetic fields in low- 
mass stars. Rotation periods are independent of as- 
pect and so provide a true indication of the intrinsic 
spread of rotation within star clusters, for instance. 
The high quality of P,., measurements means that 
more is now known about the rotation in low-mass 
stars than is known for massive stars, the opposite of 
the case a generation ago. See STAR CLUSTERS; STEL- 
LAR MAGNETIC FIELD. 

Stars get their angular momentum from the mate- 
rial from which they form, and one of the key prob- 
lems in star formation is understanding how a cloud 
of material in the interstellar medium can contract 
enough to become a star. This is because a tiny de- 
gree of spin in the natal cloud leads to huge rotation 
rates by the time the material is dense enough to be 
a star. Stars appear to regulate their rotation in these 
early phases by interacting with the disk of material 
that surrounds them (the debris left over from star 
formation) and through jets, although this is an ac- 
tive area in current research. Stars are observed to 
have their rotation axes distributed randomly, which 
means that they do not acquire their rotation from 
the rotation of the Milky Way Galaxy but from turbu- 
lent motions in the parent cloud. This is confirmed 
by noting that the angular momentum axes of binary 
stars are also oriented randomly and that there is no 
tendency for stars in clusters to have rotation axes 
that align with the Galaxy’s. See BINARY STAR; PRO- 
TOSTAR; STELLAR EVOLUTION. 
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Each type of star has its own rotational charac- 
teristics, and there is a broad trend in which typi- 
cal rotation rates decline with the mass of the star 
in going down the main sequence. There is an es- 
pecially prominent break in rotation such that stars 
with masses below about 1.2 times the solar mass 
have very low rotation rates, due to their intrinsic 
structure, as explained below. 

Early-type stars. The early-type stars (types O, 
B, A, and early-F) typically have veq sin į values of 
50 km s~! G0 mis~!) or more, and so their rotation 
is easily discerned on moderate-resolution spectro- 
grams. For this reason, and because the early-type 
stars were easier to study with photographic plates, 
they were the first to have their rotational proper- 
ties examined in detail, and most of the extant data 
dates from the photographic era. The most massive 
stars (of types O and B) sometimes exhibit nonradial 
pulsations and strong winds. They also can have sur- 
face abundance anomalies that result from internal 
mixing. All these effects are related to rotation, and 
they can complicate the interpretation of the obser- 
vations in very massive stars. At the same time, rota- 
tion rates in O and B stars can approach the breakup 
velocity, reducing the effective gravity in the star and 
thereby affecting the star’s evolution. See ASTRONOM- 
ICAL SPECTROSCOPY; SPECTRAL TYPE. 

For the O stars, the distribution of veq sin į has 
two peaks, centered at about 100 and 300 km s7! 
(60 and 190 mi s7»). The highest velocities seen are 
near 400 km s~! (250 mi s~'), and the high veq sin i 
values (200 km s~! or 125 mi s~! and more) are seen 
only in the main-sequence O stars. These very high 
rates extend into the Be stars. O stars can also have 
significant turbulence (20 kms~! or 12 mis~'), and so 
there are none with sharp-lined spectra. O stars have 
strong radiation-driven winds and significant mass 
loss so that these stars can lose angular momentum 
during their main-sequence lifetimes. 

The highest rotation rates can reduce the central 
pressure within a star; this can lead to slightly lower 
luminosity (by as much as 3%). The shape of the 
star can also change so that the surface temperature 
changes with latitude. A more significant effect is that 
rapid rotation influences mixing within the star and 
transports angular momentum. In particular, higher 
rotation leads to a higher degree of turbulent diffu- 
sion that can double a star’s luminosity in some cases. 

The B stars have an average ve, sin 7 of about 
140 km s“! (87 mi s~}), but the Be stars (B stars 
with emission lines, comprising about 15-20% of 
the B stars) rotate at significantly higher rates (the 
average Veq for Be stars, corrected for inclination, is 
265 km s™! or 165 mi s~}). Even this high rate is 
still about a factor of 2 below the breakup velocity 
(500-550 km s~! or 310-340 mi s~}). 

The A and early-F stars have moderate rotation 
rates (50-100 km s~! or 30-60 mi s~!) except for 
those exhibiting peculiar chemical abundances. The 
Am, or metallic-line A stars, and the Ap, or magnetic 
A stars, rotate at an average rate of about 40 km s~! 
(25 mis~}), a factor of 3-4 lower than normal stars 
of similar colors. Only a very few chemically peculiar 
stars have Veq Sin į exceeding 100 km s~! (60 mi s7’. 


Also, in these peculiar stars rotation declines with 
age, but the relationship is neither well delineated 
nor understood. The opposite is also true: very few 
normal A stars have low veq sin 7 values. 

Late-type stars. Late-type stars are cooler than 
about 6500 K (11,200°F) and have spectral types 
of late-F, G, K, or M. The key property of late-type 
stars that determines much of their phenomenology 
is the presence of a convective envelope. In such 
stars, rotation (especially differential rotation) inter- 
acts with the convection to produce complex mo- 
tions in the electrically conductive plasma of the 
star. These circulatory patterns enable the star to re- 
generate a seed magnetic field, the so-called dynamo 
mechanism. The magnetic field can grip an ionized 
wind beyond the star’s surface, leading to gradual an- 
gular momentum loss, a process that is seen occur- 
ring on the Sun. More rapidly rotating stars produce 
stronger magnetic fields, as evidenced by spectro- 
scopic indicators of magnetic-related activity, such 
as emission in the cores of the Ca II H and K lines. 
There is, therefore, a strong correlation between ro- 
tation and activity, in the various forms that activity 
is observed (Ca II H and K, Ha, x-rays, and so forth). 
These activity indices are often observed to measure 
Py for stars because they vary more pronouncedly 
than the star’s light overall. See MAGNETISM; STELLAR 
MAGNETIC FIELD. 

In this scenario, late-type main-sequence stars spin 
down over their lifetimes, and that is observed to 
occur because steadily declining rotation rates can 
be seen among stars in clusters of increasing age. 
Moreover, stars that are born together but with 
different rotation rates will tend to end up with the 
same P,o, because the faster-rotating star will lose an- 
gular momentum more quickly. Young stars tend to 
rotate rapidly because they have not had time to lose 
the angular momentum with which they formed, 
while old low-mass stars rotate slowly. 

Young late-type stars are mostly observed in clus- 
ters because there are few of them elsewhere, and 
because clusters provide rich samples of the same 
age. The Pleiades, for example, is 100 million years 
old, meaning that its low-mass stars have only re- 
cently reached the zero-age main sequence (ZAMS). 
Among the solar-type stars of the Pleiades, an intrin- 
sic spread in rotation of more than a factor of 20 is 
seen at any one mass, with the most rapidly rotat- 
ing stars spinning at 100 times the solar rate. These 
ultrafast rotators are a minority constituent of the 
cluster (about 20% of the stars), but their presence 
along with many slowly rotating stars is difficult to 
account for in models of the angular momentum evo- 
lution of low-mass stars. It has been speculated that 
some of these young stars may experience a decou- 
pling of the radiative core from the convective en- 
velope, with the core continuing to spin at a high 
rate for an extended time. The «œ Persei cluster is 
also at the zero-age main sequence, with an age of 
50 million years, and its stars confirm that what is 
seen in the Pleiades is typical for young low-mass 
stars. By the time that stars become as old as the 
Hyades (age 600 million years), their rotation rates 
have largely converged and show little spread at any 


one mass, although there is a strong mass depen- 
dence. See HYADES; PLEIADES. 

The most rapidly rotating low-mass stars seen out- 
side clusters spin at about 10 times the solar rate, but 
the vast majority, being old like the Sun, rotate very 
slowly (at the solar level or below). Evolved late-type 
stars (giants and supergiants) rotate at very low rates, 
in part because they have swelled so much since leav- 
ing the main sequence. A few rare exceptions spin 
rapidly, probably because they acquired the orbital 
angular momentum of their companion when a bi- 
nary system coalesced. See GIANT STAR; SUPERGIANT 
STAR. 

Very low mass stars and brown dwarfs. Significant 
numbers of very low mass stars and brown dwarfs 
have been found and studied at high spectroscopic 
resolution. These objects are fully convective from 
their surfaces all the way to their cores, in contrast 
to the Sun and similar stars, which have a convective 
envelope. This change in structure corresponds to a 
change in the observed rotation in that very low mass 
stars and brown dwarfs appear not to lose angular 
momentum and so continue to spin rapidly. These 
objects are very small in size, so the rotation rates 
seen—15 to 60 km s7! (10 to 40 mi s~!)—imply high 
angular velocities of as much as 100 times that of the 
Sun. See BROWN DWARF. 

Presence of planets. There is no relationship be- 
tween the rotation of a star and the presence of 
planets around it. It is true that planets have been 
found primarily around old, slowly rotating stars like 
the Sun, but that is in part because such stars have 
the very sharp absorption lines needed to measure 
the reflex motion to the necessary high precision. 
Young solar-type stars have broader lines that pre- 
vent reaching the necessary precision, and they also 
exhibit effects related to their activity that add noise 
to the velocity measurements. See EXTRASOLAR PLAN- 
ETS. 

Solar rotational history. The rotational history of 
the Sun is not known in detail, but the broad fea- 
tures of that history can be delineated from studying 
stars of solar mass of many ages. The Sun formed 
from interstellar material, most probably in concert 
with many other stars at the same time. For about 
the first 10 million years of its life, it kept a disk of 
debris material around it, and this formed into plan- 
ets before about 50 million years was reached. At 
an age of 10 million years the Sun rotated at two or 
three times its present angular velocity, and it was 
well above the main sequence. The Sun reached the 
zero-age main sequence when it was about 50 mil- 
lion years old, at which point it could have rotated 
at a rate of 3 to 100 times the current solar rate, al- 
though it probably was at the low end of this range. 
Over its 4.5-billion-year main-sequence lifetime, the 
Sun has gradually lost angular momentum to get to 
the rate that is seen today. Helioseismological studies 
of the Sun’s interior show it to rotate as a solid body, 
so if it ever had a rapidly rotating core, it does no 
longer. In about 5 billion years the Sun will exhaust 
most of the hydrogen in its core and leave the main 
sequence to become a giant star. It will bloat to 20 
or more times its present size, and conservation of 


angular momentum will mean that its rotation rate 
will become extremely small. 

The Sun remains the standard of comparison for 
other stars. The rotation of the Sun was among 
Galileo’s first discoveries as he traced the motions 
of spots across the surface, and only a little later 
the different rates of rotation at different solar lati- 
tudes led to the fundamental notion that the Sun is 
a fluid, not a glowing solid. Angular momentum is 
central to the formation and evolution of stars, and 
work now taking place will lead to a fuller picture of 
what happens in stars and how the Sun fits into that 
picture. David Soderblom 
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Stem 


The organ of vascular plants that usually develops 
branches and bears leaves and flowers. On woody 
stems a branch that is the current season’s growth 
froma bud is called a twig. The stems of some species 
produce adventitious roots. See ROOT (BOTANY). 


General Characteristics 


While most stems are erect, such as aerial structures 
(Fig. 1a), some remain underground (Fig. 2a—)), 
others creep over or lie prostrate on the surface of 
the ground (Fig. 1c-e), and still others are so short 
and inconspicuous that the plants are said to be 
stemless, or acaulescent (Fig. 1b). When stems lie 
flattened immediately above but not on the ground, 
with tips curved upward, they are said to be decum- 
bent, as in juniper (Fig. 1c). If stems lie flat on the 
ground but do not root at the nodes (joints), the stem 





(a) 


Fig. 1. Kinds of stems. (a) Woody (plum). (b) Acaulescent 
(evening primrose). (c) Decumbent (juniper). (d) Procum- 
bent (purslane). (e) Repent (ground ivy). 
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Fig. 2. Specialized stems. (a) Rhizome (Solomon’s seal). 

(b) Rhizome (grass). (c) Moniliform rhizome (pepperroot). 
(a) Caudex (trillium). (e) Tuber (potato). (f) Corm (crocus). 
(g) Section of corm. (h) Tunicated bulb (onion). (i) Scaly bulb 
(lily). () Scape (English plantain). (k) Thorn (hawthorn). 

(i) Tendril (grape). (m) Cladode (asparagus). 


is called procumbent or prostrate, as in purslane 
(Fig. 1d). If a stem creeps along the ground, root- 
ing at the nodes, it is said to be repent or creeping, 
as in ground ivy (Fig. 1e). 

Shape and texture. Most stems are cylindrical and 
tapering (terete), appearing circular in cross section; 
others may be quadrangular, as in mints, or triangular, 
as in some sedges. 

Herbaceous stems (annuals and herbaceous peren- 
nials) die to the ground after blooming or at the end 
of the growing season. They usually contain little 
woody tissue. Woody stems (perennials) have con- 
siderable woody supporting tissue and live from year 
to year. A woody plant with no main stem or trunk, 
but usually with several stems developed from a com- 
mon base at or near the ground, is known as a shrub. 
Suffrutescent stems are intermediate between herba- 
ceous and shrubby and become partly woody and 
perennial at the base, as in teaberry. 

Specialized stems. The stems of some plants are 
highly modified in various ways (Fig. 2). An under- 
ground horizontal stem is a rhizome or rootstock. It 
may be thickened, as in Solomon’s seal; slender, as 
in Bermuda grass; or contracted at regular intervals, 
as in pepperroot, in which case it is a moniliform 
rhizome. A caudex is an upright perennial under- 
ground stem; it is much like a rhizome but grows 
vertically, as in trillium. The term caudex is applied 
also to the trunks of palm trees which grow in a sim- 
ilar manner but aboveground. Tubers are enlarged 
ends of rhizomes in which food accumulates, as in 
white potato. A corm is a short, erect, fleshy sub- 
terranean stem usually broader than high and often 
covered with dry membranous coats, as in crocus. A 


bulb may be regarded as a short, subterranean stem 
with many overlapping fleshy leaf bases or scales, as 
in onion. A scape is a leafless flowering stem aris- 
ing from the ground, as in plantain or dandelion. A 
thorn is a rigid, sharp-pointed modified branch of a 
woody plant, as in hawthorn. A tendril is a slender 
coiling structure capable of twining about a support 
to which the plant is then attached, as in grape. A 
cladophyll, or phylloclade, is a usually flattened stem 
resembling a leaf that arises in the axil of a minute 
leaf (scale), as in asparagus. In some instances the 
resemblance of a cladophyll to a leaf is noted only 
or mainly in its green color, as in certain species of 
cacti. See PLANT ORGANS. Nelle Ammons 


External Features 


A shoot or branch usually consists of a stem, or 
axis, and leafy appendages. Stems have several dis- 
tinguishing features. They arise either from the epi- 
cotyl of the embryo in a seed or from buds. The 
stem bears both leaves and buds at nodes, which 
are separated by leafless regions or internodes, and 
sometimes roots and flowers. A woody twig (Fig. 3) 
shows leaf scars and lenticels on its surface. Even- 
tually, secondary growth thickens the stem axis and 
obscures its division into nodes and internodes and 
the relation between buds and leaves. See LEAF. 

Nodes. The nodes are the regions of the primary 
stem where leaves and buds arise. The number of 
leaves at a node is usually specific for each plant 
species. In deciduous plants which are leafless dur- 
ing winter, the place of former attachment of a leaf 
is marked by the leaf scar. The scar is formed in part 
by the abscission zone formed at the base of the leaf 
petiole. Small bundle scars left by the broken ends of 
the vascular (conducting) tissues between the stem 
and leaf are seen within the leaf scar. Flower and fruit 
scars, marking the points of flower and fruit attach- 
ment, may also be visible. 

Internodes. The stem regions between nodes are 
called internodes. Internode length varies greatly 
among species, in different parts of the same stem, 
and under different growing conditions. One or 
many internodes may grow rapidly during stem elon- 
gation. Generally, an individual internode matures 
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Fig. 3. Winter woody twig (horse chestnut) showing apical 


dominance. (After E. W. Sinnott and K. S. Wilson, Botany: 
Principles and Problems, 5th ed., McGraw-Hill, 1955) 





Fig. 4. Portion of a cross section of a young stem of elder 
(Sambucus canadensis) showing a lenticel. (G. H. Conant, 
Triarch Products) 


from the base upward in dicotyledons and many 
monocotyledons. Conversely, internodes of some 
monocotyledons (grasses and sedges) and horse- 
tails Œquisetum) have prolonged growth from in- 
tercalary meristems located at the base of each inter- 
node. 

Lenticels. There are small, slightly raised or ridged 
regions of the stem surface that are composed of 
loosely arranged masses of cells in the bark (Fig. 4). 
Their intercellular spaces are continuous with those 
in the interior of the stem, therefore permitting gas 
exchange similar to the stomata that are present be- 
fore bark initiation. The size and shape of lenticels 
vary among species and with age they are absent in 
a few species. See PERIDERM. 

Bud scales. In most temperate, perennial plants, 
buds are protected by a series of modified leaves 
or bud scales. Wax or hairs on their surface further 
protect the apical meristem within the bud. The buds 
of many tropical trees are often protected by the 
bases of mature leaves or simply by the expanding, 
immature leaves. Such naked buds are also present 
in most herbaceous plants. 

Most temperate trees form a terminal bud at the 
end of a season’s growth. The cluster of scars that 
mark the position of the closely packed terminal bud 
scales also delimit the segment of stem produced dur- 
ing one growing season (Fig. 3). The age and yearly 
growth of a twig can be determined by observing 
the number of sets of these terminal bud scale scars 
and the distances between them. 

Bud types. Buds are partly developed shoots or un- 
differentiated masses of meristematic cells. They may 
be classified as terminal, lateral, and adventitious. See 
BUD. 

Terminal or apical buds. These are buds at the tips of 
branches, often the largest buds on the plant, which 
usually contribute to the growth in length of the stem 
bearing them. 

Lateral buds. These are buds located on the sides of 
the stem and are initiated within the terminal bud 
from superficial tissues at the nodes. They are further 
classified according to their position. Most plants 
have a bud in the axil of each leaf directly above the 
leaf (or leaf scar). Such axillary buds usually remain 


dormant for some time. If the axillary bud develops 
into a branch, it becomes organized into a new ter- 
minal bud which itself produces new lateral buds. In 
some species, additional buds are produced above 
or on either side of the axillary bud. Such accessory 
or supernumerary buds may eventually develop as 
flower buds, thorns, or vegetative branches like the 
axillary bud. 

Adventitious buds. These are buds which are not pro- 
duced within the terminal bud and lack a definite 
relation to leaf position. They arise from differenti- 
ated (mature) tissues of the root, stem, or leaf, or 
from wound callus. The root suckers of many trees 
(oak, sweet gum, aspen) and troublesome weeds 
arise from adventitious buds. Such buds are also pro- 
duced on leaf cuttings of African violet and begonia. 

Bud development. Most buds of a plant remain un- 
developed for indefinite periods and are known as 
dormant, latent, or potential buds in contrast to ac- 
tive or developing buds. 

In many trees and shrubs, especially those with 
large, overwintering buds, the whole year’s growth 
is laid down in rudimentary form in the bud dur- 
ing early summer. These buds expand by elongation 
of internodes the following spring. Some buds, es- 
pecially flower buds, may form in the spring of the 
same season during which they expand. In temper- 
ate trees most lateral buds remain dormant for the 
first season, and only grow into branches the follow- 
ing season because of the apical dominance of the 
terminal bud. Such delayed bud outgrowth is char- 
acteristic of proleptic branches. In contrast, many 
tropical trees have sylleptic branches, in which lat- 
eral buds grow out immediately without a rest pe- 
riod. See APICAL DOMINANCE; PLANT GROWTH. 

Branching. There are three major types of stem 
branching: dichotomous, monopodial, and sympo- 
dial. Dichotomy occurs by a division of the apical 
meristem to form two new axes. Such branching 
without axillary buds occurs in some lower vascu- 
lar plants and in a few angiosperms (some palms 
and cacti, and Strelitzia). If the terminal bud of an 
axis continues to grow and lateral buds grow out 
as branches, the branching is called monopodial. If 
the apical bud terminates growth in a flower or dies 
back and one or more axillary buds grow out, the 
branching is called sympodial. Often only one bud 
develops so that what appears to be a single axis is 
in fact composed of a series of lateral branches that 
are arranged in linear sequence. 

In many woody plants there is a distinction be- 
tween long and short shoots, which results from 
the degree of internodal elongation. Short shoots 
are commonly the site of reproduction, and the spur 
shoots of apple, because of their congested nodes, 
bear a tight cluster of leaves. 

Stem form. The large and conspicuous stems of 
trees and shrubs assume a wide variety of distinctive 
forms. 

Columnar stems are basically unbranched and 
form a terminal leaf cluster, as in palms, or lack ob- 
vious leaves, as in cacti. 

Branching stems have been classified either as ex- 
current, when there is a central trunk and a conical 
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Fig. 5. Tree architecture. The idealized models of tree architecture can account for most types of branching and growth in 
woody plants. (After F. Hallé and R. A. A. Oldeman, An essay on the architecture and dynamics of growth of tropical trees, 


Penerbit Universiti Malaya, 1975) 


leaf crown, as in firs and other conifers, or as decur- 
rent (or deliquescent), when the trunk quickly di- 
vides up into many separate axes so that the crown 
lacks a central trunk, as in elm. 

Another approach to the study of stem form is 
the concept of tree architecture which uses mod- 
els that can describe most living and fossil trees and 
shrubs (Fig. 5). Each architectural model is defined 
by the orientation, growth sequence, and branching 
pattern of the trunk and lateral branches of shoot. 
In at least some trees (Jerminalia), the observed 
branching pattern approaches the theoretical ideal 
for the optimum filling of space and exposure of leaf 
surfaces, important adaptive characteristics for stem 
form that have probably been evolutionarily selected 
for. See TREE. Jack B. Fisher 


Internal Features 


The stem is composed of the three fundamental tis- 
sue systems that are found also in all other plant 
organs: the dermal (skin) system, consisting of epi- 
dermis in young stems and periderm in older stems 
of many species; the vascular (conducting) system, 
consisting of xylem (water conduction) and phloem 


(food conduction); and the fundamental or ground 
tissue system, consisting of parenchyma and scle- 
renchyma tissues in which the vascular tissues are 
embedded. The arrangement of the vascular tissues 
varies in stems of different groups of plants, but fre- 
quently these tissues form a hollow cylinder enclos- 
ing a region of ground tissue called pith and sep- 
arated from the dermal tissue by another region of 
ground tissue called cortex. See CORTEX (PLANT); EPI- 
DERMIS (PLANT); PARENCHYMA; PHLOEM; PITH; SCLE- 
RENCHYMA; XYLEM. 

Primary and secondary tissues. Part of the growth 
of the stem results from the activity of the apical 
meristem located at the tip of the shoot. The deriva- 
tives of this meristem are the primary tissues: epi- 
dermis, primary vascular tissues, and the ground tis- 
sues of the cortex and pith. In many species, espe- 
cially those having woody stems, secondary tissues 
are added to the primary. These tissues are derived 
from the lateral meristems, oriented parallel with 
the sides of the stem: cork cambium (phellogen), 
which gives rise to the secondary protective tissue 
periderm, which consists of phellem (cork), phel- 
logen (cork cambium), and phelloderm (secondary 





cortex) and which replaces the epidermis and vas- 
cular cambium, which is inserted between the pri- 
mary xylem and pholem and forms secondary xylem 
(wood) and phloem (Fig. 6). See APICAL MERISTEM; 
LATERAL MERISTEM. 

Stele. The vascular tissues and the closely as- 
sociated ground tissues—pericycle (on the outer 
boundary of vascular region), interfascicular regions 
(medullary or pith rays), and frequently also the 
pith—may be treated as a unit called the stele. The 
variations in the arrangement of the vascular tissues 
serve as a basis for distinguishing the stelar types. The 
word stele means column and thus characterizes the 
system of vascular and associated ground tissues as a 
column. This column is enclosed within the cortex, 
which is not part of the stele. See PERICYCLE. 


Primary State 


The leafy stem of the plant is initiated as a shoot pri- 
mordium in the embryo. This primordium is called 
the plumule, or epicotyl, and is located above the 
insertion of the cotyledons (seed leaves) on the 
hypocotyl. The epicotyl has an apical meristem that 
gives rise to all subsequent primary parts of the leafy 
stems. In the primary growth of the stem, nodes 
and internodes are differentiated, and the leaves ap- 
pear in a characteristic arrangement (phyllotaxis). 
The cells derived from the apical meristem compose 
the meristematic precursors of the primary tissues 
of the stem and are called primary meristems (pro- 
toderm, ground meristem, procambium) [Fig. 6]. 
These meristems differentiate into the primary tis- 
sue systems. 

Shoot apex or apical meristem. This so-called grow- 
ing point consists of the meristematic initials and 
their derivatives. The organization of the apical 
meristem varies in different groups of plants. In 
the angiosperms the initials are usually arranged in 
two or more layers (tiers), frequently three in gym- 
nosperms, a large group of apical initials without a 
tunica and with some cell stratification; and in the 
ferns and still lower vascular plants the shoot tip may 
bear a single apical initial. The organization in the an- 
giosperms is referred to as that of tunica and corpus. 
The outermost tier or tiers of initials and their imme- 
diate derivatives constitute the tunica; the innermost 
tier and its immediate derivations compose the cor- 
pus. The tunica is a mantlelike region enclosing the 
corpus or core. The tunica increases only in surface 
by anticlinal (at right angles to the surface) divisions; 
the corpus shows volume growth, for its cells divide 
in various planes. All the initials and their first deriva- 
tives together are often designated as a promeristem. 
Below this promeristem are partially differentiated 
tissues, the primary meristems, named from the pe- 
riphery inward: the protoderm; the outer ground 
meristem; the procambium, or provascular tissue; 
and the inner ground meristem. As stated previously, 
the primary peristems differentiate into primary tis- 
sue systems: the protoderm into the epidermis, the 
ground meristem into tissues of cortex and pith, 
and the procambium into primary vascular tissues 


(Fig. 6). 
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Fig. 6. Diagrammatic representation of a woody, dicotyledonous stem in transverse and 
longitudinal sections. It illustrates the gradual transition (arrow) from primary growth 


(growth in length) to secondary growth (growth in diameter). 


The primary plant body is 


derived from the apical meristem; the secondary plant body is produced by the vascular 


cambium and the cork cambium. 


Differentiation. The complex of physiological and 
morphological changes that occur ina cell as it devel- 
ops from a meristematic to a mature state is called dif- 
ferentiation. Common evidences of differentiation of 
cells are vacuolation (water uptake), accumulation of 
ergastic substances (metabolic products), formation 
of plastids, and increase in thickness of the cell wall 
and its impregnation with new wall substances. In 
highly specialized cells the nucleus may break down 
(sieve elements in the phloem), or even the entire 
protoplasm may be lost (water-conducting cells in 
the xylem). See PLANT CELL. 

Primary tissue systems. As seen in the stem as a 
whole, the primary tissues form integrated systems 
and are referred to as the primary tissue systems: epi- 
dermis (or epidermal system), ground tissue system, 
and primary vascular system. 

Epidermis. The epidermal system differentiates from 
the protoderm. The epidermis is usually composed 
of a single layer of cells. Occasionally, the protoderm 
divides periclinally (parallel to the surface) and pro- 
duces a multiple epidermis. Epidermal cells may be 
variously modified into guard cells with stomata or 
the diverse epidermal appendages called hairs, or 
trichomes. The walls of the epidermis are cutinized, 
that is, impregnated with a waxy substance, and the 
outer walls are covered with a continuous layer of the 
same substance, the cuticle. See EPIDERMIS (PLANT). 
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Cortex. Between the epidermis and the vascular sys- 
tem is the cortex, formed by the outer ground tissue. 
However, there is no distinguishable cortex in plants 
in which the vascular bundles have a scattered ar- 
rangement, as in corn. The outer layers of the cor- 
tex frequently form collenchyma, a mechanical, or 
strengthening, tissue. The cortex may also include 
chlorenchyma (tissue containing chlorophyll. Still 
other cells may develop as sclereids, fibers, or lati- 
cifers. Secretory canals may be present. In water 
plants the cortical parenchyma may contain large 
air spaces and is then called aerenchyma. The out- 
ermost layers of the cortex are sometimes cutinized, 
like the epidermis, and are referred to as the hypo- 
dermis. Cortical parenchymal cells contain various 
inclusions such as starch, tannins, and crystals. See 
COLLENCHYMA; CORTEX (PLANT). 

Endodermis. The innermost layer of the cortex (or 
possibly, in the lower vascular plants, the outermost 
layer of the stele) forms a histochemically distinct 
and sometimes a morphologically specialized layer, 
the endodermis. If an endodermis is morphologically 
distinct, it forms a compact layer and the anticli- 
nal walls of its cells bear a suberized and lignified 
band, or Casparian strip. The endodermis is found 
in the stems and roots of lower vascular plants, the 
rhizomes and roots of seed plants, and a few herba- 
ceous stems of angiosperms. It is not differentiated 
in the aerial stems of gymnosperms and most an- 
giosperms. In young stems of angiosperms a homol- 
ogous layer contains abundant starch and is called 
the starch sheath. See ENDODERMIS. 

Pericycle. If a distinct layer or layers of cells are 
present between the ground-tissue endodermis and 
the vascular tissue, the tissue constitutes the pericy- 
cle. This tissue is interpreted as part of the ground 
tissue of the stele. The pericycle may consist of 
parenchyma or sclerenchyma. It may be seen in 
the stems and roots of lower vascular plants, but 
is usually absent in the stems of gymnosperms and 
angiosperms. The fibers located on the periphery 
of the vascular systems in many dicotyledons are 
often called pericycle fibers. In many species, how- 
ever, these fibers originate in the phloem. See PERI- 
CYCLE. 

Pith. The ground tissue enclosed by the vascular 
tissues is the medulla or pith. In stems with no cor- 
tex the pith is not distinguishable. The vascular bun- 
dles may be scattered throughout the central ground 
tissue, as in some monocotyledons, or there may be 
strands of internal phloem or complete vascular bun- 
dles located within the pith, as in some dicotyledons. 
The pith may consist of parenchyma or may become 
sclerotic chard-walled), especially at the nodes. The 
outer zone of the pith often consists of small, per- 
sistent cells with more or less sclerified walls. Many 
herbaceous plants have hollow stems (internodes) 
because the growth of the pith fails to keep pace 
with stem expansion. Some woody plants, such as 
the walnut and Liriodendron, have a diaphragmed 
pith, that is, pith with thick-walled parenchyma cells 
and sclereids arranged in disks, or diaphragms, of 
firm tissue. 


Primary vascular system. In the ferns and the seed 
plants the primary vascular system, derived from the 
procambium, is composed of anastomosing (inter- 
connected) vascular bundles. At each node some of 
the vascular bundles diverge into the leaves and ax- 
illary branches. A bundle in the stem which leads 
directly into a leaf is called a leaf trace; one lead- 
ing into a branch is a branch trace. One or more 
leaf traces may be associated with each leaf. Where 
the leaf trace diverges from the stem into a leaf at 
a node, the vascular cylinder commonly has a more 
or less circumscribed region of parenchyma. This 
region is called a leaf gap or lacuna. The lateral 
branches of most dicotyledons and gymnosperms 
have two branch traces. Some species have one trace 
per branch, others more than two. In monocotyle- 
dons, for example, the axillary shoot connection con- 
sists of many vascular strands. At the node the branch 
traces may have a single branch gap which, more- 
over, is commonly confluent with the leaf gap. 

Separate parts of the primary vascular system of 
the stem constitute vascular strands or vascular bun- 
dles. The phloem and xylem within a vascular bundle 
may be arranged in various ways. The most common 
type in gymosperms and angiosperms is the collat- 
eral bundle in which the phloem occurs only on one 
side of the xylem. If phloem occurs on both sides 
of the bundle, as in the Cucurbitaceae, the bundle 
is bicollateral. If one kind of vascular tissue com- 
pletely surrounds the other, the bundles are concen- 
tric. The concentric bundle is amphicribral if the 
phloem surrounds the xylem, as in ferns, or am- 
phivasal if the xylem surrounds the phloem, as in 
some monocotyledons and dicotyledons. 

The primary vascular system with associated fun- 
damental tissue is termed the stele. The simplest type 
of stele, and perhaps also the most primitive, is the 
protostele, in which a solid central core of xylem 
is surrounded by phloem, as found generally in the 
roots, in the stems of lower vascular plants such as 
the fern Gleichenia, and the stems of some water 
plants (Fig. 7a). In the plant group Lycopsida the 
lycopods or “ground pines” have a modified proto- 
stele with alternating bands of xylem and phloem 
(Fig. 7b), and the “club mosses” (Selaginella) have 
separate protosteles supported in air spaces by ra- 
dially elongated endodermal cells designated trabec- 
ulae. In the stems of vascular plants located higher 
than Lycopsida on the evolutionary scale, the center 
of the stele is occupied by a pith, and the vascu- 
lar system has a tubular structure. The stele, there- 
fore, is called a siphonostele (tubular stele). If the 
phloem occurs only on the outer side of the xylem of 
a siphonostele (gymnosperms and angiosperms), the 
stele is called an ectophloic siphonostele (Fig. 7c). If 
the phloem differentiates also on the inner side of 
the xylem (in such ferns as Adiantum, Dicksonia, 
and Marsilia), the stele is designated an amphiphloic 
siphonostele or a solenostele (Fig. 7d). In the ferns, 
in which the stem is short and the leaves are set 
close together, the leaf gaps overlap, and the tubular 
stele is dissected into a network of distinct strands. 
Such a stele is called dictyostele, as in Polypodium 





Fig. 7. Types of steles as seen in photomicrographs of cross sections of stems. (a) Protostele (Gleichenia). (b) Radial 
(modified) protostele (Lycopodium). (c) Ectophloic siphonostele (Tilia). (d) Amphiphloic siphonostele (Dicksonia). 


(e) Dictyostele (Polypodium). 


and Dryopteris (Fig. 7e). Another modification of the 
siphonostele is the eustele, in which the vascular sys- 
tem consists of collateral or bicollateral strands, and 
both interfascicular regions and leaf gaps are pres- 
ent. Eustele is characteristic of gymnosperms and an- 
giosperms. The most complex stele, the atactostele, 
has a dispersed arrangement of strands, as in many 
monocotyledons. 

Primary development. The origin of the stem from 
the apical meristem is closely associated with the 
development of leaves. At initiation the leaf pri- 
mordium is merely a small protuberance on the flank 
of the shoot tip and cannot be delimited from the 
stem part to which it is attached. Only later, as growth 
continues, does the primordium become an organ 
distinguishable externally and internally from the 
stem. The stem itself shows at first no division into 
nodes and internodes. It appears like a complex of 
superimposed disks (future nodes), each bearing one 
or more leaf primordia, depending on leaf arrange- 
ment. Later, during elongation, the disklike zones 
become separated from one another by cell division 
and enlargement between the levels at which the leaf 
primordia are inserted. Thus, the internodal elonga- 
tion is initiated, and the stem becomes differentiated 
into nodes and internodes. The elongation of intern- 
odes is the phenomenon that brings about the char- 


acteristic rapid extension of new shoots during the 
spring growth of trees and shrubs. 

Internally, the differentiation of stem tissues is 
more or less closely correlated with the differenti- 
ation of leaves. In lower vascular plants with proto- 
steles in their stems, as the lycopods, the vascular 
system differentiates beneath the apical meristem as 
a column of provascular tissue (procambium), and 
the leaf traces, which connect the small leaves with 
this column, differentiate later through the cortex. In 
its differentiation the provascular tissue becomes dis- 
tinct from the surrounding ground tissue of the cor- 
tex because the cells of the latter show an increase 
in vacuolation and an overall enlargement, whereas 
the cells of the provascular tissue retain their dense 
cytoplasm and they increase in length but remain 
narrow. 

In the ferns and seed plants, in which the leaves 
are more prominent parts of the shoot than in the 
lycopods and the vascular system of the stem con- 
sists of a cylinder of strands (siphonostele structure) 
connected with the leaves at regular intervals (at 
the nodes), the vascular system is initiated in rela- 
tion to the leaves. Vacuolation of the differentiating 
cortex and pith delimits the future vascular region 
as a still highly meristematic cylinder of cells. Then 
within this cylinder, procambial strands differentiate 
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beneath the emerging leaf primordia chiefly by lon- 
gitudinal cell division and cell elongation; that is, the 
leaf traces are initiated. The cells between the bun- 
dles that do not become procambial cells differenti- 
ate as parenchyma cells of the interfascicular regions 
and the leaf gaps. If followed downward in the stem, 
the young leaf traces will be found joined with one 
another and with older traces. Thus, the procambial 
strands form an interconnected system similar to that 
of the mature primary vascular system, except that 
all its parts are still short. The procambial system 
elongates in coordination with the other tissues of 
the internodes by further division and elongation of 
cells. Eventually, primary phloem and primary xylem 
differentiate in the procambial strands, the pholem 
in continuity with that of the older parts of the stem 
(acropetal differentiation, that is, from the base of the 
stem toward its apex), the xylem first appearing at 
the bases of leaves then differentiating in two direc- 
tions: downward (basipetally) toward a connection 
with the xylem of older stem regions and upward 
in the leaves. See LYCOPODIALES; MAGNOLIOPHYTA; 
POLYPODIALES. 

Differentiation of lateral buds. The lateral or axil- 
lary buds arise more or less close to the apical meri- 
stem of the terminal bud, depending on the type of 
branching characteristic of the plant. The buds are 
initiated by cell division in the stem to produce a 
lens-shaped group of cells in the outer cortex near 
the axils of subtending leaves. The divisions occur 
in such a manner that eventually an apical meristem 
of the bud is organized. The bud meristem may de- 
velop immediately into a side shoot by producing 
leaves and increments of stem, or it may interrupt 
growth after variable degrees of initial development. 
The lateral bud may also develop into an inflores- 
cence, or flower, by appropriate changes in growth 
pattern as compared with that of a vegetative shoot. 
The bud and parent stem establish a vascular con- 
nection through the bud traces, the differentiation 
of which is more or less similar to that of the leaf 
traces of the same plant. 


Secondary State 


The stems of gymnosperms, most dicotyledons, and 
some monocotyledons show an increase in thick- 
ness by secondary growth. A lateral meristem, the 
vascular cambium, produces secondary vascular tis- 
sues which constitute the secondary body. Also, a 
cork cambium, or phellogen, produces a secondary 
protective tissue called periderm, a tissue which re- 
places the primary protective tissue, the epidermis. 

The vascular cambium originates from the pro- 
cambial and certain parenchyma cells of the primary 
body. Procambial cells that remain meristematic after 
the stem completes its elongation and the primary 
vascular tissues become mature proceed to divide 
periclinally (parallel with the surface). These cells 
are designated as cambial initials. In addition, per- 
iclinal divisions are initiated by parenchymal cells 
between the vascular bundles; these cells also are 
cambial initials. Collectively, all these initials con- 
stitute the vascular cambium. The part of cambium 
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Fig. 8. Secondary tissue in the stem of sunflower 
(Helianthus annuus). (a) Transverse section of the entire 
stem showing the thicker-walled, dark, secondary xylem 
cells as they are added to the primary, more in the 
fascicular than in the interfascicular areas. (b) Enlarged 
view of a portion of a. (From J. B. Hill, L. O. Overholts, and 
H. W. Popp, Botany: A Textbook for Colleges, 2d ed., 
McGraw-Hill, 1950) 


originating within the vascular bundles is the fasci- 
cular cambium, that arising in the parenchyma be- 
tween vascular bundles, the interfascicular cambium 
(Fig. 8). 

The arrangement of the secondary vascular tissues 
produced by the cambium varies in different plants. 
This tissue may appear as (1) a continuous cylin- 
der; (2) individual strands, with secondary activity 
limited to the bundles (no interfascicular cambium 
is formed); (3) individual strands, with secondary 
vascular tissues produced in the fascicular regions 
and secondary ray parenchyma in the interfascicu- 
lar regions; or (4) anomalous (atypical) arrangements 
characterized by uneven growth of xylem or phloem 
or by formation of successive cambia, one farther 
outward than the preceding. 

The cork cambium or phellogen originates in 
the epidermis or in the cortex from parenchyma 
or collenchyma cells. It produces phellem or cork 


large wood ray annual ring 


spring vessels 





Fig. 9. Diagram of a segment of an oak log showing the 
grain in three dimensions, that is, the pattern formed by the 
annual rings and the wood rays. At the right, the block has 
been cut radially, and above, transversely. At the left (the 
surface of the log), a portion of the phloem and bark has 
been removed, showing a tangential view of the wood 
underneath. (After E. W. Sinnott and K. S. Wilson, Botany: 
Principles and Problems, 5th ed., McGraw-Hill, 1955) 


outwardly and phelloderm or secondary cortex in- 
wardly. 

Woody dicotyledons. The cambium here is a contin- 
uous cylinder. The fascicular cambium produces sec- 
ondary xylem and phloem; the interfascicular cam- 
bium forms either the same kind of tissue as the fas- 
cicular cambium or only vascular ray tissue. 

As seen in cross sections, the secondary xylem, or 
wood, is formed in distinct concentric rings called 
annual, or growth, rings (Fig. 9). Each annual ring 
is composed of an inner portion called the spring, 
or early, wood, in which the vessels and tracheids 
are relatively large, and an outer portion called the 
summer, or late, wood, in which the cells have much 
smaller diameters and thicker walls than those of the 
early wood. Usually only the outermost growth rings 
function in conduction of water. The outer, light- 
colored, relatively soft, functioning part of the wood 
is called sapwood. The nonconducting older (early 
formed) wood, called heartwood, is often filled with 
gums, resins, tannins, and mineral salts, and is dark in 
color. Growth rings in oblique or horizontal branches 
become asymmetrical, the rings being wider in the 
upper half Gin most angiosperms) or in the lower half 
(in conifers). The wood with the wider rings has a 
different histologic and chemical structure from that 
with the narrower and is called tension wood in the 
angiosperms and compression wood in the conifers. 

The wood varies in structure, chiefly depend- 
ing on (1) the relative amounts of vessels, tra- 
cheids, and wood fibers, (2) the distribution of wood 
parenchyma, and (3) the presence and character of 
vascular rays. 

Two patterns of vessel arrangement are recog- 
nized. If the early wood contains wide vessels and 
the late wood narrow vessels, so that each growth 
ring begins with a zone of large vessels, the wood is 
called ring-porous (Fig. 10a). If vessels of more or 
less uniform widths are formed throughout the year, 
the wood is called diffuse-porous (Fig. 100). 

Wood texture, coarse or fine, refers to the size and 
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number of the xylem cells, especially vessels. The 
term grain refers to the arrangement of the cells, 
especially fibers, making up the wood. A wood is 
hard or soft, depending upon the number of lignified 
fibers. See WOOD ANATOMY. 

Among the many wood products, plywood and ve- 
neer are particularly important. In making plywood, 
the wood is softened and the log rotated against a 
heavy blade to produce a continuous sheet of very 
thin wood which is cut to suitable lengths; pieces 
alternately stacked with grain at right angles are 
bonded together in varying odd numbers of layers, 
pressed, and dried. Plywood, or other wood bases, 
may be covered by thin sheets of fine-grained wood 
or veneer. Characteristic and interesting grain pat- 
terns for fine woods or veneers are obtained by 
cutting wood in transverse, radial, and tangential sec- 
tion (Fig. 11). Veneer wood with the most interest- 
ing pattern or grain comes from the base of a tree, 





Fig. 10. Transverse sections, magnified, of two kinds of dicotyledonous wood. (a) White 
oak, ring-porous wood with large spring vessels and narrow (uniseriate) and wide 
(multiseriate) rays. (b) Yellow birch, a diffuse-porous wood with narrow (mostly uniseriate) 
rays. (U. S. Forest Products Laboratory) 





Fig. 11. Silver maple (Acer saccharinum) wood in different 
sections. Dark and light colors and grain irregularities result 
from cutting through fibers and vessels of annual rings in 
different planes. Also, the rays, particularly in radial section, 
give the wood a pleasing and interesting appearance. 

(a) Transverse section. (b) Radial section. (c) Tangential 
section. (From J. B. Hill, L. O. Overholts, and H. W. Popp, 
Botany: A Textbook for Colleges, 2d ed., McGraw-Hill, 1950) 
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Fig. 12. Block of white-pine wood, in three dimensions, 
showing relationships of tracheids and ray elements. (After 
J. B. Hill, L. O. Overholts, and H. W. Popp, Botany: A 
Textbook for Colleges, 2d ed., McGraw-Hill, 1950) 


from burls, or from crotch wood, where the vascu- 
lar arrangements may be wavy or spiral. Some of the 
choice curly grains, such as curly maple, result from 
the development of adventitious buds. Injury to the 
cambium may produce a similar pattern. See WOOD 
PRODUCTS. 

Gymnosperms. Most gymnosperms are similar to 
the woody dicotyledons in that a vascular cambium 
forms a continuous cylinder of secondary xylem and 
phloem. Commercial gymnosperm wood is usually 
derived from conifers. Conifer wood is composed of 
tracheids, fiber tracheids, and uniseriate wood rays, 
but no vessels (Fig. 12). Because of the absence of 
vessels, the conifer wood is more homogeneous in 
sections than the angiosperm wood (Fig. 13). 

As stems grow and form new branches, those near 
the base are likely to be crowded, shaded, and die, 
particularly in pine. The dead branches fall off, and 
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their bases are covered by successive layers of wood, 
forming knots. Knots weaken the wood and are not 
desirable unless for ornamental paneling. 

The secondary growth of other gymnosperms is 
similar to that of the conifers. The cycads are inter- 
esting because growth rings are added at intervals 
of 2-20 years and are difficult to distinguish, except 
in Dioon. Many of the cycads form only one ring of 
wood; but others, such as Cycas, Dioon, Macroza- 
mia, and Encephalartos, produce several rings. 

Monocotyledons. Some monocotyledons, as the 
bamboos and palms, may form rather woody trunks 
by primary thickening. The thickening growth may 
occur rather generally throughout the stem, or there 
may be a primary thickening meristem, a cambium- 
like zone of cells originating from the ground tissue 
beneath young leaf bases. It increases the width of 
the stem directly beneath the apex. Additional divi- 
sion of cells throughout the stem and the enlarge- 
ment of cells complete the growth in thickness. In 
certain monocotyledons, such as Aloe, Yucca, Agave, 
and Dracaena, secondary growth also occurs. How- 
ever, the cambium does not originate in the bundles, 
which are scattered in the ground tissue. It appears in 
the ground parenchyma outside the vascular bundles 
and by tangential divisions forms parenchyma on the 
outside and both vascular strands and parenchyma 
on the inside. The basic structure of the primary and 
secondary bodies is the same—in both, discrete vas- 
cular strands are embedded in ground tissue, usually 
sclerified (hardened) parenchyma. 

Effect of secondary growth. The secondary xylem 
covers the primary xylem and pith, usually without 
changing them; whereas the primary phloem and 
cortex are pushed outward and are more or less com- 
pressed by the centrifugal growth of the secondary 
vascular tissues. The epidermis and cortex may keep 
pace with secondary vascular growth, or they may 
be ruptured and replaced by periderm. 

Cambium develops in the parenchyma of the leaf 
gap, and its derivatives close the leaf gaps and break 
and bury the leaf traces. 





Fig. 13. Enlarged photomicrographs of pine wood. (a) Transverse section. The wood cells are cut across at right angles to 
their length. Note the thin-walled cells (tracheids) in the springwood and the thick-walled cells in the summerwood (fiber 
tracheids). The uniseriate wood rays run at right angles to the annual rings. The large openings are resin canals. (b) Radial 
section. (c) Tangential section. In the last two sections the tracheids are cut lengthwise. In the radial section the rays run ina 
horizontal direction across the tracheids. In the tangential section the rays appear as lens-shaped clusters of cells. (U. S. 


Forest Products Laboratory) 





Fig. 14. Cross section of 4-year-old pine stem, showing 
annual xylem rings and other tissues. (From J. B. Hill, L. O. 
Overholts, and H. W. Popp, Botany: A Textbook for 
Colleges, 2d ed., McGraw-Hill, 1950) 


Types 


The stems of vascular plants, or Tracheophyta, show 
an endless variety of form and structure. In this arti- 
cle only a few are considered as type examples. 
Gymnosperm stems. This group of Tracheophyta 
contains some of the largest plants in the world (es- 
pecially in Pinales), as well as Welwitschia, in which 
there is no aerial stem at all. No annual or perennial 
herbs are recognized in this group. See PINOPSIDA. 
Pinales. The gymnosperms of this order are much- 
branched, small-leaved, woody tree forms, except 
some shrubby junipers. The stem has typically an ec- 
tophloic siphonostele. Most conifers produce a con- 
spicuous amount of secondary tissues which form 
a solid continuous cylinder (Fig. 14). Resin ducts 
occur in the cortex and the vascular tissues, except 
in the yews. The wood is dense and massive, com- 
posed of tracheids, fiber tracheids, and uniseriate 
wood rays. See SECRETORY STRUCTURES (PLANT). 
Cycadales. The stem types range from tuberous and 
partly or wholly subterranean to relatively tall, aerial 
stems. The stem is usually unbranched and bears a 
crown of large leaves. The cortex and pith are large, 
loose, and parenchymatous. The vascular cylinder is 
narrow and has broad medullary rays and a loose- 


textured wood. The stele is a siphonostele. The pri- 
mary xylem usually contains scalariform tracheids, 
the secondary xylem tracheids with bordered pits. 
Leaf traces girdle the stem that is, they have a nearly 
horizontal course in the cortex. The stem is rigid 
mainly because of persistent leaf bases. 

Ginkgoales. The only living representative, Ginkgo 
biloba, is a large profusely branched tree with both 
long shoots (normal vegetative) and short shoots 
(restricted vegetative or reproductive). Secondary 
growth is vigorous, and the pith and cortex are rela- 
tively small. 

Gnetales. The three genera included in this order 
are highly specialized. Ephedra is usually a much 
branched shrub, Gnetum species are mostly lianas, 
and in Welwitschia most of the stem is buried in the 
ground. A feature which distinguishes these plants 
from other gymnosperms is the presence of vessels 
among the xylem elements. 

Angiosperms. The angiosperms (flowering plants) 
include two main groups: the dicotyledons (plants 
with two seed leaves) and the monocotyledons 
(plants with one seed leaf). The stems of these two 
groups of plants show some differences. It is cus- 
tomary also to divide the stems of angiosperms into 
woody and herbaceous types. Although the vari- 
ous stem types intergrade in their character, some 
approximate generalizations regarding their differ- 
ences may be made, as shown in the table. 





Fig. 15. Woody dicotyledon; 1-year-old woody stem of 
Liriodendron, showing characteristics of ectophloic 
siphonostele; epidermis not yet replaced by periderm. 





Comparison of dicotyledon and monocotyledon stems 





Dicotyledons 


Monocotyledons 





Stems woody, herbaceous, vines 

Secondary growth common; phloem, vascular cambium, and 
xylem in concentric layers; separate vascular bundles 
common in the primary body, uncommon in secondary 


Distinct pith and cortex 

Endodermis and pericycle usually absent; sometimes 
pericycle fibers on outer limit of vascular cylinder; usually 
phloem fibers 

Number of leaf traces to each leaf is variable, rarely large 


Periderm frequently present 





Stems herbaceous, few woody 

Some woody types with secondary growth; vascular 
cambium not between xylem and phloem; in the primary 
body vascular bundles often scattered through ground 
tissue, sometimes in two circles; in the secondary body 
also separate vascular bundles 

No definable cortex and pith 

Endodermis and pericycle usually absent; vascular 
bundles with sheaths of sclerenchyma 


Number of leaf traces to each leaf is large; sheathing leaf 
bases enclose the stem 
Periderm not common 
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Fig. 16. Transverse sections of woody vines. (a) Clematis. (b) Serjania mexicana; mature stem shows irregular development 
(unequal amounts) of secondary xylem and phloem. (c) Serjania subdentata; mature stem shows five well-developed cortical 


steles. 


Woody dicotyledons. The primary vascular cylinder in 
the woody dicotyledons is an ectophloic siphono- 
stele, and the pith and cortex are well defined. Liri- 
odendron illustrates this type of structure (Fig. 15). 
The stem from the inside out consists of (1) a central 
pith of large, loose, parenchyma cells; (2) a vascular 
system containing closely arranged vascular bundles, 
with the leaf traces often recognizable as rather large 
bundles protruding into the cortex; (3) a cortex; and 
(4) an epidermis. When secondary growth occurs, a 
continuous vascular cambium and continuous layers 
of secondary xylem and phloem are formed. The epi- 
dermis is eventually replaced by a periderm. There 
is no endodermis and no pericycle. Primary phloem 
fibers form the outer limit of the vascular region. 

Woody vines or lianas. Vines are generally character- 
ized by a primary and secondary vascular system 
dissected into strands by broad rays (Fig. 16a). 





Fig. 17. The primary tissues of the stem of a typical herbaceous dicotyledonous plant, 
sunflower (Helianthus annuus), showing the characteristics of an ectophloic 
siphonostele. (a) Transverse section of entire stem showing the arrangement of vascular 
tissues in one circle. (b) Enlarged portion of a showing details of the structure, including 
one vascular bundle. (From J. B. Hill, L. O. Overholts, and H. W. Popp, Botany: A Textbook 
for Colleges, 2d ed., McGraw-Hill, 1950) 


Also, they often have a ringbark which results from 
concentric successively formed periderms. Lenticels 
may be absent in ringbark. 

Tropical lianas (climbing woody vines) often have 
a more or less unusual structure. There may be, 
for example, a highly irregular development of sec- 
ondary xylem and phloem, so that the outline of the 
xylem is uneven (Fig. 16b). A still more striking struc- 
ture results when the leaf traces develop complete 
cylinders of cambium, xylem, and phloem at levels 
where they have diverged from the central cylinder. 
The stem appears as though composed of several 
central cylinders (Fig. 16c). 

Herbaceous dicotyledons. These forms differ from the 
arborescent, or woody, dicotyledons chiefly in hav- 
ing a smaller amount of cambial activity or none. 
They are often similar to 1-year-old woody stems. 
The primary vascular cylinder forms an ectophloic 
siphonostele with the vascular strands often more 
widely separated by interfascicular regions than 
those in the arborescent species (Fig. 17). Vascular 
cambium may be absent, as in Ranunculus, may be 
present but form only parenchyma and sclerenchyma 
in the interfascicular regions, as in Medicago, or may 
form a continuous vascular cylinder, as in Vinca or 
Digitalis. 

Herbaceous monocotyledons. In these plants the vascu- 
lar system is composed of widely spaced strands ar- 
ranged in one of four ways. First, as in most grasses, 
the vascular bundles are arranged in two circles, with 
the outer smaller bundles embedded in a continu- 
ous sheath of sclerenchyma close to the epidermis 
(Fig. 18a). The vascular bundles are collateral, each 
enclosed in a sheath of sclerenchyma. The pith may 
break down in the internodes but not in the nodes. 
Transverse bundles in the nodal region interconnect 
the leaf traces. 

Second, in a few monocotyledons, for example 
Clintonia (Fig. 18b), a single, concentrically ar- 
ranged series of bundles may occur. Each bundle has 
its own complete endodermis. 

Third, in a few monocotyledons the bundles are 
grouped in the center of the stem, as in the rhizome 
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Fig. 18. Photomicrographs of transverse sections of monocotyledonous stems showing various arrangements of vascular 
bundles. (a) Wheat (grass). (b) Clintonia. (c) Corn. (d) Bamboo. (From J. B. Hill, L. O. Overholts, and H. W. Popp, Botany: A 


Textbook for Colleges, 2d ed., McGraw-Hill, 1950) 


of Acorus calamus. The bundles are amphivasal, the 
phloem being surrounded by xylem. 

Fourth and most commonly, vascular bundles are 
numerous and scattered in a ground tissue, as in corn 
or bamboo (Fig. 18c and d), with no pith or cortical 
regions being evident. The complex arrangement of 
the vascular bundles is related to the large number 
and variation in size of leaf traces. There may or may 
not be a sclerenchyma cylinder about each bundle, 
but subepidermal parenchyma is strongly sclerified. 
Endodermis and pericycle are absent. 

Woody monocotyledons. Some monocotyledons, as the 
bamboos and palms, may form rather woody trunks 
by thickening of cell walls with age until nearly the 
whole ground tissue is heavily sclerified. Despite 
their large size these trunks are composed of primary 
tissues only. There are, however, woody monocotyle- 
dons with secondary growth, but the secondary tis- 
sues resemble the primary tissues in that they are 
composed of vascular bundles embedded in ground 
tissue, usually sclerified parenchyma. 

Specialized erect stems. Two kinds of erect stems 
have special names. Culm is a name applied to 
hollow, but solid-jointed, stems of grasses, woody 
and herbaceous. Caudex means the axis of a plant, 
consisting of root and stem. The term is sometimes 
applied to short, enduring stems or stocks which 
throw up new stalks each year from persistent buds 
at ground level. Also, the name is used to describe 
the trunks of palms and tree ferns which consist of 
persistent leaf bases forming a column. 

Perennial stems. Perennials which have woody 
stems are generally called trees or shrubs. Shrubs are 
woody perennials with several main stems branched 
from or near the ground and generally not more than 
20-25 ft (6.0-7.5 m) high. Trees are woody perenni- 
als with a single main trunk or axis which rises some 
distance above the ground before branching. Trees 
generally exceed shrubs in height. The two words are 
convenient but not exact terms because trees and 
shrubs intergrade in height and form. Herbaceous 
plants also may be perennial. Their stems commonly 
die down to the ground each year. 

Modified aerial stems. Stems may be variously mod- 
ified for photosynthesis, storage, propagation, sup- 
port, and protection. 


Cladode. This term is applied to a branch with a 
single internode which is flattened and serves as a 
leaf, as is found in asparagus. 

Phylloclade. A flattened leaflike shoot that serves to 
replace a leaf as a photosynthetic organ, as found in 
the Opuntia cactus, is called a phylloclade. 

Thorn. This hard, pointed projection may be a mod- 
ified branch, or leaf, or stipule. Thorns may be un- 
branched, as in the osage orange, or branched, as in 
the honey locust. A lateral branch may form leaves 
and flowers, then terminate growth by producing a 
thorn at the apex. 

Creeping or prostrate stems. Stems that trail along 
the surface of the ground and that may take root at 
the nodes are called creeping or prostrate stems. 

Runner. A runner, as in the strawberry, is a hori- 
zontally growing, sympodial stem system; that is, it 
appears single but actually is composed of a series of 
lateral branches arranged in lineal order. In this sys- 
tem the stem forms adventitious roots near the tip, 
then gives rise to a rosette of leaves from a bud. A 
new runner emerges from the axil of a reduced leaf 
at the base of this rosette. 

Stolon. The term stolon is used for creeping stems, 
shorter than runners. They also root adventitiously 
(raspberry, currant). The term has no precise mean- 
ing. 

Climbing stems or vines. These are long, slender 
stems which usually climb by special devices. 

Rambler. The ramblers rest on the tops of other 
plants and some, as certain roses, have spines or 
prickles which help them to adhere to their support. 

Root climbers. English ivy and posion ivy have stems 
which climb by means of adventitious roots. 

Tendril climbers. These plants climb by means of mod- 
ified leaf tendrils, as in the garden pea, or by stem 
tendrils, as in the grape. 

Twiners. In the twiners the whole stem winds about 
its support, as pole beans or many tropical lianas. 

Underground stems. Some stems grow under- 
ground and are often mistaken for roots. The main 
kinds of underground stems are rhizomes or root- 
stocks, tubers, corms, bulbs, and rhizomorphic drop- 
pers. 

Rhizomes are usually quite varied, plagiotropic 
(growing horizontally), perennial, underground 
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stems found in a vast number of plants. Their cauline 
(stem) nature is evident by well-defined nodes and 
internodes. Roots, scale leaves, and axillary buds 
form at the nodes. Some buds form leafy, upright 
shoots while others form new, sympodially branched 
underground shoots. Among different species, rhi- 
zomes may be (1) thin, tough, and rapidly growing; 
(2) fleshy though rapidly growing; and (3) short and 
fleshy, serving more for storage than for spreading. 
The anatomy of rhizomes, except for some dorsiven- 
trality, is much like that of the aerial stem of the same 
species. The other types of underground stems, such 
as tubers, corms, and bulbs, are essentially modified 
rhizomes. James E. Gunckel 

Bibliography. J. D. Mauseth, Plant Anatomy, 1988. 
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Cells that have the ability to self-replicate and to give 
rise to mature cells. The concept of stem cells was 
originally based on renewing tissues. Many adult tis- 
sues, such as the skin, blood, and intestines, consist 
of mostly mature and short-lived cells that must be 
continuously replaced. Stem cells were postulated 
as the source of the self-renewal. In the early 1960s, 
Canadian scientists Ernest A. McCulloch and James 
E. Till provided the first experimental proof of the 
existence of stem cells in the blood system. They re- 
vealed that a type of cell in bone marrow possesses 
the capacity to replicate itself and to differentiate to 
various lineages of mature blood cells. Self-renewal, 
together with the capacity for differentiation, de- 
fined the properties of stem cells. This definition is 
generally used in stem cell biology today. 

Stem cells can be found at different stages of fetal 
development and are present in a wide range of adult 
tissues. Many of the terms used to distinguish stem 
cells are based on their origins and the cell types of 
their progeny. There are three basic types of stem 
cells. Totipotent stem cells, meaning that their po- 
tential is total, have the capacity to give rise to every 
cell type of the body and to form an entire organism. 
Pluripotent stem cells, such as embryonic stem cells, 
are capable of generating virtually all cell types of the 
body but are unable to form a functioning organism. 
Multipotent stem cells can give rise only to a limited 
number of cell types. For example, adult stem cells, 
also called organ- or tissue-specific stem cells, are 
multipotent stem cells found in specialized organs 
and tissues after birth. Their primary function is to 
replenish cells lost from normal turnover or disease 
in the specific organs and tissues in which they are 
found. See CELL DIFFERENTIATION; HISTOGENESIS. 

Totipotent and embryonic stem cells. Totipotent 
stem cells occur at the earliest stage of embryonic 
development. The union of sperm and egg creates 
a single totipotent cell. This cell divides into iden- 
tical cells in the first hours after fertilization. All 
these cells have the potential to develop into a 
fetus when they are placed into the uterus. [To date, 
no such totipotent stem cell lines (primary cell cul- 
tures) have been developed.] The first differentiation 


of totipotent cells forms a sphere of cells called the 
blastocyst, which has an outer layer of cells and an 
inner cell mass. The outer layer of cells will form the 
placenta and other supporting tissues during fetal 
development, whereas cells of the inner cell mass 
go on to form all three primary germ layers: ecto- 
derm, mesoderm, and endoderm. The three germ 
layers are the embryonic source of all types of cells 
and tissues of the body. Embryonic stem cells are de- 
rived from the inner cell mass of the blastocyst. They 
retain the capacity to give rise to cells of all three 
germ layers. However, embryonic stem cells cannot 
form a complete organism because they are unable to 
generate the entire spectrum of cells and structures 
required for fetal development. Thus, embryonic 
stem cells are pluripotent, not totipotent, stem 
cells. See EMBRYOLOGY; EMBRYONIC DIFFERENTIA- 
TION; GERM LAYERS. 

Embryonic germ cells. Embryonic germ cells differ 
from embryonic stem cells in the tissue sources from 
which they are derived, but appear to be similar to 
embryonic stem cells in their pluripotency. Human 
embryonic germ cell lines are established from the 
cultures of the primordial germ cells obtained from 
the gonadal ridge of late-stage embryos, a specific 
part that normally develops into the testes or the 
ovaries. Embryonic germ cells in culture, like cul- 
tured embryonic stem cells, form embryoid bodies, 
which are dense, multilayered cell aggregates con- 
sisting of partially differentiated cells. The embryoid 
body-derived cells have high growth potential. The 
cell lines generated from cultures of the embryoid 
body cells can give rise to cells of all three embry- 
onic germ layers, indicating that embryonic germ 
cells may represent another source of pluripotent 
stem cells. 

Growing mouse embryonic stem cells. Much of the 
knowledge about embryonic development and stem 
cells has been accumulated from basic research on 
mouse embryonic stem cells. The techniques for sep- 
arating and culturing mouse embryonic stem cells 
from the inner cell mass of the blastocyst were 
first developed in the early 1980s. To maintain their 
growth potential and pluripotency, mouse embry- 
onic stem cells can be grown on a feeder layer, usually 
consisting of mouse embryonic fibroblast cells. The 
feeder cells support embryonic stem cells by secret- 
ing a cytokine growth factor, the leukemia inhibitory 
factor, into the growth medium. Alternatively, puri- 
fied leukemia inhibitory factor can be added to the 
growth medium without the use of a mouse em- 
bryonic feeder layer. (The leukemia inhibitory fac- 
tor serves as an essential growth factor to maintain 
embryonic stem cells in culture.) A line of embry- 
onic stem cells can be generated from a single cell 
under culture conditions that keep embryonic stem 
cells in a proliferative and undifferentiated state. Em- 
bryonic stem cell lines can produce indefinite num- 
bers of identical stem cells. When mouse embryonic 
stem cells are integrated into an embryo at the blas- 
tocyst stage, the introduced embryonic stem cells 
can contribute to cells in all tissues of the result- 
ing mouse. In the absence of feeder cells and the 


leukemia inhibitory factor in cultures, embryonic 
stem cells undergo differentiation spontaneously. 
Many studies are focused on directing differentiation 
of embryonic stem cells in culture. The goal is to 
generate specific cell types. Formation of cell aggre- 
gates with three-dimensional structure during em- 
bryonic stem cell differentiation in culture may allow 
some of the cell-cell interaction to mimic that of in 
vivo development. The culture conditions can be de- 
signed to support and select specific cell types. With 
these experimental strategies, preliminary success 
has been achieved to generate some cell types, such 
as primitive types of vascular structures, blood cells, 
nerve cells, and pancreatic insulin-producing cells. 
See GENETIC ENGINEERING; SOMATIC CELL GENETICS. 

Growing human embryonic stem cells. Since 1998, 
research teams have succeeded in growing human 
embryonic stem cells in culture. Human embryonic 
stem cell lines have been established from the inner 
cell mass of human blastocysts that were produced 
through in vitro fertilization procedures. The tech- 
niques for growing human embryonic stem cells are 
similar to those used for growth of mouse embry- 
onic stem cells. However, human embryonic stem 
cells must be grown on a mouse embryonic fibro- 
blast feeder layer or in media conditioned by mouse 
embryonic fibroblasts (see illustration). There are 
a number of human embryonic stem cell lines being 
generated and maintained in laboratories in the 
United States and other nations, including Australia, 
Sweden, India, South Korea, and Israel. The National 
Institutes of Health has created a Human Embryonic 
Stem Cell Registry, which lists stem cell lines that 
have been developed and can be used for research. 
Human embryonic stem cell lines can be maintained 
in culture to generate indefinite numbers of identi- 
cal stem cells for research. As with mouse embryonic 
stem cells, culture conditions have been designed to 
direct differentiation into specific cell types (for ex- 
ample, neural and hematopoietic cells). 

Adult stem cells. Adult stem cells, also referred 
to as somatic stem cells, occur in a wide variety 
of mature tissues in adults as well as in children. 
Like all stem cells, adult stem cells can self-replicate. 
Their ability to selfrenew can last throughout the 
lifetime of individual organisms. Unlike embryonic 
stem cells, though, it is usually difficult to expand 
adult stem cells in culture. Adult stem cells reside in 
specific organs and tissues but account for a very 
small number of the cells in tissues. They are re- 
sponsible for maintaining a stable state of the spe- 
cialized tissues. To replace lost cells, stem cells typi- 
cally generate intermediate cells called precursor or 
progenitor cells, which are no longer capable of self- 
renewal. However, they continue undergoing cell di- 
vision, coupled with maturation, to yield fully spe- 
cialized cells. Such stem cells have been identified 
in many types of adult tissues, including bone mar- 
row, blood, skin, gastrointestinal tract, dental pulp, 
retina of the eye, skeletal muscle, liver, pancreas, 
and brain. Adult stem cells are usually designated 
according to their source and their potential. Adult 
stem cells are multipotent because their potential 
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A colony of the human embryonic stem cell line H9 growing in culture. The elongated cells 
surrounding the colony are part of the mouse embryonic fibroblast feeder layer on which 
the stem cells are grown. The inset shows the enlarged area, with arrows pointing to the 
two nuclei of a single stem cell. The H9 cell line meets the criteria for the use of human 
embryonic stem cells by researchers funded by the U.S. government. (Courtesy of Clay 


Glennon) 


is normally limited to one or more lineages of spe- 
cialized cells. However, a special multipotent stem 
cell that can be found in bone marrow, called the 
mesenchymal stem cell, can produce all cell types of 
bone, cartilage, fat, blood, and connective tissues. 
See REGENERATION BIOLOGY). 

Blood stem cells. Blood stem cells, or hematopoietic 
stem cells, are the most studied type of adult stem 
cells. The concept of hematopoietic stem cells is 
not new, as it has been long realized that mature 
blood cells are constantly lost and destroyed. Bil- 
lions of new blood cells are produced each day to 
make up the loss. This process of blood cell gen- 
eration, called hematopoiesis, occurs largely in the 
bone marrow. The presence of hematopoietic stem 
cells in the bone marrow was first demonstrated by 
E. A. McCulloch and J. E. Till in a mouse model in the 
early 1960s. The first experimental work on stem 
cells was an unexpected outcome from their study 
for measuring the effects of radiation. They found 
that the blood system of a mouse that has been sub- 
jected to heavy radiation can be restored by infu- 
sion of bone marrow. The stem cells responsible 
for reconstituting the blood system generate visible 
cell colonies on the spleen of the recipient mouse. 
Each of the spleen colonies consists of one or more 
types of blood cells, and all the cells in a colony are 
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derived from a single cell. Selfrenewal capacity 
of the colony-forming cells is demonstrated by 
their ability to form secondary spleen colonies. 
Such blood stem cells, known as colony forming 
unit-spleen cells, qualify as pluripotent hematopoi- 
etic stem cells because they can replicate and give 
rise to multiple types of mature blood cells. A defini- 
tive proof of blood stem cells is their ability to recon- 
stitute the blood system. Bone marrow transplanta- 
tion demonstrates the restorative powers of blood 
stem cells in humans. See HEMATOPOIESIS; TRANS- 
PLANTATION BIOLOGY. 

Isolating blood stem cells. Like other adult stem cells, 
blood stem cells are rare and difficult to isolate. Only 
about 1 in 100,000 cells in the bone marrow is a stem 
cell. Scientists have used cell-sorting methods to en- 
rich and purify blood stem cells. Stem cells differ 
from mature cells in their surface markers, which are 
specific protein molecules on the cell membrane that 
can be tagged with monoclonal antibodies. By using 
a set of surface markers, some expressed mainly on 
stem cells and others on mature blood cells, nearly 
pure populations of stem cells can be separated from 
bone marrow. The stem cells purified by this ap- 
proach can engraft (begin to grow and function) and 
reconstitute the blood system in the recipient. In 
animal studies, as few as 30 purified stem cells can 
rescue a mouse that has been subjected to heavy ra- 
diation. Besides the bone marrow, a small number of 
blood stem cells can be found in circulating blood. In 
addition, stem cells in the bone marrow can be mo- 
bilized into the bloodstream by injecting the donor 
with certain growth factors or cytokines. This ap- 
proach can result in a large number of stem cells cir- 
culating in peripheral blood, from which they can 
be collected and used for transplant therapy. 

Umbilical cord blood and cord blood banks. An alternative 
source of blood stem cells is human umbilical cord 
blood, a small amount of blood remaining in the pla- 
centa and blood vessels of the umbilical cord. It is 
traditionally treated as a waste material after deliv- 
ery of the newborn. However, since the recognition 
of the presence of blood stem cells in umbilical cord 
blood in the late 1980s, its collection and banking 
has grown quickly. Similar to bone marrow, umbilical 
cord blood can be used as a source material of stem 
cells for transplant therapy. In 1989, the first suc- 
cessful cord blood transplant was reported for treat- 
ing a 6-year-old boy suffering from Fanconi’s anemia 
(an inherited disease that primarily affects the bone 
marrow, resulting in decreased production of blood 
cells) in Paris. Since then, over 6000 cord blood 
stem cell transplants have been performed world- 
wide, mainly in patients with blood conditions and 
in some cancer therapies. However, because of the 
limited number of stem cells in umbilical cord blood, 
most of the procedures are performed on young chil- 
dren of relatively low body weight. A current focus 
of study is to promote the growth of umbilical cord 
blood stem cells in culture in order to generate suf- 
ficient numbers of stem cells for adult recipients. 

Many blood banks have been established to col- 
lect and cryopreserve cord blood cells. Commercial 


banks offer services of storing cord blood of healthy 
newborns for potential future use by themselves or 
their siblings. Although it is considered a biological 
insurance, the chance of a child using his or her own 
cord blood is estimated at 1 per 20,000 collections. 
Of the estimated 6000 cord blood transplants, only 
14 were performed using autologous sources. 

Public banks encourage donation of cord blood 
for unrelated transplants. The Stem Cell Research and 
Therapeutic Act of 2005 (H.R. 2520) established a na- 
tional umbilical cord blood program, providing fed- 
eral funding to collect and store cord blood for blood 
cell transplants. The program functions to provide a 
national inventory of 150,000 cord blood units for 
public use and to establish a registry network inte- 
grated with the national marrow donor registry ad- 
ministered by the National Marrow Donor Program 
(NMDP). 

Mesenchymal stem cells. Mesenchymal stem cells 
(MSCs) are a type of multipotent adult stem cells, 
and they are defined by the capacity to give rise to a 
variety of connective tissue lineages, including bone, 
cartilage, tendon, muscle, and fat cells. Classic stud- 
ies found a type of cells in bone marrow stroma capa- 
ble of generating fibroblast-like cell colonies. These 
clonogenic cells were termed colony forming unit- 
fibroblasts (CFU-F). CFU-F share some characteris- 
tics of MSCs. MSCs appear as fibroblast-like spindle- 
shaped cells. CFU-F assay is still used to evaluate 
MSCs in cell cultures. MSCs can be distinguished and 
isolated from other cells based on phenotypic char- 
acteristics. Typically, MSCs express specific surface 
antigens SH2, SH4, and STRO-1 and lack blood cell 
markers CD45 and CD34. MSCs can replicate as mul- 
tipotent cells. The mesenchymal cell lineage poten- 
tial can be demonstrated in vitro with appropriate 
culture conditions. Differentiation can be induced 
to osteocytes by dexamethasone and ascorbate, to 
chrondrocytes by transforming growth factor-63, or 
to adipocytes by dexamethasone and insulin. 

MSCs are primarily obtained from bone marrow 
stromal cells. They are also found in small numbers 
in umbilical cord blood. In addition, adipose-derived 
stem cells (ASCs) have been shown to be similar to 
MSCs. Fat tissue is of mesenchymal origin and con- 
tains stromal components. ASCs can be isolated from 
fat tissue by the method of liposuction. Human ASCs 
have been shown to exhibit the capacity to give rise 
to fat, bone, cartilage, muscle, and possibly neurons. 
Thus, ASCs may provide a potential source of multi- 
potent adult stem cells. 

Neural stem cells. Neural stem cells, the multipotent 
stem cells that generate nerve cells, are a new focus 
in stem cell research. Active cellular turnover does 
not occur in the adult nervous system as it does in 
renewing tissues such as blood or skin. Because of 
this observation, it had been dogma that the adult 
brain and spinal cord were unable to regenerate 
new nerve cells. However, since the early 1990s, 
neural stem cells have been isolated from the adult 
brain as well as from fetal brain tissues. Stem cells 
in the adult brain are found in the areas called the 
subventricular zone and the ventricle zone. Brain 


ventricles are small cavities filled with cerebrospinal 
fluid. Another location of brain stem cells occurs in 
the hippocampus, a special structure of the cerebral 
cortex related to memory function. Stem cells iso- 
lated from these areas are able to divide and to give 
rise to nerve cells (Neurons) and neuron-supporting 
cell types in culture. 

Plasticity. Stem cell plasticity refers to the pheno- 
menon of adult stem cells from one tissue generat- 
ing the specialized cells of another tissue. The long- 
standing concept of adult organ-specific stem cells 
is that they are restricted to producing the cell types 
of their specific tissues. However, a series of recent 
studies have challenged the concept of tissue restric- 
tion of adult stem cells. Much of the experimental ev- 
idence is derived from transplant studies with blood 
stem cells. Bone marrow stem cells have been shown 
to contribute to liver, skeletal muscle, and cardiac 
cells in human recipients. In mouse models, purified 
blood stem cells have been demonstrated to gener- 
ate cells in nonblood tissues, including the liver, gut, 
and skin. Although the stem cells appear able to cross 
their tissue-specific boundaries, crossing occurs gen- 
erally at a low frequency and mostly only under con- 
ditions of host organ damage. The finding of stem 
cell plasticity is unorthodox and unexpected (since 
adult stem cells are considered to be organ/tissue- 
specific), but it carries significant implications for 
potential cell therapy. For example, if differentiation 
can be redirected, stem cells of abundant source 
and easy access, such as blood stem cells in bone 
marrow or umbilical cord blood, could be used to 
substitute stem cells in tissues that are difficult to iso- 
late, such as heart and nervous system tissue. How- 
ever, the concept of plasticity has been the subject 
of controversy. The observed frequency of lineage 
conversion is generally low. An alternative explana- 
tion to plasticity is the phenomenon of fusion of host 
and donor cells. Recent findings suggest that blood 
cells contribute to other tissues by fusing with pre- 
existing cells rather than by converting to other cell 
lineages. 

Potential clinical applications. Stem cells hold great 
potential for developing cell therapies to treat a wide 
range of human diseases. Already in clinical use is 
blood stem cell transplant therapy, well known as 
bone marrow transplant therapy for the treatment of 
patients with certain types of blood diseases and can- 
cers. The discovery of stem cells in various adult tis- 
sues, stem cell plasticity, and human embryonic stem 
cells brings new excitement and opportunities. Stem 
cells offer the possibility of cell replacement ther- 
apy for many human diseases, such as Parkinson’s 
and Alzheimer’s diseases, spinal cord injury, diabetes, 
heart disease, and arthritis, that result from loss or 
damage of cells in a specialized tissue of the body. 
Stem cell therapy might revolutionize the treatment 
and outcome of these diseases. Stem cell science is 
still in the very early stage. Much more research is re- 
quired to understand the biological mechanisms that 
govern cell differentiation and to identify factors that 
direct cell specialization. Future cell therapy will de- 
pend largely on advances in the understanding of 


stem cell biology and the ability to harness the pro- 
cess of stem cell growth and differentiation. 

Somatic cell nuclear transfer (SCNT) stem cells. 
SCNT involves a micromanipulation procedure in 
which the nucleus of an egg is removed and re- 
placed by a nucleus taken from somatic cells, typ- 
ically skin cells. Successful nuclear transfer requires 
reprogramming of the donor nucleus. The cells so 
created may divide in cultures to generate embry- 
onic stem cells that can initiate embryogenesis. This 
is the technique being used in cloning animals, such 
as the first cloned mammal, Dolly the sheep. How- 
ever, cloning by nuclear transfer is observed with ex- 
tremely low efficiency, probably due to faulty and in- 
complete reprogramming of the donor nucleus. The 
mechanisms governing the transition from a differ- 
entiated genome to a totipotent state remain largely 
unknown. 

A major interest in SCNT is the prospect of creat- 
ing patient-specific embryonic stem cells. These cells 
would be genetically identical to the nuclear donor 
except for maternal mitochondrial deoxyribonucleic 
acid (mtDNA) of the oocyte. Therefore, the prob- 
lem of graft rejection would be avoided if the cells 
could be used for transplant therapy for the donor 
patients. The concept of using SCNT to generate cus- 
tomized stem cells for cell therapy is also referred 
to as therapeutic cloning. However, there are hur- 
dles and limitations to using embryonic stem cells in 
clinical applications. A major challenge is to achieve 
the directed differentiation and controlled growth 
before stem cells can be used for transplant therapy. 
Another issue on SCNT in human stem cells is the 
sourcing of human eggs. The procedure requires a 
large number of eggs from women, and poses an 
ethical and technical challenge. See HISTOCOMPATI- 
BILITY. 

The success in producing embryonic stem cell 
lines by the SCNT technique has been demonstrated 
in mice. In an article published in Science in 2005, a 
team led by Hwang Woo Suk of South Korea claimed 
the establishment of patient-specific stem cell lines 
by using the SCNT technique. However, the paper 
was later retracted as the results were fabricated and 
the claim a fraud. The field is still left uncertain if 
somatic nuclear replacement is feasible in humans. 

Ethical and regulatory issues. The use of human 
embryonic stem cells raises ethical, social, and le- 
gal issues. The major concern centers on the source 
of stem cells. Human embryonic stem cell lines are 
made from the inner cell mass of a blastocyst stage 
embryo. Most embryos used to produce stem cells 
are left over from in vitro fertilization AVP) treat- 
ment. The embryos are destroyed by the procedure 
of extracting stem cells. The early embryo has the bi- 
ological potential to develop into a person. However, 
society has not reached consensus on when human 
life begins. The attention on stem cell research and 
cloning calls for regulation and legislation from gov- 
ernments. In the United States, current policy allows 
federal funds to be used for research only on existing 
human embryonic stem lines. The human embryonic 
stem cell lines that meet the eligibility criteria are 
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listed in the Human Embryonic Stem Cell Registry 
by the National Institutes of Health (NIH). 

One concern about SCNT is that it may lead to 
the reproductive cloning of humans. In theory, the 
embryo created via SCNT could be used to clone a 
human if it were implanted into a womans uterus. 
In the United States, the legislators in the House of 
Representatives and the Senate have introduced bills 
proposing a ban of all forms of cloning, including 
research cloning, or inhibiting reproductive cloning 
while preserving therapeutic cloning research. How- 
ever, these bills have not been passed, and no federal 
law has been established on human embryonic stem 
cell research. The Canadian Parliament has passed 
Bill C-6 that prohibits creation of a human clone, sale 
of sperm or ova, and commercial surrogacy. The bill 
permits the use of stem cells obtained from discarded 
products of in vitro fertilization, that is, excess and 
unused embryos. In the United Kingdom, a law per- 
mits the use of embryos in research and therapeutic 
cloning research but bans reproductive cloning, and 
implanting a cloned embryo in a human uterus is li- 
able to criminal prosecution. Cloning research must 
be licensed from the Human Fertilization and Embry- 
ology Authority that governs embryonic and stem 
cell research in the United Kingdom. Chen Wang 
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Stenolaemata 


A class of Bryozoa confined to fully marine water. 
The Stenolaemata include several thousand species 
distributed among four orders: Cystoporata, 
Trepostomata, Cryptostomata, and Cyclostomata 
(=Tubuliporata). The commonly recognized order 
Fenestrata likely is part of the Cryptostomata. First 
appearing late in the Early Ordovician, stenolae- 
mates expanded quickly to dominate bryozoan 
assemblages through the Early Cretaceous. By 
mid-Cretaceous, the calcified gymnolaemate order 
Cheilostomata, which originated in the Late Jurassic, 


had increased in diversity and abundance to equal 
that of stenolaemates, eventually surpassing the 
stenolaemates, which are now a minor part of 
the bryozoan fauna in most marine environments. 
See BRYOZOA; CRYPTOSTOMATA; CYCLOSTOMATA 
(BRYOZOA); CYSTOPORATA; TREPOSTOMATA. 

Colony morphology. Stenolaemate colonies vary 
greatly in size and shape. Many Paleozoic through 
mid-Mesozoic forms were large and massive, but 
more recent representatives are commonly small 
and delicate. Colonies are encrusting, erect, or free- 
living. Some erect forms have single or regularly 
spaced, flexible cuticular joints; segments between 
joints and all other entire colonies are rigidly calci- 
fied and enclosed within a thin cuticular membrane. 
The outer cuticle is secreted by a thin layer of ep- 
ithelial cells that on colony basal (underside) sur- 
faces switch secretion from cuticle to calcite crystals 
bound within ultrathin cuticular sheets. This may 
also happen locally on other surfaces of colonies, 
generating fixed-wall regions. In actively growing 
zones and over entire nonbasal surfaces of many 
colonies, all skeleton above the basal surface is se- 
creted by single layers of epithelial cells that are ulti- 
mately folded in from the basal wall, with a fluid-filled 
gap between the skeleton and the outer cuticle, gen- 
erating free-walled colonies or regions. 

Zooidal morphology. Living stenolaemate colonies 
are made up of individual feeding units called zooids 
with lophophores (rings of tentacles) that are circu- 
lar in basal outline (see illustration). Lophophores 
of feeding zooids bear 8 to fewer than 20 tentacles 
that flare into usually simple conical or bell shapes 
when in the feeding position. The lophophore barely 
clears the skeletal aperture (terminal opening) when 
fully protruded, with the mouth most commonly 
at or just below the skeletal aperture. Tentacles 
of living stenolaemates (Cyclostomata) have lateral 
cilia tracts, but unlike other bryozoans lack frontal 
cilia. 

Zooidal body cavities are enclosed within tubular 
or prismatic zooecia (skeletons of individual zooids), 
gradually expanding from their proximal ends. The 
outer cuticle folds into each zooid as a vestibular wall 
and generally attaches by ligaments to the skeleton 
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Some morphologic features of autozooids of 
Cyclostomata, the only living order of stenolaemates. 


at some distance below the zooecial aperture. In all 
but one Mesozoic group, the zooidal orifice (wall 
opening through which lophophore and tentacles 
protrude) is closed by a ring of muscles, instead of 
a covering flap (epistome or operculum). Two mem- 
branes continue inward from the ligament, one ex- 
tending as a sleeve-shaped sheet of ectoderm and 
mesoderm attached at its inner end around the base 
of tentacles. The other inward-extending membrane 
(the membranous sac) divides the body cavity and 
surrounds the gut, and consists only of mesodermal 
cells, interpolated annular muscle cells, and a base- 
ment membrane. Contraction of the muscle cells of 
the membranous sac appears to be critical in protru- 
sion of the lophophore to its feeding position. 

Zooecia typically are relatively thin-walled in 
colony interiors (endozone) and relatively thick- 
walled in outer regions (exozone). Inner surfaces 
of zooecia are lined by a single layer of epithelium, 
and it has been hypothesized that as a prerequisite 
for calcification of the tubular zooecia, the outer 
body wall of the stenolaemate ancestor split into the 
inner, mesodermal membranous sac and the outer, 
calcite-secreting ectoderm. Most Paleozoic stenolae- 
mates had imperforate zooecia, but skeletal walls of 
most post-Paleozoic forms have pores (except basal 
walls). Ancient stenolaemates variously had planar 
transverse partitions (diaphragms), variously curved 
partitions or blisters, and spines subdividing or pro- 
jecting into zooecial cavities. Undivided zooecial cav- 
ities function as living chambers, otherwise only the 
portion of the zooecial cavity on the outer side of the 
last-formed basal diaphragm functions as the living 
chamber. Living chambers vary across the class from 
about 2 x 107? mm? to 1 mm’. 

Polymorphism. Many stenolaemate colonies are 
constituted of skeletally monomorphic zooids, but 
polymorphs are present in some. The most common 
polymorphs are structural (various kenozooids), or 
females (gonozooids) with brooding chambers. More 
rarely a few other types of polymorphs occur, such 
as nanozooids (reduced zooids with single tentacles) 
and eleozooids (avicularium-like zooids). 

Reproduction and growth. Individual colonies are 
hermaphroditic, some only with skeletally mono- 
morphic, apparently hermaphroditic feeding zooids, 
and others with female zooids distinguished by 
chambers that brood polyembryonic embryos (mul- 
tiple embryos developed by fission from a single fer- 
tilized egg). Larvae do not feed and are short-lived. 
They settle and metamorphose into a hemispherical 
protoecium from which a cylindrical tube extends to 
complete the ancestrular zooid (initial zooid of the 
colony). Colonies grow by asexual budding, and the 
polypide (organs and tissues) of each zooid is formed 
by proliferation of cells on the underside of the cuti- 
cle that overlies the skeletal orifice. Polypides usually 
live only a few weeks, after which they degenerate 
into a roughly spherical, brown-stained mass (brown 
body). A series of polypides may bud during the life 
of each zooid. Frank K. McKinney 

Bibliography. R. S. Boardman, General features of 
the Class Stenolaemata, in R. A. Robison (ed.), Trea- 
tise on Invertebrate Paleontology, pt. G: Bryozoa 


Stephanoberyciformes 


(revised), vol. 1, Geological Society of America and 
University of Kansas, Boulder, 1989; R. S. Boardman, 
Reflections on the morphology, anatomy, evolu- 
tion, and classification of the Class Stenolaemata 
(Bryozoa), Smithson. Contrib. Paleobiol., 86:1-59, 
1998; R. S. Boardman, A. H. Cheetham, and A. J. 
Rowell (eds.), Fossil Invertebrates, Blackwell Sci- 
entific Publications, Palo Alto, 1987; A. Ernst, Bry- 
ozoans, in B. D. Webby et al. (eds.), The Great 
Ordovician Biodiversification Event, pp. 147-156, 
Columbia University Press, New York, 2004; C. 
Nielsen and K. J. Pedersen, Cystid structure and 
protrusion of the polypide in Crisia (Bryozoa, 
Cyclostomata), Acta Zool. (Stockholm), 60:65-88, 
1979. 


a 
Stephanoberyciformes 


An order of fishes containing the pricklefishes and 
whalefishes. This order is recognized by G. D. 
Johnson and C. Patterson, but interrelationships of 
the taxa placed within it are debated among ichthy- 
ologists. Stephanoberyciformes is recognized as the 
sister group of all remaining acanthomorphs. It is 
defined by the following characters: pelvic girdle at- 
tached to cleithrum or coracoid of pectoral girdle; 
skull bone usually exceptionally thin (fragile bones 
often characteristic of pelagic fishes); subocular shelf 
absent; supramaxilla absent or reduced; and body 
shape variable, from elongate to short and rounded. 
The biology of stephanoberyciformes as a group is 
poorly known, and many species are known from 
only a few specimens. These are small deep-water 
fishes, ranging in total length from 2.1 to 43 cm (0.8 
to 17 in.), but most are less than 10 cm (4 in.), and 
occupying depths to 5300 m (17,400 ft or 3.3 mi). 
Several species are known to be oviparous (the as- 
sumed mode of reproduction of the entire order), 
with planktonic eggs and larvae. The order com- 
prises nine families, 28 genera, and about 75 de- 
scribed species, plus many others known but yet un- 
described. Included in the order are families taken 
from the Beryciformes, Lampriformes, and the Tra- 
chichthyiformes taxa as described by J. A. Moore. See 
DEEP-SEA FAUNA; OSTEICHTHYES; TELEOSTEI. 
Melamphaidae (bigscale fishes). Melamphaidae, 
comprising five genera and 36 species, can be dis- 
tinguished by the following features: dorsal fin with 
1-3 weak spines; pelvic fins thoracic or subthoracic, 
with 1 spine and 6 to 8 soft rays; caudal fin preceded 
dorsally and ventrally by 3 or 4 posteriorly directed 
spines; cycloid scales, often large and deciduous; lat- 
eral line absent, or limited to 1 or 2 pored scales; 
and lengths of 2.1 to 18 cm (0.8 to 7 in.), but most 
less than 10 cm @ in.). Some species make daily 
vertical migrations from 700 m (2300 ft), and oth- 
ers are bathypelagic to depths exceeding 3500 m 
(11,500 ft). They occur in all oceans, but are un- 
known in Arctic seas and the Mediterranean Sea. 
Stephanoberycidae (pricklefishes). Stephanobery- 
cidae (Fig. 1), comprising three monotypic genera, 
can be identified by the following characteristics: 
8 to 11 spines preceding the principal caudal fin rays 
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Fig. 1. Pricklefish (Stephanoberyx monae). (From G. Brown 
Goode and T. H. Bean, Oceanic Ichthyology, plate 55, 1896) 


dorsally and ventrally; dorsal and anal fin spines, if 
present, weak; 10 to 14 soft rays in each dorsal and 
anal fin; pelvic fins abdominal or subabdominal, each 
with 5 soft rays; scales smooth or spiny; and lateral 
line faint. The three species represent the family in 
the eastern and western Atlantic (including the Gulf 
of Mexico), the Indian Ocean off South Africa, and 
the western and central Pacific. They are 8.1 to 13 
cm @.2 to 5.1 in.) in total length and have a depth 
range of 1655 to 5300 m (5430 to 17,400 ft). 

Hispidoberycidae. Hispidoberycidae is a mono- 
typic family known from the northeastern Indian 
Ocean and the South China Sea. The sole species, 
Hispidoberyx ambagiosus, has scales covered with 
small spines, an operculum with a stout spine, a dor- 
sal fin with 4 or 5 spines and 10 soft rays, an anal fin 
with 3 spines and 9 soft rays, a total length of 18 cm 
(7 in.), and a depth range of 560 to 1000 m (1840 to 
3280 ft). 

Gibberichthyidae (gibberfishes). Gibberichthyidae 
comprises one genus and two species of rare deep- 
sea fishes. Gibberichthys latifrons occurs in the 
Indo-West Pacific, where adults occupy depths be- 
tween 750 and 2000 m (2460 and 6560 ft), and G 
pumilus occurs in the western Central Atlantic (in- 
cluding the Gulf of Mexico) and the western Pacific at 
depths between 320 and 1100 m (1050 and 3600 ft). 
Both species are about 13 cm (5.1 in.) in total length. 
Larvae are usually found at depths less than 50 m 
(164 ft). The family is distinguishable by the follow- 
ing features: subabdominal pelvic fins, each with 1 
spine and 5 or G soft rays; a series of 5 to 8 and 4 or 
5 partly isolated short spines preceding the soft dor- 
sal and anal fins, respectively; 7 to 9 dorsal and anal 
soft rays; cycloid scales; flanks with vertical rows of 
papillae over the vertical lateral line tubes; and swim 
bladder partially filled with fat. Small crustaceans are 
known foods of both prejuveniles and adults. See 
SWIM BLADDER. 

Whalefishes. The following three families are 
called whalefishes; however, they are far from whale 
size, with the species ranging from only 11 to 39 cm 
(4 to 15 in.). The whalefishes share the follow- 
ing characteristics: whale-shaped body; very large 
mouth and highly distensible stomach; eyes well de- 
veloped or degenerate; lateral line of large hollow 
tubes; luminous tissue on body; dorsal and anal fins 
far posteriorly and opposite one another; swim blad- 
der absent; and black body with red and orange- 
tipped fins. Whalefishes are bathypelagic. 

Rondeletiidae (redmouth whalefishes). Rondeleti- 
idae is distinguished by a lateral line of 14 to 26 pores 
in a vertical series, no fin spines, and the presence 





of pelvic fins. It consists of two species: Rondeletia 
bicolor (Fig. 2), from the western Atlantic off North 
Carolina and Suriname, which has a forward-directed 
spine above the eye and occurs at depths to 3000 m 
(9840 ft); and R. loricata, which occurs in temper- 
ate and tropical waters of the Atlantic, Indian, and 
Pacific oceans to depths of 1130 m (3700 ft), makes 
diurnal migrations from 750 m (2460 ft) up to 100 m 
(328 ft), and is known to feed on isopods and crus- 
tacean remains. Both species are only about 14 cm 
(5.5 in.) in total length. 

Barboursiidae (red whalefish). Barboursiidae is a 
family of only one species, Barbourisia rufa, which 
occurs in tropical and temperate latitudes through- 
out the world’s oceans, including the western 
Atlantic (Gulf of Mexico) and eastern Pacific (off 
north-central California). Identifying features are as 
follows: fin spines absent; large mouth, with the max- 
illa extending far beyond the eye; lower jaw project- 
ing beyond the upper; jaw teeth in broad villiform 
bands; body and fins with minute spinules, result- 
ing in a velvety feel; lateral line with distinct pores; 
maximum total length of about 43 cm (17 in.); and 
juveniles mesopelagic and adults benthopelagic to 
depths of 2000 m (6560 ft). 

Cetomimidae (flabby whalefishes). Cetomimidae 
(Fig. 3), with nine genera and 35 species (15 of 
which are known but yet undescribed), plus 36 
species in the family Melamphaidae account for 75% 
of the order Stephanoberyciformes. The following 
features identify the family: skin loose and without 
scales; eyes very small or vestigial; no pelvic fins; 
3 or 4 pairs of gill arches; photophores absent, but 
luminous organs often present around anus and dor- 
sal and anal fin bases; and in live specimens brown 
or orange color with bright orange or red jaws and 
fins. The very few males that are known are only 3 
to 5 cm (1.2 to 2 in.) in length and were previously 
classified as juveniles. The maximum total length is 
about 43 cm (17 in.). These fishes are oceanic in the 


Fig. 2. Redmouth whalefish (Rondeletia bicolor). (From G. 
Brown Goode and T. H. Bean, Oceanic Ichthyology, plate 
77, 1896) 





Fig. 3. Flabby whalefish (Cetomimidae sp.). (Photo from 
Alaska Fisheries Science Center, National Oceanic and 
Atmospheric Administration, National Marine Fisheries 
Service) 


Atlantic, Indian, and Pacific oceans, and are one of 
the deepest-dwelling families of the order, with one 
species, Cetichthys parini, at 5000 m (16,400 ft). See 
BIOLUMINESCENCE; PHOTOPHORE GLAND. 

Mirapinnidae (tapertails). Mirapinnidae, compris- 
ing three genera and five species, occurs in tropi- 
cal and subtropical zones of the Atlantic and Indian 
oceans and the western Pacific. The following char- 
acteristics identify the family: no scales; fins without 
spines; dorsal and anal fins opposite one another; 
dorsal fin with 16 to 33 soft rays and anal fin with 
14 to 29 rays; pectoral fin with 13 to 24 rays; and 
pelvic fin jugular with 4 to 10 rays. Two subfamilies 
are recognized. 

Mirapinninae, with one species, Mirapinna esau, 
is called hairyfish because of the short hairlike pile 
covering the body. It is further identified by pectoral 
fins inserted very high on the body, broad pelvic fins 
inserted in the jugular position well above the ventral 
profile, and overlapping halves of the caudal fin. 

Eutaeniophorinae has two genera and four 
species. The body is eel-like, with the dorsal and anal 
fins placed far posteriorly and opposite one another. 

Megalomycteridae (longnose fishes). Megalomyc- 
teridae comprises four genera and eight species, five 
of which are yet undescribed. The longnoses oc- 
cupy depths between 1500 and 4000 m (4900 and 
13,000 ft) in the Atlantic and Pacific oceans. They are 
identified by the following: exceptionally large olfac- 
tory organs; lack of fin spines; usually lacking pelvic 
fin, but if present thoracic with 1 to 3 rays; and less 
than 6 cm (2.4 in.) in total length. Herbert Boschung 
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Stepping motor 


An electromagnetic incremental-motion actuator 
which converts digital pulse inputs to analog out- 
put motion. The device is also termed a step motor. 
When energized in a programmed manner by a volt- 
age and current input, usually dc, a step motor can 
index in angular or linear increments. With proper 
control, the output steps are always equal in num- 
ber to the input command pulses. Fach pulse ad- 
vances the rotor shaft one step increment and latches 
it magnetically at the precise point to which it is 
stepped. Advances in digital computers and partic- 
ularly microcomputers revolutionized the controls 
of step motors. These motors are found in many 
industrial control systems, and large numbers are 
used in computer peripheral equipment, such as 
printers, tape drives, capstan drives, and memory- 
access mechanisms. Step motors are also used in 
numerical control systems, machine-tool controls, 
process control, and many systems in the aerospace 
industry. See COMPUTER GRAPHICS; COMPUTER NU- 
MERICAL CONTROL; COMPUTER PERIPHERAL DEVICES; 
CONTROL SYSTEMS; DIGITAL-TO-ANALOG CONVERTER; 
PROCESS CONTROL. 

Major types. There are many types of step mo- 
tors. Most of the widely used ones can be classified 
as variable-reluctance, permanent-magnet, or hybrid 
permanent-magnet types. These three categories can 
be compared with regard to various properties (see 
table). 

Variable-reluctance motor. A variable-reluctance step 
motor is simple to construct. The stator and rotor 
have different numbers of teeth. For example, a 
three-phase, single-stack motor can have 12 stator 
teeth and 8 rotor teeth, and be wound in a four-pole 
configuration (Fig. 1). When phase A is energized 




















Fig. 1. Three-phase, single-stack, variable-reluctance step 
motor with 12 stator teeth and 8 rotor teeth, wound ina 
four-pole configuration. Only the windings on phase A are 
shown. A, B, C are phases; N and S are north and south 
magnetic poles. 
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Comparison of major types of step motors 
Hybrid 
Property Variable-reluctance Permanent-magnet permanent-magnet 
Complexity Simple construction Relatively complex 
construction 
Performance and Low efficiency; Low performance Efficient; low current 
efficiency requires relatively due to high per unit torque 
high current per inductance and requirement 
unit torque stamped motor 
parts 
Windings and Unifilar windings Bifilar windings Bifilar windings and 
drives and unipolar driver and unipolar unipolar driver; or 
driver; or unifilar windings and 
unifilar windings bipolar driver 
and bipolar driver 
Detent No inherent detent Inherent detent Inherent detent 
Rotor inertia Low rotor inertia High rotor High rotor inertia due 
due to lack of a inertia due to to permanent magnet 
permanent magnet permanent magnet 








by a dc voltage across the terminals, four of the rotor 
teeth will line up with the four stator teeth of the 
phase. When phase B is energized and phase A is 
deenergized, the rotor will rotate clockwise by 15° 
so that the other four rotor teeth will line up with 
the four stator teeth of phase B. Energizing phase C 
will cause the rotor to rotate counterclockwise by 
15°. The principle of operation of the single-stack 
variable-reluctance step motor requires that some of 
the rotor teeth seek the minimum-reluctance path 
with the stator teeth of the energized phase. See RE- 
LUCTANCE. 

Permanent-magnet motor. The permanent-magnet step 
motor (Fig. 2) has a rotor which is radially magne- 
tized. Each stator section may have a single wind- 
ing (unifilar winding) or a winding with a center 
tap, in which case the winding is called bifilar. A 
unifilar-wound motor must be driven by a bipolar 
drive; that is, the drive current must be reversed back 
and forth to advance the rotor. In contrast, a bifilar- 
wound permanent-magnet motor can be driven with 
a unipolar drive. 

Hybrid permanent-magnet motor. This type of step motor 
also has a permanent-magnet rotor. Hybrid refers to 
the motor’s operation under the combined princi- 
ples of permanent-magnet and variable-reluctance 

















(a) 


Fig. 2. Two-phase permanent-magnet step motor. (a) Rotor, 
magnetized with six radial poles. N and S are north and 
south magnetic poles. (b) Cutaway view. 


step motors. The motor was originally designed as a 
two-phase ac synchronous motor for low-speed ap- 
plications. It was found that the motor could also step 
incrementally if the phase windings are excited with 
dc pulses. The rotor of this step motor (Fig. 3) has a 
permanent magnet positioned in the axial direction 
to produce a north-south unidirectional magnetic 
field. The teeth on the two sections of the rotor are 
offset by one-half of a rotor tooth pitch. In general, 
the stator teeth are all uniformly distributed with a 
tooth pitch which is different than that of the rotor. 
The stator may have unifilar two-phase windings or 
bifilar windings. In the bifilar case, each phase has 
two windings, and the motor is often referred to as 
a four-phase motor. 

Winding configurations. Most variable-reluctance 
step motors are wound with bifilar windings. 
Permanent-magnet and hybrid permanent-magnet 
step motors can be wound with either bifilar or unifi- 
lar windings. 

Unifilar windings. A unifilar winding is a single coil 
wound on a pole of a motor. When a direct current 
is sent through the winding, a flux is produced in 
the pole if the magnetic path is closed. This pro- 
duces a magnetic polarity at the pole. If the di- 
rection of the current is reversed, the direction of 
the flux will also be reversed. Thus, in order to re- 
verse the polarity of the magnetic pole, with a unifi- 
lar winding, the direction of the current must be 
reversed. This requires a bipolar driver, that is, a 
driver that can reverse polarity between positive and 
negative. 

Bifilar windings. A bifilar winding consists of two 
windings in opposite sense on the same pole. When 
a driver sends a current into one coil, a magnetic 
polarity is induced on the pole. By applying the 
same current to the other coil instead, the magnetic 
polarity of the pole is reversed. Thus, with a bifi- 
lar winding the magnetic pole polarity can be re- 
versed without reversing the polarity of the drive 
voltage. The advantage is that only a single-ended, or 
unipolar, power supply is needed. A disadvantage of 
the bifilar winding is that only one-half of the total 
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Fig. 3. Hybrid permanent-magnet step motor with 
eight-pole stator. N and S are north and south magnetic 
poles. (a) Axial cross section. (b) Radial cross sections at 
A-A’ and B-B’ on part a. 


windings are used at any time so that for a given frame 
size the motor is derated. However, bifilar windings 
can be connected in series or in parallel so that the 
entire windings can be utilized. Since the operations 
of permanent-magnet and hybrid permanent-magnet 
step motors depend on the reversal of the magnetic 
polarities of the stator poles, these motors must have 
either bifilar windings with a unipolar driver or unifi- 
lar windings with a bipolar driver. See WINDINGS IN 
ELECTRIC MACHINERY. 

Torque. Step motors are torque-generating de- 
vices. The torque can be classified as static and dy- 
namic. When a phase of a variable-reluctance step 
motor is energized with a direct current, a static hold- 
ing torque is developed. As the rotor is forced to ro- 
tate, ideally a sinusoidal static holding torque curve 
is generated for this phase. Similarly, torque is gen- 
erated when the other two phases of a three-phase 
motor are energized in turn. The equilibrium posi- 
tions are also the detent positions of the individual 
phases. In practice, the static holding torque may not 
be sinusoidal, depending on the design of the motor. 
A variable-reluctance motor must have at least three 
phases to rotate continuously. 

The torque components of a hybrid permanent- 
magnet motor are complex. The permanent mag- 
net in the rotor generates a fourth-harmonic detent 
torque when the stator windings are not energized 
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relative to the torque generated when the stator 
windings are energized. 

The pull-out torque of a step motor represents 
the dynamic performance of the motor. It is de- 
fined as the maximum frictional torque that can be 
driven by the motor at a given speed. The pull-out 
torque (torque-speed) characteristic of a step motor 
is similar in shape to that of a dc motor. See DIRECT- 
CURRENT MOTOR; TORQUE. 

Drivers. A driver stage for a variable-reluctance 
step motor has a power transistor for each motor 
phase. To energize a certain motor phase, a logic se- 
quencer simply sends out a signal to turn on the cor- 
responding power transistor so that a current sup- 
plied by the power source will flow through the 
motor phase windings. To drive the motor in a partic- 
ular sequence, the phases are energized accordingly 
one at a time. The phases can also be energized two 
at a time. 

The unipolar driver for a variable-reluctance step 
motor can also be used for permanent-magnet and 
hybrid permanent-magnet step motors with bipolar 
windings. As noted above, a bipolar driver is required 
for unifilar motor windings. 

The power transistors of the step motor driver can 
be controlled by a chopper driver to control the av- 
erage current to the motor windings. Typically, the 
motor is switched to a high supply voltage, several 
times the rated value, causing the current to build 
up rapidly. A current-sensing network turns the high 
voltage to a low voltage when the current reaches 
some preset threshold. The frequency of chopping 
can range from 1 to 30 kHz. See CHOPPING; MOTOR; 
SERVOMECHANISM. Benjamin C. Kuo 

Bibliography. P P Acarnley, Stepping Motors: A 
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1995; B. C. Kuo (ed.), Step Motors and Control Sys- 
tems, 1979. 
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The study of the three-dimensional arrangement of 
atoms or groups within molecules and the proper- 
ties which follow from such arrangement. Molecules 
that have identical molecular structures (that is, the 
same kind, number, and sequential arrangement of 
atoms) but differ in the relative spatial arrangement 
of component parts are stereoisomers. Inorganic and 
organic compounds exhibit stereoisomerism. Exam- 
ples are structures (1) to (8). In the example, pairs 
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of stereoisomers are related by permutation (trans- 
position) of bonded atoms or groups. In structures 
(3) and (4), a single permutation of Cl and H about 
either C atom produces two stereoisomers. Atom C2 
may be called a stereogenic atom (or a stereocenter). 

Significance. Stereochemistry has played a signif- 
icant role in the historical development of theories 
of molecular structure. The optical activity of sub- 
stances such as sugar and turpentine was discov- 
ered by J. B. Biot in 1813, and L. Pasteur was the 
first to separate or resolve enantiomers (nonsuper- 
imposable mirror images) from one another (1848). 
While Pasteur recognized that enantiomers must dif- 
fer in symmetry, it was not until 1874 that J. H. van’t 
Hoff and J. A. LeBel were able to relate symmetry, 
structure, and properties into the hypothesis of the 
tetrahedral carbon which formed the cornerstone 
of modern stereochemistry. The foundation of inor- 
ganic stereochemistry rests on the coordination the- 
ory of A. Werner (1893). 

Since the latter part of the nineteenth century, 
stereochemical studies have been prominent in the 
evolution of both inorganic and organic chemistry. 
These studies have concerned themselves with the 
determination of configuration, the interconversion 
of diastereomers and the assessment of their energy 
differences, and conformational analysis. 

Methods for the analysis and separation of 
stereoisomers have been devised, and stereochem- 
ical principles have been applied to the elucida- 
tion of reaction mechanisms, to the development 
of asymmetric syntheses, and to attempts to under- 
stand biological processes at the molecular level. 
See ASYMMETRIC SYNTHESIS; COORDINATION CHEM- 
ISTRY; RACEMIZATION. 

Symmetry. The nature and the number of 
stereoisomers of a molecule are determined by the 
permutation of atoms or groups (called ligands) at 
stereocenters as well as by the symmetry of the 
molecule. The symmetry elements to be considered 
are: planes of symmetry, axes of symmetry, centers 
of symmetry, and rotation-reflection (or alternating) 
axes of symmetry. Two types of stereoisomers are 
known. Those such as (7) and (8), which are de- 
void of reflection symmetry—which cannot be su- 


perposed on their image in a mirror—are called 
enantiomers. All other stereoisomers, such as the 
pairs (1)-(2), 3)-(4), and (5)-(6), are called diastere- 
omers. The configuration of a stereoisomer desig- 
nates the relative position of the atoms associated 
with a specific structure. The structures of stereoiso- 
mers (1) and (2) differ only in configuration. The 
same is true for (3) and (4), (5) and (7), and (7) and 
(8). 

Enantiomers are related to one another as a left 
hand is to a right hand. Such structures are said 
to be chiral (Greek cheir = hand) or dissymmetric. 
Unlike diastereomers, which differ in most physical 
and chemical properties, enantiomers have identical 
properties other than the sign (+ or —) of their op- 
tical activity. This identical behavior exists toward 
all agents and processes which are themselves achi- 
ral. Chiral agents act differently toward enantiomers, 
however. Biological specificity toward enantiomers 
is dramatic; for example, the enantiomers of the 
amino acid leucine have different tastes: one is bit- 
ter, while the other is sweet. See BIOLOGICAL SPECI- 
FICITY; MOLECULAR ISOMERISM. 

Configuration. In order to understand chemical 
processes involving stereoisomers, it is necessary 
to know the configurations of reactants and prod- 
ucts. The configuration of stereoisomers (1) and 
(2) are designated cis (chlorines on the same side) 
and trans (chlorines on opposite sides), respectively; 
(3) and (5) are trans; (4) and (6) are cis. Configura- 
tions of diastereomers can often be determined from 
their physical properties. For example, the isomer 
of C,H2Cl, with zero dipole moment is (5), while 
(4) has a finite dipole moment. See DIPOLE; DIPOLE 
MOMENT. 

The absolute configuration of chiral molecules 
such as (7) and (8) is the actual order of atoms about 
the stereogenic atom (tetrahedral carbon atom) 
which defines the specific enantiomer. While the 
three-dimensional picture of a molecule defines its 
stereochemistry just as the picture of a hand defines 
its handedness (right or left), it is not convenient or 
useful to rely on full three-dimensional representa- 
tions in most cases, and a shorthand notation to des- 
ignate configuration is widely accepted. Groups and 
atoms which surround the stereocenter are assigned 
priorities related by unambiguous rules to atomic 
number, for example, Br > Cl > F > H, and to sub- 
stitution pattern. The molecule is viewed from the 
tetrahedral face opposite the carbon substituent with 
the lowest priority, that is, from the face opposite 
hydrogen in the case of (7). The counterclockwise 
descent (7a) from high- to low-priority substituent 
(Br followed by Cl followed by F) is designated S 
(Latin sinister = left). Enantiomer (8) exhibits a sim- 
ilar but clockwise order (8a) designated R (Latin 
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rectus = right). When the order of groups imme- 
diately bound to the stereocenter is equivocal, the 
priority is determined by atoms or groups attached 
to the atoms whose priorities are equivocal. 

The same priorities may be used in the designation 
of the configurations of diastereomers such as (3) 
and (4). When the atoms or groups of highest priority 
lie on opposite sides of the double bond, as in (3), the 
stereoisomer is designated E (German entgegen = 
opposite). Isomer (4) in which the reference atoms 
are on the same side of the double bond are termed 
Z (German zusammen = together). This system is 
less ambiguous than the cis-trans system which is still 
applicable in simple cases. 

Enantiomers are characterized by the sign of their 
optical activity at a given wavelength, temperature, 
and specified solvent. The assignment of configura- 
tion to specific enantiomers is carried out by chem- 
ical transformations of known stereochemistry, by 
spectroscopic means such as circular dichroism and 
optical rotatory dispersion, and by diffraction of sin- 
gle crystals employing anomalous x-ray scattering. 
From such studies it is known that the lactic acid iso- 
mer that is experimentally found to be levorotatory, 
that is, whose optical activity is (—)- or counter- 
clockwise-rotating, has structure (9) corresponding 


COOH oH COOH 
Ho—— ch; = HO=— C— CH; = 
Z HO CH3 
n H 
(9b) @) Oa) 
[priority OH > 
COOH > CH3] 


to the R configuration (9a). Configuration has no 
simple connection to the sign of the optical rotation. 
See COTTON EFFECT; DICHROISM; OPTICAL ACTIV- 
ITY; OPTICAL ROTATORY DISPERSION; X-RAY DIFFRAC- 
TION. 

Two-dimensional (planar) projections of three- 
dimensional structures are very commonly used to 
represent configurations. Projection formulas com- 
press the tetrahedral geometry into the plane of the 
paper. In (9) the horizontal atoms or groups are for- 
ward of the plane (wedges) of the stereogenic atom, 
and the vertical groups are behind (solid or broken 
lines). Tipping formula (9) forward places the COOH 
group in the plane of the paper and the H atom be- 
hind, corresponding to structure (9a). The planar 
projection of (9) may be written as (9b). To avoid 
confusion in their use, the following conventions 
apply to planar projections: they must not be ro- 
tated in this plane through any but integral multiples 
of 180°; an odd number of exchanges of any pair 
of substituents is equivalent to transformation into 
its mirror image, and an even number of exchanges 
leaves the configuration unchanged. 

The relative configurations of stereoisomers ap- 
plicable to diastereomers are illustrated by the tar- 
taric acids (10) and (11). Structure (10) repre- 
sents (+)-tartaric acid (both stereocenters « have 
R configurations), and (11) represents meso-tartaric 
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acid, the diastereomer which is optically inactive 
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(the two stereocenters, one R and one S, are mirror 
images of one another. Older configurational symbol- 
ism (p and L) involving intermolecular comparisons 
to reference substances such as glyceraldehyde and 
serine persists, but it is giving way to the R-S desig- 
nations. 

Stereoregular polymers with defined configura- 
tions at stereogenic atoms or at carbon-carbon 
double bond stereocenters differ substantially in 
properties from their diastereomers. Polymeric car- 
bohydrates such as starch and cellulose and hydro- 
carbons such as rubber and gutta-percha exemplify 
natural diastereomer pairs. Synthetic polystyrene di- 
astereomers (12) [isotactic] and (13) [syndiotactic] 
differ only in the configuration at stereogenic atoms. 
The high-melting, fiber-forming isomer is isotactic 
polystyrene (12). 
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Configurational studies are powerful probes in the 
elucidation of reaction mechanisms. Replacement of 
substituent atoms or groups in a reaction is often at- 
tended by a change in configuration at the stereocen- 
ter (Walden inversion), or it may result in the loss of 
optical activity (racemization). In some cases reten- 
tion of configuration prevails. Such results provide 
evidence for the geometry of transition states or for 
the intervention of intermediates Gons or radicals). 

Resolution. Most chiral substances occur in nature 
as only one enantiomer. Few natural products are 
represented in nature by both enantiomers, and few 
also exist as mixtures with equal proportions of enan- 
tiomers called racemates. On the other hand, synthe- 
sis of the chiral substances in the laboratory or in in- 
dustry in the absence of chiral agents or conditions 
results in racemates which exhibit no measurable 
optical activity, since the activity of one enantiomer 
cancels that of the other. 

The separation of one or both enantiomers from 
a racemate, called resolution, requires the inter- 
vention of an optically active reagent, catalyst, or 
other chiral influence. The most common resolution 
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procedure involves the reversible conversion of an 
enantiomer mixture into a pair of diastereomers, 
with as widely different physical properties as prac- 
ticable, by reaction with a single pure enantiomer. 
For example, a (+)-amine reacts with a (+)-acid 
to give two diastereomeric salts: (+)-RNH3°(+)- 
R'COO® and (—)-R-NH;®(+)-R'COO®. 

These diastereomeric salts may be separated by 
fractional crystallization and may then be converted 
to the individual enantiomeric amines by cleavage 
with a strong base. The (+)-acid resolving agent may 
be recovered and reused. Covalent diastereomers 
may also be separated by crystallization or increas- 
ingly by gas, liquid, or thin-layer chromatography. 

Kinetic resolution takes advantage of differences 
in rates of reaction of enantiomers with chiral 
reagents. Enzymes are chiral catalysts which can pref- 
erentially catalyze reactions of one enantiomer. Enzy- 
matic resolutions play a major role in the syntheses 
of substances of biochemical importance. Chromato- 
graphic resolution on chiral stationary phases, which 
bind or dissolve one enantiomer more strongly than 
the other, constitutes another useful type of resolu- 
tion. 

In resolution by preferential crystallization or en- 
trainment, one enantiomer preferentially crystallizes 
upon seeding of supersaturated solutions with seed 
crystals of that enantiomer. Though relatively few 
racemates are amenable to this type of resolution, it 
is nevertheless of considerable importance on an in- 
dustrial scale. See CHROMATOGRAPHY; CRYSTALLIZA- 
TION. 

Conformational analysis. Molecules are not rigid 
collections of atoms. Torsional stereoisomerism 
arises as a consequence of the rotation (torsion) of 
atoms about bonds within molecules. This gives rise 
to stereoisomers called conformers which are inter- 
convertible by rotation about carbon-to-carbon sin- 
gle bonds. The existence of two extreme forms of 
ethane (in terms of energy), (14) [staggered] and 
(15) [eclipsed], was predicted by K. S. Pitzer in 
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1936 and later experimentally verified. An infinite 
number of forms (conformations) intermediate be- 
tween the equilibrium (Qow-energy) conformation 
(14) and the higher-energy conformation (15) is pos- 
sible. At room temperature most ethane molecules 
resemble (14). Though their presence is experimen- 
tally readily demonstrated, separation of conform- 
ers is normally not possible since the small energy 
difference between them [3 kcal/mole; 12.6 kilo- 
joules/mole in the case of (14), and (15) makes their 
interconversion facile]. The ease of interconversion 
explains why some enantiomeric structures, such as 
the gauche conformers (large groups, CH3, 60° apart) 


of n-butane (16), are incapable of resolution. 
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The relationship between the physical and chemi- 
cal properties of substances and their preferred con- 
formations (conformational analysis) has been the 
subject of many studies since 1950. Cyclohexane 
(17) exists overwhelmingly in the form of conformer 
(18; the chair form), which is the structural subunit 
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from which the carbon atom lattice of diamond is 
constructed. Substituents on six-membered rings are 
designated as either axial or equatorial, as in (18). 
Conformational analysis of synthetic hydrocarbon 
polymers such as polypropylene and biopolymers 
such as deoxyribonucleic acid (DNA) has led to an 
understanding of their properties. Much of the bi- 
ological activity of enzymes, for example, is made 
possible by the specific conformations adopted by 
these macromolecules. See CONFORMATIONAL ANAL- 
YSIS; ENZYME; POLYMER. 

Stereocontrolled synthesis. An understanding of 
the stereochemical consequences of chemical reac- 
tions and the determination of configuration of com- 
plex natural products has permitted the total syn- 
thesis of many stereochemically complex substances 
since around 1940. For example, cholesterol and gib- 
berellic acid each contain eight stereogenic carbon 
atoms, which makes possible, in principle, 2° = 256 
stereoisomers (128 racemates). Disregard for stereo- 
chemical consequences during synthesis would lead 
to extremely complex mixtures requiring repeated 
separation and to very low yields. 

Living organisms unerringly synthesize (biosyn- 
thesis) just one of these isomers in fully stereose- 
lective processes. Laboratory and commercial syn- 
theses of substances which have desirable biological 
properties have increasingly been devised so as to 
mimic biosyntheses by taking advantage of stereo- 
controlled reactions. These are of two types: stereo- 
specific reactions and stereoselective reactions. 
Stereospecific reactions are those in which differ- 
ent stereoisomers are transformed into stereochem- 
ically different products. For example, Z-2-butene 


(19) [cis] is epoxidized to the cis-epoxide (20) only. 
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The diastereomeric £-2-butene [trans] forms only the 
trans-epoxide. 

Stereoselective reactions are those in which a sin- 
gle reactant gives rise to a mixture of diastereomers 
in which one predominates. Those stereoselective 
reactions in which new stereocenters are formed in 
unequal amounts are called asymmetric syntheses. 
Transformations of optically active compounds into 
more complex substances which incorporate addi- 
tional stereocenters are also asymmetric syntheses. 

Syntheses of natural products which originate in 
achiral starting materials require at least one res- 
olution step if nonracemic products are desired. 
Alternatively, a total synthesis may well originate 
in a single enantiomer of a compound having but 
one or two stereogenic atoms. Additional stereocen- 
ters are introduced during the synthesis through 
stereospecific and especially through stereoselec- 
tive steps. Separation of diastereomers at various 
stages may be required. See STEREOSPECIFIC CATA- 
LYST. Samuel H. Wilen 
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The transmission of stereophonic audio signals over 
broadcast radio systems, including terrestrial analog 
amplitude-modulation (AM), frequency-modulation 
(FM) systems, and digital radio systems (terrestrial 
and satellite). See STEREOPHONIC SOUND. 

Analog systems. Two-channel stereo broadcasting 
of audio was first implemented for FM radio signals 
in the 88-108-MHz frequency band (allocated for 
FM radio broadcasting) in the 1960s, and became 
the dominant mode of transmission, in particular for 
music programming, in the 1970s. A multiplexing 
method was used to accomplish this and allowed 
the signal transmitted by a stereo broadcast station 
to be decipherable by either a stereo receiver or a 
monophonic receiver. For the monophonic receiver 
to receive a stereo broadcast, some component of 
the broadcast must be a single signal that includes 
information from both the left and the right chan- 
nels. This is called the Z + R signal. A monophonic 
receiver simply demodulates the Z + R signal and 
delivers it to the listener. 
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Baseband spectrum of an FM stereo signal. This audio and 
supersonic spectrum is frequency-modulated onto a 
radio-frequency carrier. 


The stereo receiver must use the L + R signal and 
other transmitted information to demodulate sepa- 
rate Z and R signals. For this, stereo broadcasts in- 
clude an L — R signal. In the stereo receiver, after 
detection of the Z + R and L — R signals, the Z and 
R signals are separated as shown in Eqs. (1) and (2). 


(L+R)+@—R)=2L qd) 
Œ+- -R= 2R (@) 


The subtraction used in Eq. (2) is realized through 
phase shifting and addition. In FM stereo trans- 
missions, the L + R signal is directly frequency- 
modulated onto the radio-frequency carrier, assur- 
ing compatibility with monophonic systems (see 
illustration). The Z — R signal is used to amplitude- 
modulate a 38-kHz hypersonic subcarrier. The 38- 
kHz carrier is suppressed, creating a double-sideband 
(DSB), suppressed-carrier AM signal at 38 kHz. This 
hypersonic signal is then frequency-modulated onto 
the radio-frequency carrier. A 19-kHz pilot signal is 
also transmitted to indicate that the broadcast is 
stereo and to provide an accurate frequency refer- 
ence for demodulation of the 38-kHz Z — R signal. 
See AMPLITUDE MODULATION; FREQUENCY MODULA- 
TION. 

In the FM stereo receiver, the entire baseband 
signal that has been frequency-modulated onto a 
radio-frequency carrier ( L + R, pilot, and double- 
sideband L — R) is first detected with an FM dis- 
criminator. The Z + R signal is isolated by a low- 
pass filter. The pilot tone is doubled and mixed 
with the double-sideband suppressed-carrier L — 
R signal, so that the Z — R signal can be demod- 
ulated by an AM envelope detector. The L + R 
and Z — R audio signals are then passed through a 
network that implements the functions expressed 
in Eqs. (1) and (2). The resulting left- and right- 
channel audio signals are amplified and presented 
to the listener. See AMPLITUDE-MODULATION DETEC- 
TOR; ELECTRIC FILTER; FREQUENCY-MODULATION DE- 
TECTOR; FREQUENCY-MODULATION RADIO. 

It is also possible to send stereo audio signals using 
more than two channels over FM radio transmis- 
sions, although this technology is not standardized 
for use in the United States by the Federal Commu- 
nications Commission (FCC), as is the two-channel 
technology just described. SRS Circle Surround, Neu- 
ral Audio, and Dolby Laboratories have all developed 
proprietary surround sound technologies which are 
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compatible with two-channel stereo FM broadcasts 
and which provide up to “6.1” channels of surround 
sound information (the “.1” represents a seventh, 
low-frequency channel with frequency response lim- 
ited to approximately 100 Hz). 

Stereo broadcasting for the AM band (550- 
1701 kHz) was first authorized by the FCC in 1982, 
when five different competing (and incompatible) 
systems were authorized because the FCC was un- 
willing to select just one and chose instead to let 
the marketplace decide. This proved to be a fateful 
decision since over the next 11 years the various 
AM stereo systems competed for market share. In 
the end, due in part to the confusion created by so 
many systems being available, AM stereo never be- 
came a popular service. In 1993, the FCC revisited 
their decision and finally selected a single system, 
called compatible quadrature amplitude modulation 
(C-QUAM), but receiver manufacturers and broad- 
casters never fully embraced this technology. In 2002 
the FCC authorized a new digital radio service for the 
AM band, called in-band/on-channel (BOC) digital 
radio, which supports stereo audio but is incompat- 
ible with C-QUAM, effectively ending the analog AM 
stereo saga in the United States. 

Digital systems. A number of digital radio systems 
are being implemented around the world, and in 
most cases these systems support the transmission 
of not only two-channel stereo but also “surround 
sound” stereo signals with five or more channels. 
In 2005, satellite digital audio radio service (SDARS) 
provider XM Radio began broadcasting some of its 
music channels in 5.1-channel surround using tech- 
nology developed by Neural Audio, and a number of 
terrestrial FM radio broadcasters were also experi- 
menting with surround sound using the IBOC digital 
radio broadcast system authorized for the FM band 
by the FCC in 2002. David H. Layer; Jeffrey Ward 
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A system of sound recording or transmission in 
which signals are captured, mixed, or synthesized 
using two or more audio channels in such a way 
as to deliver a spatial or three-dimensional auditory 
impression to a listener when these audio chan- 
nels are connected to loudspeakers in a listening 
room. Binaural or “head related” stereo is a term 
used to describe the capture or synthesis of a pair 
of signals that closely resemble those present at a 
listener’s ears in a natural spatial sound field. Such 
signals are intended for reproduction using a pair 


of headphones worn by the listener, although they 
can be reproduced using loudspeakers if suitable 
signal processing is employed. See BINAURAL SOUND 
SYSTEM. 

Stereo recording techniques were patented in the 
1930s, but commercial development was not under- 
taken at that time. In the late 1930s and early 1940s, 
a few motion picture films were produced using two 
optical tracks for stereo playback. A variety of multi- 
channel film systems were introduced in the 1950s. 
The commercial recording industry introduced two- 
track stereo tape recordings in 1954 and single- 
groove, two-channel stereo disks in 1957. The first 
stereo broadcasts on FM radio were made in 1961. 
In the 1970s, several quadraphonic (four-channel) 
systems were introduced, but none achieved lasting 
commercial success. From the late 1990s, “surround 
sound” stereo systems involving five or more chan- 
nels began to achieve commercial success, primar- 
ily as an adjunct to home television systems. Wave- 
field synthesis systems, which reached the market 
in the early years of the twenty-first century, use 
very large numbers of loudspeakers in an attempt 
to recreate acoustical wavefronts in a physically ac- 
curate fashion. See STEREOPHONIC RADIO TRANSMIS- 
SION. 

Auditory cues in stereophony. The aim of stereo- 
phonic recording and reproduction is to deliver con- 
sistent auditory spatial cues to the listener to give 
rise to an intended spatial impression. It is nonethe- 
less common for stereophony to rely on a degree of 
auditory illusion for this purpose, as it is not usually 
possible to reconstruct acoustical sound fields in a 
physically accurate manner over a wide listening area 
when using a limited number of loudspeakers. 

In order to determine the origin (location) of a 
sound, the brain considers the differences in sound 
pressure level and arrival time at each ear. The ex- 
ternal ear (pinna) also gives rise to modifications to 
the frequency spectrum of sound signals, which are 
unique to each source location and can represent 
height information as well as horizontal cues. Fluc- 
tuations in these cues contribute to a sense of spa- 
ciousness or width. See HEARING (HUMAN). 

Stereophonic signals are created either by cap- 
turing the relevant cues using two or more micro- 
phones, or by artificially introducing the necessary 
interchannel differences using audio signal process- 
ing. The perceived source images that result from 
these cues during reproduction are known as phan- 
tom images because they are not real sources. In bin- 
aural stereo or wavefield synthesis, they are some- 
times known as virtual sources. 

Stereophonic sound pickup. There are three basic 
microphone techniques for two-channel stereo- 
phonic pickup: coincident, spaced, and individual in- 
strument (also called close miking). The techniques 
are sometimes combined. 

The coincident technique uses two directional 
microphones located very close together, often en- 
closed in the same case. The coincident technique 
relies only on capturing differences in sound level 
between the channels to encode different source 


positions. This can give rise to very precise stereo 
imaging. See MICROPHONE. 

Spaced microphone techniques encode spatial 
cues using both time and level differences between 
the channels. These differences lead to directional 
perception based either on the “precedence effect,” 
whereby the sound is localized toward the earlier 
loudspeaker signal, or on “summing localization,” in 
which the perceived source position is the result of 
a trade-off between the two cues. Summing local- 
ization tends to work with interchannel time delays 
up to just over 1 millisecond, combined with inter- 
channel level differences of up to 18 dB. Techniques 
can rely on large time differences and small level dif- 
ferences, or vice versa, but the quality of the stereo 
image resulting from different combinations is not 
identical. Techniques relying on summing localiza- 
tion tend to be known as “near-coincident” tech- 
niques because of the relatively small microphone 
spacings involved. Because of the time delay, spaced 
techniques can cause noticeable phase cancellations 
in monophonic reproduction. 

Since the late 1960s, it has become common for 
much stereo program material to be recorded using 
a large number of microphones, each located close 
to one source or a small group of sources. By pro- 
portionally mixing the microphone outputs between 
the stereo channels, it is possible to provide the in- 
terchannel level differences necessary for perceiving 
phantom images. Artificial effects and reverberation 
may be added subsequently to create a spatial im- 
pression, or signals from microphones located in the 
reverberant field of a studio may be mixed with the 
dry signals. 

Surround sound recording extends these concepts 
to more than two channels to create an immersive 
sound field. However, it is common to find that 
accurate stereophonic imaging is attempted only 
in front of the listener, using techniques similar 
to those described above, while the rear channels 
are used to enhance the experience of spacious- 
ness and envelopment. This is particularly true of 
systems based on the International Telecommunica- 
tions Union-Radiocommunication Sector document 
ITU-R BS.775 layout, known as 3-2 stereo or 5.1 
surround, described below. See SOUND RECORDING; 
SOUND-RECORDING STUDIO. 

Stereophonic reproduction. The ability to create 
convincing stereophonic reproduction is highly de- 
pendent upon the performance of the loudspeakers, 
the acoustics of the listening room, and the hearing 
process of the listener. 

The loudspeakers should be identical and capable 
of broad frequency response with low distortion. 
Imaging is degraded by narrow or irregular disper- 
sion of frequencies above about 500 Hz. Most high- 
quality loudspeaker systems have a horizontal cover- 
age angle of 90 to 120°, above 500 Hz. The vertical 
angle is usually less, and often is 40 to 60°. The uni- 
formity of the coverage pattern is an important mea- 
sure of the loudspeakers’ quality. All loudspeakers 
have wider coverage at low frequencies. See LOUD- 
SPEAKER. 


Stereophonic sound 


For two-channel stereo, the ideal arrangement is 
one where the loudspeakers subtend an angle of 
+30° at the listening position. This arrangement is 
maintained in the ITU BS.775 configuration for 3- 
2 Stereo. (5.1 surround), in which a center loud- 
speaker is added between the left and right front 
loudspeakers, as well as two rear-side loudspeakers 
at +110°. The rear-side locations were chosen as a 
compromise between the requirements for movie 
applications and those for music reproduction. In 
movie applications, the capacity to deliver effects 
from the rear is desirable, whereas for music re- 
production the research evidence suggests a need 
to recreate lateral reflections to optimize envelop- 
ment. 

The loudspeakers should be located in similar 
acoustical surroundings. The surfaces near each 
loudspeaker should be similar in acoustical qual- 
ity in order that sound reflected from the area 
of each loudspeaker will be the same for the lis- 
tener. The reverberation time of the listening room 
should ideally be less than about 0.5 s. Hard sur- 
faces near the loudspeakers can reflect sound en- 
ergy, which when delayed only a few milliseconds 
by its slightly longer path will combine with the 
direct sound in a frequency-dependent manner to 
produce irregularities in what should be a smooth 
frequency response at the listener’s position. This 
“comb filter” effect can be reduced or avoided by 
making all surfaces around the loudspeakers acous- 
tically very absorbent. In some professional listen- 
ing rooms, such as recording or broadcast con- 
trol rooms, this surface treatment extends from the 
loudspeakers to the listener’s position; all wall sur- 
faces behind the listener are made reflective and dif- 
fusive. 

A modern high-quality loudspeaker system con- 
sists of two or more radiating units in the same en- 
closure, each reproducing only a part of the sound 
spectrum. These radiators cannot coexist in the same 
location; therefore, there can be path length differ- 
ences (hence, time differences) between them and 
the listener. Because the hearing process is sensi- 
tive to short time delays, phantom images of sound 
sources being partly reproduced by two or more radi- 
ators in each loudspeaker will tend to smear or split 
in space. However, even among trained listeners, not 
all people can hear, or are distracted by, this effect. 
For those who are annoyed, several manufacturers 
have introduced time- (or phase-) aligned loudspeak- 
ers. Time alignment requires that path lengths be 
made equal by physical location of the radiators or 
by use of an electronic delay in order to synchronize 
arrival times at the listener’s position. See SOUND- 
REPRODUCING SYSTEMS. 

Rollins Brook; Francis Rumsey 
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Stereoscopy 


The phenomenon of simultaneous vision with two 
eyes, producing a visual experience of the third di- 
mension, that is, a vivid perception of the relative 
distances of objects in space. In this experience the 
observer seems to see the space between the ob- 
jects located at different distances from the eyes. The 
stereoscopic effect is unique and cannot be easily de- 
scribed to one who does not possess it. Stereopsis, 
or stereoscopic vision, provides the individual with 
the most acute sense of relative depth and is of vital 
importance in visual tasks requiring the precise lo- 
cation of objects. 

Stereopsis is believed to have an innate origin in 
the anatomic and physiologic structures of the reti- 
nas of the eyes and the visual cortex. It is present in 
normal binocular vision because the two eyes view 
objects in space from two points, so that the retinal 
image patterns of the same object points in space are 
slightly different in the two eyes. The stereoscope, 
with which different pictures can be presented to 
each eye, demonstrates the fundamental difference 
between stereoscopic perception of depth and the 
conception of depth and distance from the monoc- 
ular view. In the illustration, each of the two eyes 
views a pair of vertical lines A and B drawn on cards. 
The separation of these lines for the right eye is 
greater than that for the left eye. If the difference 
in separation is not too great, the images of the 
lines fuse when the two targets are observed by the 
two eyes. There is almost an immediate stereoscopic 
spatial experience that the two lines are located in 
space, line B being definitely more distant than line A. 
No hint of this spatial experience occurs in the obser- 
vation of each target alone. This difference between 
the images in the two eyes provides the stimulus for 
emergence of the stereoscopic experience. 

Stereoscopic acuity and the presence of stereop- 
sis may be tested by several methods or instruments 
such as the ordinary hand stereoscope and similar de- 
vices, vectograph targets, the Howard-Dolman peg 
test, the Verhoeff stereopter, the Hering falling bead 
test, pins with colored heads stuck in a board to 
test for near vision, and afocal meridional magnify- 
ing lenses placed before one eye while the subject 


right eye 


Diagram of stereoscopic vision. 





views a special field (leaf-room or tilting table). See 
VISION. Kenneth N. Ogle 
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Steric effect (chemistry) 


The influence of the spatial configuration of reacting 
substances upon the rate, nature, and extent of reac- 
tion. The sizes and shapes of atoms and molecules, 
the electrical charge distribution, and the geometry 
of bond angles influence the courses of chemical 
reactions. The steric course of organochemical re- 
actions is greatly dependent on the mode of bond 
cleavage and formation, the environment of the re- 
action site, and the nature of the reaction conditions 
(reagents, reaction time, and temperature). The ef- 
fect of steric factors is best understood in ionic re- 
actions in solution. The nucleophilic substitution re- 
action at a saturated carbon atom can serve as an 
illustration. 

Saturated nucleophilic substitution. While the react- 
ing carbon atom is in an electron-deficient state in 
the transition state (state of highest energy, some- 
where between starting material and product) in a 
nucelophilic substitution process, the reaction can 
be varied from a two-step, unimolecular ionization 
to a one-step, bimolecular transformation. The for- 
mer mode of reaction, a solvolysis or Sy1 process, 
converts a tetrahedral carbon into a solvated pla- 
nar carbonium ion intermediate, and hence leads to 
racemization (randomization of configuration); for 
example, reaction (1) where the R terms represent 
organic groups. 
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See REACTIVE INTERMEDIATES. 

The second reaction path proceeds by a simulta- 
neous rupture of the old bond and creation of a new 
one, either by inversion of configuration, a Walden 
inversion or Sy2 process, for example, reaction (2); 
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or in a few cases by a front-side displacement 
of part of the substituent already present, an Sy 
i process, and hence, retention of configuration, 


for example, reaction (3). 
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These substitution reactions are highly solvent- 
dependent; for example, tert-butyl chloride under- 
goes solvolysis close to 500,000 times faster in water, 
a solvent of high ionizing power, than in ethanol, a 
solvent of low dielectric properties. The nature of R, 
R’, and R” is one of the factors determining which of 
the above pathways a compound prefers for its sub- 
stitution. The larger the size of these three groups, 
the greater is the tendency to relieve steric strain 
by extrusion of X, and thus the more the need for an 
Syl process. The smaller the size of the environment, 
the greater is the accessibility of the reagent from the 
backside, and thus the more tendency toward an Sy2 
process. As a consequence, tertiary compounds un- 
dergo racemization readily, whereas primary systems 
prefer inversion. 

Both the rate and the steric course of an ionic dis- 
placement may depend often on the ability of groups 


Steric effect (chemistry) 


adjacent to the reaction site to accommodate a pos- 
itive charge. Substitution of a-halo ethers and allyl 
or benzyl halides occurs much faster than a simi- 
lar reaction of unsubstituted halides because of the 
intermediacy of the stabilized cations or cationlike 
transition states in notation (4). 

In view of the charge distribution over more than 
one atom in these cases, sometimes the incoming 
reagent forms a bond at a site different from that of 
the leaving group. As a consequence, an Sy2’ pro- 
cess, reaction (5), or an Syi’ path, reaction (6), may 
result. Both processes yield retention of configura- 
tion; that is, the orientation of the new substituent 
on its carbon atom is identical with that originally 
held by the former functional group on a site that is 
two carbon atoms removed. 

In the presence of participating neighboring 
groups, even solvolyses can lead to retention of con- 
figuration. Certain rigidly held homoallyl systems un- 
dergo substitution at a site that is three carbons re- 
moved from the position of the leaving group, but 
with retention of configuration, as represented by 
reaction (7). 

Double inversion is responsible for the retained 
configuration of the products of solvolysis of a-halo 
acid salts, reaction (8). 
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Solvolysis of trans-B-acetoxy systems in nonaque- 
ous media leads to trans products; in the presence 
of water, cis compounds are obtained, reactions (9), 
where Ac = acetyl group (CH;CO—). 

Base treatment of trans-halohydrins leads to 
trans-vicinal glycols. The intermediate epoxide is 
isolable. Although ring opening of the latter may 
yield two different trans products, the diaxial one 
is formed preferentially in cycloalkane cases, reac- 
tion (10). 

Rearrangement. Migration of neighboring groups 
toward the reaction site, resulting in skeletal re- 
arrangements, is a common occurrence. Both the 
internal displacement of the leaving group by the 
migrating group and the subsequent external dis- 
placement of the migrating group by the solvent or 
added reagent proceed in a trans sense, that is, by 
back-side approach. Thus, the overall steric conse- 
quence of one migration sequence is retention of 
configuration. 

There are several examples of 1,2-hydride migra- 
tion, for example, reaction (11). 


Transannular hydride shifts are quite similar in na- 
ture, as in reaction (12). 

The 1,2 migration of phenyl groups proceeds by 
way of fairly stable phenonium ions, reaction (13). 

The Wagner-Meerwein rearrangement of saturated 
neighboring groups shows directional effects simi- 
lar to those of the above migrations; for example, 
the conversion of camphene hydrochloride to bornyl 
chloride is stereospecific, with retention of configu- 
ration, reaction (14). 

Neopentyl halides solvolyze to tertiary amyl 
derivatives, reaction (15). 

Cyclohexane systems with equatorial leaving 
groups may undergo contraction to five-membered 
rings, reaction (16). 

Organic compounds possessing potential leaving 
groups at the bridgehead of small bicyclic ring sys- 
tems undergo substitution processes only sluggishly. 
In the absence of ready access at the backside of the 
reaction center, the Sy2 pathway is excluded. The in- 
ability of the compounds to form planar carbonium 
ions precludes a Sy1 route. However, displacements 
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do occur slowly at elevated temperatures, presum- 
ably via nonplanar cations. 

Unsaturated nucleophilic substitution. Nucleophilic 
substitution reactions at unsaturated carbon atoms 
can take place by two possible mechanistic routes, 
an elimination-addition process and an addition- 
elimination scheme. The former route is best illus- 
trated by the transformation of aromatic halides into 
anilines, reaction (17). 

The latter is encountered in the interconversion 
of carboxylic acids and their derivatives, for exam- 
ple, reaction (18). Because the central carbon atom 
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has greater steric requirements in the reaction inter- 
mediate than in the starting material, the reaction 
velocity is strongly dependent on the size and num- 
ber of neighboring groups; that is, an increase in the 
bulkiness of R is reflected in a decrease of the rate of 
the reaction. 

The addition-elimination mechanism is portra- 
yed also by the aromatic nucleophilic substitution 
reaction, for example, (19). In order to be able to sta- 
bilize the reaction intermediate, the all-important 
nitro group must be coplanar with the benzene 
ring. As a consequence, ortho substituents, which 
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may block this steric requirement, retard the reac- 
tion rate. Unusual aromatic nucleophilic substitu- 
tions have been observed in cases where steric hin- 
drance by ortho substituents has prevented addition 
to aromatic ketones to occur, for example, reaction 
(20). 

Addition reactions. The steric course of addition 
reactions at unsaturated sites depends largely on the 
reagent. Catalytic hydrogenation, a nonhomogene- 
ous process of undetermined mechanism, occurs in 
a cis manner. In the absence of any steric interfer- 
ence, it leads to thermodynamically stable products. 
In the presence of steric hindrance, the two new 
hydrogen atoms are usually introduced on the least 
hindered side of the unsaturated compounds. How- 
ever, sometimes some bulky polar groups actually 
aid, rather than retard, adsorption of the catalyst on 
their side of the reducing compound, thereby lead- 
ing to products of opposite configuration. 

The oxidation of olefins to vicinal glycols by per- 





manganate salts or osmium tetroxide also proceeds 
in a cis fashion and involves the least-hindered side 
of the reacting substrate. The Diels-Alder reaction 
behaves similarly, for example, reaction (21). All 
addition processes, during which two new bonds 
are formed more or less simultaneously, yield cis 
adducts. See DIELS-ALDER REACTION. 

Ionic addition reactions of olefins and acetylenes 
occur in a trans manner. The mode of addition is 
such as to lead to product via the most stable cations 
(Markovnikoff addition), for example, reaction (22). 

Halogen addition to cyclic olefins leads to trans 
diaxial dihalides which, on standing, isomerize to 
the more stable trans diequatorial dihalides, reac- 
tion (23). 

Ionic addition reactions of carbonyl compounds 
follow a steric course very similar to those of olefins. 
However, the reagents are mostly nucleophilic, and 
some reactions are equilibrium processes, for exam- 
ple, reaction (24). The orientation of attack and the 
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reaction rate are governed by the environment of the 
carbonyl group (C=O). 

Addition reactions of conjugated carbonyl systems 
can occur through cation as well as anion intermedi- 
ates, but they uniformly place the nucleophilic part 
of the reagents on the 6-carbon atoms, for example, 
reactions (25) and (26). 

The reaction of carbonyl compounds as enol or 
enolate anions with electrophilic reagents can take 
two different courses. If the process is kinetically 
controlled, the electrophile, a proton, halonium 
ion, or others, interacts with the substrate on its 
least-hindered side. However, if the reaction is 
thermodynamically controlled, then, independent 
of mechanism, the most stable product is obtained. 

Elimination reactions. Elimination reactions can be 
carried out by pyrolysis of esters, halides, or amine 
oxides in the liquid or vapor phase. These elimina- 
tions always involve a rupture of vicinal cis bonds. 

Alternatively, similar cleavage processes can be 
made to occur ionically in solution, in which case 
they proceed in a trans fashion. The direction of elim- 
ination depends greatly on the molecularity of the 
process, as well as on the sizes of the leaving group 
and the attacking base. The two-step, unimolecular 
cleavage, an F1 process, leads predominantly to the 
more substituted, hence more stable, olefins, for ex- 
ample, reaction (27). 

The one-step bimolecular elimination (an E2 pro- 


cess) of neutral compounds yields similar products, 
for example, reaction (28). However, an E2 reac- 
tion on positive ions, ammonium or sulfonium salts, 
affords the less substituted olefin in preponderant 
yield, reaction (29). 

Reactions leading to the more stable products are 
said to follow the Saytzeff rule, whereas those yield- 
ing less stable olefins obey the Hofmann rule. Be- 
cause the transition state in the E2 reaction is of low- 
est energy when all atoms involved in the elimination 
are in a plane, the fastest rates among cyclic com- 
pounds are encountered in the cases which permit 
a diaxial alignment of vicinal substituents. 

Many ionic elimination reactions are known which 
involve the rupture of more than two bonds, for 
example, reaction (30). 
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The £2’ processes, eliminations of two groups of 


carbon atoms separated by an olefinic linkage, ap- 
pear to be cis in nature, reaction (1). 


o H7 
ed (A OH© c Ee 
a \ E / ~~ es ee \ / AS GB) 
— = c—C 
/ \ 4 N 


474 


Sterilization 


N(CH3)2 
R Z : R 
Is 


Electrophilic substitution. Steric factors have a fair 
control over the course of the aromatic electrophilic 
substitution reaction. In reactions of compounds 
containing ortho-para directing substituents, the 
ortho/para product ratio is usually greater than 1, 
and increases with the size of the substituent and that 
of the reacting species. The rate-accelerating partici- 
pation of electrondonating groups, located ortho or 
para to the incoming substituent, in stabilizing the 
transition state is greatly diminished in the presence 
of bulky ortho neighbors which would prevent the 
groups from attaining coplanarity with the benzene, 
for example, reaction (32). 
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Sterilization 


An act of destroying all forms of life on and in an 
object. A substance is sterile, from a microbiological 
point of view, when it is free of all living microor- 
ganisms. Sterilization is used principally to prevent 
spoilage of food and other substances and to prevent 
the transmission of diseases by destroying microbes 
that may cause them in humans and animals. 

Microorganisms can be killed either by physical 
agents, such as heat and irradiation, or by chemical 
substances. Regardless of the manner in which they 
are killed, they generally die at a constant rate under 
specified environmental conditions. If the logarithm 
of the number of survivors is plotted against time, 
the resulting curve is a straight line. 

When testing a substance for sterility, care must 
be taken to employ appropriate techniques. A bac- 
terial cell is considered to be killed when it is no 
longer capable of reproducing itself under suitable 
environmental conditions. If an inadequate medium 
is employed to subculture the treated bacteria, the 


substance being tested may be wrongly considered 
to be sterile. 

By far the most resistant of all forms of life, to 
both physical and chemical killing agents, are some 
of the bacterial endospores. If they did not exist, ster- 
ilization of such materials as bacteriological media 
and equipment, hospital supplies, and canned foods 
would be much simpler. 

Heat sterilization. This is the most common 
method of sterilizing bacteriological media, foods, 
hospital supplies, and many other substances. Either 
moist heat (hot water or steam) or dry heat can be 
employed, depending upon the nature of the sub- 
stance to be sterilized. Moist heat is also used in pas- 
teurization, which is not considered a true steriliza- 
tion technique because all microorganisms are not 
killed; only certain pathogenic organisms and other 
undesirable bacteria are destroyed. See PASTEURIZA- 
TION. 

Moist-heat sterilization. Some bacterial endospores are 
capable of surviving several hours at 212°F (100°C). 
Therefore, for moist-heat sterilization, an autoclave, 
pressure cooker, or retort, with steam under pres- 
sure, is required to achieve higher temperatures 
(Fig. 1). 

Most bacteriological media are sterilized by auto- 
claving with steam at 250°F (121°C) with 15 lb pres- 
sure (10? kilopascals) for 20 min or more, depend- 
ing upon the volume of material being heated. Some 
spores are capable of surviving moist-heat steriliza- 
tion equivalent to at least 7 min at 250°F (121°C). 

Steam under atmospheric pressure in an Arnold 
sterilizer is sometimes employed for specialized bac- 
teriological media that are easily heat damaged. Be- 
cause many bacterial spores survive this treatment, 
it is obviously inadequate to ensure sterility. 

Tyndallization. The food or medium is steamed for 
a few minutes at atmospheric pressure on three or 
four successive occasions, separated by 12- to 18-h 
intervals of incubation at a favorable growing tem- 
perature. In theory the intervals of incubation allow 
any surviving bacterial spores to germinate into more 
heat-sensitive, vegetative cells, which then would 
be killed during the next heat treatment. However, 
spores like vegetative cells may require special condi- 
tions such as an appropriate medium, proper oxygen 
tension, or proper temperature to germinate and re- 
produce. These conditions may not be realized dur- 
ing the intervals between heat treatment, and no 
matter how often the steaming is repeated ungermi- 
nated spores may survive and eventually germinate 
when conditions have been changed. The survival 
of ungerminated spores reduces the effectiveness of 
this method and it has been supplanted by other 
methods. 

Hot-air sterilization. Glassware and other heat-resistant 
materials which need to be dry after treatment are 
usually sterilized in a hot-air sterilizer. Dry steril- 
ization requires heating at higher temperatures and 
for longer periods of time than does sterilization by 
steam under pressure. Temperatures of 320-330°F 
(160-165°C) for at least 2 h is generally employed in 
hot-air sterilization. Dry heat kills the germs through 


denaturation of protein which may involve oxidative 
processes. 

Radiation sterilization. Many kinds of radiations are 
lethal, not only to microorganisms but to other forms 
of life. These radiations include both high-energy par- 
ticles as well as portions of the electromagnetic spec- 
trum. The mechanism of the lethal action of these 
radiations is not entirely clear. It may involve a di- 
rect energy absorption at some vital part of the cell 
(direct target theory), or the production of highly 
reactive, ionized, free radicals near some vital part 
of the cell Gindirect effect). Bacterial endospores are 
relatively resistant to all types of radiation. See RADI- 
ATION BIOLOGY. 

Ultraviolet radiation. Radiant energy in the ultraviolet 
region of the spectrum is highly bactericidal, espe- 
cially at wavelengths of approximately 265 nanome- 
ters. Lamps which generate ultraviolet radiation in 
this region are useful for the sterilization of air and 
smooth surfaces. Ultraviolet rays have very low pen- 
etrative capacity, since even a thin layer of glass ab- 
sorbs a high percentage of the rays. Some irradiated 
cells that are presumably dead may be photoreacti- 
vated with visible light. 

Gamma rays. These are high-energy, electromag- 
netic radiations similar to x-rays. They have great 
penetrative capacity, and their energy is dissipated 
in the production of ionized particles from the ma- 
terial being irradiated. Radioactive isotopes, such 
as cobalt-60, are a common source of gamma rays 
(Fig. 2). Gamma irradiation of foods has received 
much attention as a means of sterilizing foods with- 
out cooking them. Sterilization requires a radiation 
dose of approximately 5,000,000 rads (5 x 10° ergs 
of energy absorbed per gram). 

Cathode rays. These high-speed electrons (beta rays) 
may be generated with various types of electron ac- 
celerators. This type of radiation has relatively low 
penetrative capacity, depending upon the energy 
level of the emitted electron beam. Cathode rays 
sterilize in a manner identical to gamma rays and 
without significantly raising the temperature of the 
material being irradiated. They have received some 
application in the sterilization of surgical supplies 
and drugs and in the experimental sterilization of 
foods (Fig. 3). 

Filtration sterilization. This is the physical removal 
of microorganisms from liquids by filtering through 
materials having relatively small pores. Sterilization 
by filtration is employed with liquid that may be de- 
stroyed by heat, such as blood serum, enzyme so- 
lutions, antibiotics, and some bacteriological media 
and medium constituents. Examples of such filters 
are the Berkefeld filter (diatomaceous earth), Pasteur- 
Chamberland filter (porcelain), Seitz filter (asbestos 
pad), and the sintered glass filter. Most of these filters 
are available in different pore sizes. 

The mean pore size of bacteriological filters is not 
the only determinant in their effectiveness. The elec- 
tric charge of the pore surfaces tends to adsorb the 
bacteria and thus prevent their passage. Most bac- 
teria have a net negative electrical charge on their 
surfaces. Usually, bacteriological filters will permit 
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Fig. 1. Diagrammatic sketch of an autoclave. (American Sterilizer Co.) 


the passage of viruses, which are then called filter- 
able. 

A Millipore filter is a specially prepared membrane 
molecular filter designed to remove bacteria from 
water, air, and other materials, for the purpose of 
estimating quantitatively the bacterial population. A 
sterile filter disk is assembled in a filtration unit, and 
a specified volume of water or solution is drawn 
through the disk which then retains the bacteria. The 
filter disk is removed and placed in a sterile petri 
dish containing an absorbent pad, previously satu- 
rated with an appropriate bacteriological medium. 
Upon incubation, colonies will develop on the filter 
disk wherever bacteria were entrapped at the time 
of filtering. Special differential, or selective, media 
can be employed for the purpose of detecting quan- 
titatively specific types of bacteria from the original 
material. 
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Chemical sterilization. Chemicals are used to 
sterilize solutions, air, or the surfaces of solids. 
Such chemicals are called bactericidal substances. 
In lower concentrations they become bacteriostatic 
rather than bactericidal; that is, they prevent the 
growth of bacteria but may not kill them. Other terms 
having similar meanings are employed. A disinfec- 
tant is a chemical that kills the vegetative cells of 
pathogenic microorganisms but not necessarily the 
endospores of spore-forming pathogens. An antisep- 
tic is a chemical applied to living tissue that prevents 
or retards the growth of microorganisms, especially 
pathogenic bacteria, but which does not necessarily 
kill them. 

The death of microorganisms subjected to bacteri- 
cidal substances can be expressed exponentially, in 
that a straight-line graph is produced when the loga- 
rithm of survivors is plotted against time. The more 
concentrated the chemical employed, the greater is 
the rate of death. 

There are hundreds of chemicals that may be con- 
sidered to have sterilizing or bactericidal properties, 
depending upon the particular use for which they 
are intended. A chemical may be particularly use- 
ful for one purpose but not for another. Many are 
widely used for sterilization or disnfection of air, 
water, tabletops, surgical instruments, and so on. 

The desirable features sought in a chemical steril- 
izer are toxicity to microorganisms but nontoxicity 
to humans and animals, stability, solubility, inability 
to react with extraneous organic materials, penetra- 


Fig. 2. Cobalt-60 furnace which is designed for 
experimental food sterilization. 
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tive capacity, detergent capacity, noncorrosiveness, 
and minimal undesirable staining effects. Rarely does 
one chemical combine all these desirable features. 

Among chemicals that have been found useful as 
sterilizing agents are the phenols, alcohols, chlorine 
compounds, iodine, heavy metals and metal com- 
plexes, dyes, and synthetic detergents, including the 
quaternary ammonium compounds. 

Chlorine. Chlorine and chlorine-containing com- 
pounds represent the most widely used group of 
disinfectants. Chlorine gas is often used to purify mu- 
nicipal water supplies. 

Various compounds of chlorine, such as the 
hypochlorites and chloramine, have many industrial 
and domestic uses as disinfectants or antiseptics. 

Ozone. This is a highly oxidizing gas (O3) used as a 
deodorizer, and also for disinfection of air and water. 
It has found some use in the food fields, but effec- 
tive bactericidal concentrations may be irritating and 
toxic to humans. 

Hydrogen peroxide. This chemical (H202) has high ox- 
idizing and bleaching qualities and is usually em- 
ployed in a 3% solution for topical application and 
disinfection of cuts, scratches, and minor wounds. 
It decomposes into water and oxygen and therefore 
is used where no taste, odor, or toxic residues are 
permitted. 

Volatile organic compounds. Such compounds as 
formaldehyde and ethylene oxide have been used for 
the disinfection of sickrooms occupied by patients 
suffering from contagious disease (terminal disin- 
fection), and of solids that do not permit heat treat- 
ment. These volatile substances have the advanta- 
ges of effective penetrative capacity and ease of 
removal after treatment. 

Volatile organic substances are sometimes 
employed as bacteriostatic agents to preserve 


bacteriological medium constituents until they are 
heat-sterilized. Charles F. Niven, Jr. 

Bibliography. S. S. Block (ed.), Disinfection, Steril- 
ization, and Preservation, 5th ed., 2001; P. Murray 
et al., Medical Microbiology, 2000. 
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Any of a subset of biological molecules known as 
lipids that share a specific, fused four-ring struc- 
ture comprising three rings of six carbon atoms and 
one ring of five carbon atoms. Different types of 
steroids have different chemical modifications to this 
common ring structure (see illustration). Steroids 
are found in animals, plants, fungi, and protozoa. 
In mammals, steroids are components of cell mem- 
branes, aid in digestion, and act as hormones to con- 
trol a wide range of physiological responses. Because 
of their role in governing physiological processes, 
naturally occurring and synthetic steroids are used 
extensively in medicine to treat a wide range of con- 
ditions. See LIPID; STEROL. 

Cholesterol. Cholesterol is the most abundant 
steroid in animals. It is found primarily in the plasma 
membrane of cells, where it is a major component 
of the lipid bilayer. It is also found, usually to a lesser 
extent, in the membranes of subcellular organelles 
such as mitochondria. Because cholesterol possesses 
a fused ring structure, it has greater rigidity than 
the other lipid components of these membranes and 
therefore tends to reduce the fluidity of membranes 
by restricting the movements of the more flexible 
lipid components. Thus, cholesterol plays an impor- 
tant role in determining the physical characteristics 
of biological membranes. In addition, cholesterol is 
one of the major components of lipid rafts, localized 
regions within the plasma membrane that are more 
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Structures of some important steroids. 


ordered and rigid than their surroundings. Lipid rafts 
associate with specific proteins and are believed to 
play an important role in signal transduction. While 
plants have very little cholesterol, a similar steroid, 
stigmasterol, plays an analogous role in the mem- 
branes of their cells. See CELL (BIOLOGY); CELL MEM- 
BRANES. 

Because of its nonpolar nature, cholesterol has low 
solubility is water. Consequently, cholesterol must 
be transported through the bloodstream attached to 
carrier proteins known as lipoproteins. Low-density 
lipoproteins (LDL) transport cholesterol from the 
liver to various tissues in the body. High-density 
lipoproteins (HDL) transport cholesterol from vari- 
ous body tissues to the liver. HDL can be thought 
of as a “cholesterol scavenger” because it can take 
excess cholesterol directly from cell membranes. 

High levels of cholesterol-containing LDL in the 
bloodstream are strongly correlated with arterioscle- 
rosis (hardening of the arteries) and elevated risk 
of arterial blockage and heart attack. This is due to 
the deposition of excess cholesterol in the smooth 
muscle cells of the inner arterial wall. Over time, 
these deposits become calcified and can ultimately 
lead to artery blockage. High HDL levels have been 
correlated with reduced risks of heart disease. See 
CHOLESTEROL; LIPOPROTEIN. 

Steroid hormones. In addition to being an impor- 
tant component of cell membranes, cholesterol is a 
precursor of a variety of critical steroid hormones. 
Among these are glucocorticoids, androgens, estro- 
gens, progestins, and mineralocorticoids. Each fam- 
ily of steroid hormones elicits a different physiologi- 
cal response. 

Glucocorticoids. Cortisol (also known as hydrocor- 
tisone) is an example of a glucocorticoid. This 
hormone is produced by the adrenal gland and 
participates in the regulation of carbohydrate 


Steroid 


Testosterone 
(an androgen) 


477 


OH 


478 


Sterol 


metabolism. Cortisol causes an increase in the re- 
lease of glucose from the liver and a decrease in the 
uptake of glucose by other tissues of the body. Con- 
sequently, cortisol effectively raises blood sugar (glu- 
cose) levels and acts in opposition to insulin. Corti- 
sol also has anti-inflammatory and antiallergic effects, 
and is widely used medicinally to treat asthma, aller- 
gies, and various skin conditions. 

Androgens. Testosterone is the primary androgen 
hormone. It is produced in the gonads and is a 
male sex hormone. Testosterone is required for the 
fetal development of male genitalia. During puberty, 
testosterone is required for the development of sec- 
ondary sex characteristics in males, such as facial and 
pubic hair and a lowered voice. Testosterone also 
promotes protein synthesis that in turn leads to mus- 
cle production. Synthetic androgenic steroids that 
mimic testosterone are used medicinally to treat the 
severe weight loss that accompanies certain wasting 
diseases. To increase muscle mass and strength, many 
athletes have abused synthetic androgenic steroids. 
See ADRENAL GLAND. 

Estrogens. Estradiol is the primary estrogen hor- 
mone. It is produced by the ovaries and placenta and 
is a female sex hormone. During puberty, estradiol 
is required for the development of female secondary 
sex characteristics, such as pubic hair and enlarged 
breasts. Estradiol causes the thickening of the uter- 
ine lining that occurs during the early phase of the 
menstral cycle. It also suppresses the secretion of 
follicle-stimulating hormone by the pituitary gland, 
thereby preventing ovulation. Because of its ability to 
suppress ovulation, estradiol is a major component 
of oral contraceptives. See ESTROGEN. 

Progestins. Progesterone is the primary progestin. 
This hormone is produced by the adrenal gland and 
the corpus luteum (an ovarian follicle that has rup- 
tured after releasing an egg). Progesterone is a fe- 
male sex hormone. It causes changes in the uterine 
lining that enable implantation of a fertilized egg. 
If pregnancy results, the placenta starts to produce 
progesterone to prevent spontaneous abortion and 
prepare the mammary glands for milk production. 
Synthetic progestins are also major components of 
oral contraceptives. See PROGESTERONE. 

Mineralocorticoids. Aldosterone is an example of a 
mineralocorticoid hormone. It is produced by the 
adrenal gland and regulates electrolyte balance. It 
acts on the kidneys to stimulate potassium ion ex- 
cretion and to repress sodium ion excretion. See AL- 
DOSTERONE. 

Mechanism ofaction. Steroid hormones elicit the phys- 
iological responses described above by altering the 
expression of various genes. Because steroids are 
nonpolar, they readily pass through the cell mem- 
brane. Once inside the cell, the steroid molecules 
bind to specific protein receptors in the cytoplasm. 
The hormone-protein receptor complexes then 
enter the nucleus and bind to specific regions of 
the DNA known as response elements. In many 
cases, binding of the hormone-protein receptor com- 
plex to response elements in the DNA induces 


the expression of genes adjacent to the response 
element. In other cases, this binding represses 
the expression of genes adjacent to the response 
element. 

Bile acids. In addition to being a precursor of 
steroid hormones, cholesterol is a precursor of bile 
acids (also known as bile salts), such as cholic acid 
and glycocholic acid. Cholesterol is converted to 
bile acids in the liver, and these compounds are 
secreted into the small intestine via the gall blad- 
der. In the small intestine, bile acids act as emulsi- 
fying agents to aid in the digestion and absorption 
of fats. While most of the bile acid in the small in- 
testine is reabsorbed and recycled, some is not. This 
nonabsorption represents the major means of excre- 
tion of excess cholesterol by the body. See LIVER. 

M. Todd Washington 

Bibliography. T. M. Devlin (ed.), Textbook of Bio- 
chemistry with Clinical Correlation, 5th ed., Wiley, 
New York, 2002; D. Voet and J. G. Voet, Biochem- 
istry, 3d ed., Wiley, New York, 2004. 
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Any of a group of naturally occurring or synthetic 
organic compounds with a steroid ring structure, 
having a hydroxyl (—OH) group, usually attached 
to carbon-3. This hydroxyl group is often esteri- 
fied with a fatty acid (for example, cholesterol es- 
ter). The hydrocarbon chain of the fatty-acid sub- 
stituent varies in length, usually from 16 to 20 carbon 
atoms, and can be saturated (containing no double 
bonds) or unsaturated (containing one or more dou- 
ble bonds). Sterols commonly contain one or more 
double bonds in the ring structure and also a vari- 
ety of substituents attached to the rings. Sterols and 
their fatty-acid esters are essentially water insoluble. 
For transport in an aqueous milieu (for example, the 
bloodstream of mammals), sterols and other lipids 
are bound to specific proteins, forming lipoprotein 
particles. These particles are classified based on their 
composition and density. One lipoprotein class is ab- 
normally high in the blood of humans prone to heart 
attacks. 

Sterols are widely distributed in nature. The 
biosynthetic pathway leading to the steroid ring 
structure is utilized by higher and lower forms of 
life. Modifications of the steroid ring structure are 
made by specific enzyme systems, producing the 
sterol characteristic for each species such as ergo- 
sterol in yeast. The common biosynthetic pathway 
begins with the two-carbon-atom compound, ac- 
etate, which is converted by several enzymic steps 
to a six-carbon-atom compound, mevalonate. Next, 
mevalonate is converted to a five-carbon-atom in- 
termediate, isopentenyl pyrophosphate. There fol- 
lows a series of condensations of six isopentenyl 
pyrophosphate molecules to yield a 30-carbon-atom 
intermediate, squalene. Squalene is next cyclized to 
a 30-carbon-atom sterol, lanosterol. The structure of 
lanosterol is then modified to yield the principal 


sterol of a species. The major regulatory step in 
the sterol biosynthetic pathway occurs early in the 
process. Drugs that lower blood cholesterol levels 
in humans are designed to inhibit this regulatory 
enzyme. In addition to their conversion to sterols, 
several intermediates in the pathway are precursors 
of other important biological compounds, includ- 
ing chlorophyll in plants, vitamins A, D, E, and K, 
and regulators of membrane functions and metabolic 
pathways. 

A universal role of sterols is to function as 
part of membrane structures. In addition, some in- 
sects require sterols in their diets. Cholesterol also 
serves as a precursor of steroid hormones (estro- 
gens, androgens, glucocorticoids, and mineralocor- 
ticoids) and bile acids. See CHOLESTEROL; STEROID. 

Mary E. Dempsey 

Bibliography. K. Bloch, The biological synthesis 
of cholesterol, Science, 150:19-28, 1965; G. W. 
Patterson and W. D. Nes (eds.), Physiology and Bio- 
chemistry of Sterols, 1992. 
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Stichosomida 


An order of nematodes (roundworms) formerly con- 
stituting at least two important groups of animal 
parasites: the mermithids, which are parasites of 
invertebrates, and the trichinellids, which are par- 
asites of vertebrates. These nematodes are charac- 
terized by a pharynx that is narrow and thin-walled 
anteriorly and is surrounded posteriorly by unicel- 
lular, glandular stichocytes, each with a duct into 
the pharyngeal lumen (Figs. 1 and 2). The phar- 
ynx extends one-fourth to nine-tenths of the body 
length in various taxa and is almost devoid of mus- 
culature. The region of the pharynx surrounded by 
stichocytes is known as the stichosome. Recent phy- 
logenetic analysis based on a synthesis of molec- 
ular and morphological data suggests that the sti- 
chosome may be an example of parallel evolution 
and that the trichinellids and mermithids are more 
appropriately separated as two orders: Trichinel- 
lida, with at least six families, and Mermithida, 
with two families. However, debate on the higher 
classification of nematodes continues, and opinion 
will no doubt refocus as new data emerge and 
are applied. See MEDICAL PARASITOLOGY; NEMATA 
(NEMATODA). 

Two life history examples provide some insight 
into the biology of these organisms. In Trichinella 
spiralis, encysted larvae are ingested in infected mus- 
cle tissue (raw or undercooked pork is the classic ex- 
ample in the case of trichinosis in humans). The cyst 
surrounding the larva is digested in the new host, 
and the larvae molt to adults, mate, and embed in 
the intestinal epithelium; then females produce eggs 
which hatch (1000 larvae per female in 5 days). The 
hatched larvae are distributed via the circulatory sys- 
tem and migrate into surrounding cells; the larvae die 
unless they are in striated muscle fibers. Secretory 
products of the stichocytes induce DNA endoredu- 
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Fig. 1. Anterior end of Trichinella spiralis showing 
stichostome. (Adapted from B. G. Chitwood, 1930) 


plication (replication of nuclear DNA without mi- 
totic cell division) and transformation of the muscle 
fiber into a multinucleate nurse cell, which becomes 
encapsulated by collagen and supplied with capil- 
laries. The life cycle continues when the muscle is 
eaten by another animal. 

In Romanomermis culicivorax, the preparasitic 
larva is equipped with a stylet which allows pen- 
etration through the cuticle of the mosquito larva 
host. The stylet is lost in subsequent larval stages, 


Stichosomida 


480 


Stickleback 





Fig. 2. Portion of the stichosome of the whipworm, 
Trichuris trichiura. (Photomicrograph by Sung-Jong Hong, 
The Korean Society for Parasitology) 


and the stichosome separates from the intestine, 
which becomes a trophosome (a modified nutritive 
storage organ) with no anal opening to the exte- 
rior. Nutrient uptake from the insect hemolymph 
is through a very thin cuticle, perhaps enhanced 
by small-diameter pores. Interestingly, the epidermal 
cells adjacent to the cuticle have their absorptive sur- 
face area increased by outwardly directed microvilli. 
The continued role of the stichosome is unclear, al- 
though it appears to be involved in protein synthe- 
sis. The larva undergoes one molt in the insect host 
and then ruptures the body wall as it emerges. After 
a final molt to the adult stage, the female deposits 
eggs. 

The nature and function of stichocyte secretions, 
particularly of vertebrate parasites, constitute an area 
of continued interest in research on host-parasite re- 
lationships. Howard Ferris 
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ology of Nematodes, Taylor and Francis, London 
and New York, 2002; E. A. Munn and P D. Munn, 
Feeding and digestion, pp. 211-232 in D. L. Lee 
(ed.), The Biology of Nematodes, Taylor and Francis, 
London and New York, 2002; G. O. Poinar, The Nat- 
ural History of Nematodes, Prentice Hall, Engle- 
wood Cliffs, NJ, 1983; G. O. Poinar and R. Hess, Ro- 
manomermis culicivorax: Morphological evidence 
of transcuticular uptake, Exp. Parasitol., 42:27-33, 
1977. 
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Stickleback 


Any member of the fish family Gasterosteidae, 
order Gasterosteiformes. The body may be elongate, 
naked, or with bony scutes along the sides. Three 
to 16 well-developed isolated dorsal spines precede 


a normal dorsal fin having 6 to 14 rays. Additional 
identifying features are as follows: 12 or 13 caudal 
fin rays; a single spine and 1 or 2 soft rays in the 
pelvic fin; 3 branchiostegal rays; circumorbital ring 
incomplete posteriorly; epineurals present; and 28 
to 42 vertebrae. A maximum length of about 18 cm 
(7 in.) is reached in Spinachia spinachia. See GAS- 
TEROSTEIFORMES; OSTEICHTHYES; TELEOSTEI. 

Sticklebacks are small freshwater and marine 
fishes of the cold and temperate waters of the North- 
ern Hemisphere. All species are of some economic 
importance since their diet includes mosquito lar- 
vae, and the freshwater species are especially pop- 
ular aquarium fishes. Also, sticklebacks are famous 
as subjects of numerous studies regarding evolution, 
genetics, behavior, and physiology. See INSTINCTIVE 
BEHAVIOR; SEXUAL DIMORPHISM. 

Notable species. The fifteen-spined stickleback 
(Spinachia spinachia) and the four-spined stickle- 
back (Apeltes quadracus) are marine shorefishes. 
Spinachia spinachia is found from Norway to the 
Bay of Biscay (France-Spain). The adult is about 18 cm 
(7 in.) in length and has a life span of 1 year. In 
low tides waters, the male builds a nest of algae 
stuck together with his mucus secretions. The fe- 
male enters the nest and deposits 150 to 200 eggs, 
which are then fertilized by the male. The female dies 
shortly after egg deposition, and the male guards the 
nest and eggs. Apeltes quadracus occurs in fresh- 
waters along the Atlantic coast from Nova Scotia to 
Virginia. 

Pungitius pungitius (Fig. 1) measures about 
7.5 cm Q@ in.) in length and occurs in fresh and brack- 
ish waters of North America and Europe. This species 
is known as the nine-spine stickleback, but the num- 
ber of spines varies. Nest building is relegated to the 
male, which also protects the eggs and fry until they 
hatch in about 1 week and protects the fry. 





Fig. 1. Nine-spine stickleback (Pungitius pungitius). (Photo 
by Konrad P. Schmidt) 
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Fig. 2. Three-spine stickleback (Gasterosteus aculeatus). 
(Bob Jenkins and Noel Burkhead; EFISH 
<http://www.cnr.vt.edu/efish/>) 


The best-known species is the three-spine stickle- 
back (Gasterosteus aculeatus) [Fig. 2], the behav- 
ior of which has been extensively studied. The male 
makes a depression in the sandy bottom; here he 
builds a nest of vegetation cemented together by 
mucous strands produced by the kidneys, which are 
enlarged at breeding time. During the breeding sea- 
son, the abdomen of the males becomes red and, 
when exhibited in the territory of the nesting area, 
evokes an innate behavioral response and attracts 
the females. The male also performs an elaborate 
courtship dance to entice the female to enter the 
nest. When the female does so, the male nuzzles her 
near the base of the tail and thus stimulates her to 
oviposit. The male then fertilizes the eggs and re- 
mains to guard and aerate them. See REPRODUCTIVE 
BEHAVIOR. Herbert Boschung; Charles B. Curtin 

Bibliography. M. A. Bell and S. A. Foster (eds.), 
The Evolutionary Biology of the Threespine Stickle- 
back, Oxford University Press, 1994; R. Froese and 
D. Pauly (eds.), FishBase, World Wide Web elec- 
tronic publication, version (05/2005); J. S. Nelson, 
Fishes of the World, 4th ed., Wiley, New York, 
2006; R. J. Wootton, The Biology of the Stick- 
lebacks, Academic Press, 1976; R. J. Wootton, A 
Functional Biology of Sticklebacks, University of 
California Press, 1984. 
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Stilbite 


A mineral belonging to the zeolite family of silicates. 
It crystallizes in the monoclinic system in crystals 
that are tabular parallel to the side pinacoid. Most 
characteristic are sheaflike aggregates of thin tabu- 
lar crystals. There is perfect cleavage parallel to the 
side pinacoid, and here the mineral has a pearly lus- 
ter; elsewhere the luster is vitreous. The color is 
usually white but may be brown, red, or yellow. 
Hardness is 3'/,-4 on Mohs scale; specific gravity 
is 2.1-2.2. Stilbite is a calcium-sodium aluminum sili- 
cate, Ca(Al,Si7O;s) - 7H2O, but usually contains some 
sodium replacing calcium. 

Stilbite is a secondary mineral usually found in 
cavities in basalts and related rocks. Much less com- 
monly it is found in granites, gneisses, and metal- 
bearing veins. It is associated with other zeolites, 
datolite, prehnite, and calcite. Some notable locali- 
ties are in Iceland, India, Scotland, Nova Scotia, and 
in the United States at Bergen Hill, New Jersey, and 
the Lake Superior copper district in Michigan. See 
SILICATE MINERALS; ZEOLITE. Cornelius S. Hurlbut, Jr. 


T) 
Stirling engine 

An engine in which work is performed by the ex- 
pansion of a gas at high temperature to which heat 
is supplied through a wall. Like the internal com- 
bustion engine, a Stirling engine provides work by 
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Fig. 1. Principle of Stirling engine, displacer type. 
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means of a cycle in which a piston compresses gas at 
a low temperature and allows it to expand at a high 
temperature. In the former case the heat is provided 
by the internal combustion of fuel in the cylinder, 
but in the Stirling engine the heat (obtained from ex- 
ternally burning fuel) is supplied to the gas through 
the wall of the cylinder (Fig. 1). See INTERNAL COM- 
BUSTION ENGINE. 

The rapid changes desired in the gas temperature 
are achieved by means ofa second piston in the cylin- 
der, called a displacer, which in moving up and down 
transfers the gas back and forth between two spaces, 
one at a fixed high temperature and the other at a 
fixed low temperature. When the displacer in Fig. 2 
is raised, the gas will flow from the hot space via the 
heater and cooler tubes into the cold space. When 
it is moved downward, the gas will return to the hot 
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Fig. 2. Stylized Stirling process. 
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Fig. 3. Piston and displacer movement for a Stirling engine. 
(a) The continuous movement of the piston and displacer 
shown as a function of crank angle a. There is no clear-cut 
division between the various phases of the cycle. The 
variations in volume of the hot space Vg and the cold space 
Vc are plotted separately in the lower part of the diagram. 
(b) The P-V diagram of the cycle. 


space along the same path. During the first trans- 
fer stroke the gas has to yield up a large amount 
of heat to the cooler; an equal quantity of heat has 
to be taken up from the heater during the second 
stroke. 

The regenerator shown in Figs. 1 and 2 has been 
inserted between the heater tube and cooler tube in 
order to prevent unnecessary waste of this heat. It is 
a space filled with porous material to which the hot 
gas gives up its heat before entering the cooler; the 
cooled gas recovers the stored heat on its way back 
to the heater. 


The displacer system serves to heat and cool 
the gas periodically; associated with it is a piston 
which compresses the gas while it is in the cold 
space of the cylinder and allows it to expand while 
in the hot space. Since expansion takes place at a 
higher temperature than compression, it produces 
a surplus of work over that required for the com- 
pression. 

Stirling cycle. Any practical version of the engine 
will embody some kind of crank and connecting rod 
mechanism, in consequence of which there will be 
no sharp transitions between the successive phases 
indicated in Fig. 2; but this will not alter the principle 
of the cycle (nor detract from its efficiency). If, for 
the sake of simplicity, the piston and displacer are 
assumed to move discontinuously, the cycle can be 
divided into the following four phases. (1) The piston 
is in its lowest position, the displacer in its highest. 
All the gas is in the cold space. (2) The displacer is 
still in its highest position; the piston has compressed 
the gas at low temperature. (3) The piston is still in 
its highest position; the displacer has moved down 
and transferred the compressed gas from the cold to 
the hot space. (4) The hot gas has expanded, pushing 
the pistons, followed by the displacer, to their lowest 
positions. The displacer is about to rise and return 
the gas to the cold space, the piston remaining where 
it is, to give phase I again. 

The actual piston and displacer movements might 
be as indicated in Fig. 3, which shows that the only 
essential condition for obtaining a surplus of work 
is that the maximum volume of the hot space occur 
before that of the cold space. This condition shows 
that more configurations with pistons and cylinders 
are possible than just the type with displacer in order 
to get a Stirling cycle. One of the most compact sys- 
tems is shown in Fig. 4, which is the system known 
as the double-acting engine. In the double-acting en- 
gine there is a hot space (expansion space) at the top 
and a cold one (compression space) at the bottom 
of each of the four cylinders shown. The hot space 
of a cylinder is connected to the cold one through 
a heater, a regenerator, and a cooler. The pistons P, 
of the cylinders move with a suitable phase shift be- 
tween them. In the case of four cylinders, as shown 
in Fig. 4, this shift is 90°. 

The theory ofan actual engine is very complicated. 
In order to provide an understanding of the quanti- 
ties that play a role, the formulas of the power and 
efficiency of an engine of highly idealized form will 
be given. On the assumption that the volumes of the 
hot (expansion) space Vz and that of the cold (com- 
pression) space V¢ (Fig. 3) vary with the crank angle 
in a purely sinusoidal way, that both expansion and 
compression are isothermal at respectively Tz and 
Tc, and that all kinds of losses caused by the flow 
resistances in the tubes and regenerator, heat losses 
in the regenerator, and so on are ignored, the power 
P can be expressed as in Eq. (1). Here t = T¢/Tz, 


p=đä=0 V in® (dd) 
T * Pm sin © 
60 Emax p 1 


5 
Pe 


n is the speed in rpm, Pm is the mean pressure of the 
pressure variation, and ô and © are functions of the 
temperature ratio, swept volume ratio, dead spaces, 
and phase angle between the two swept volumes. 
Under these conditions the thermal efficiency 7 is, 
of course, that of the Carnot cycle, and is expressed 
in Eq. (2). 


(2) 


See CARNOT CYCLE; THERMODYNAMIC CYCLE. 

Engine design. Actual engines have been built 
in the Philips Laboratories at Eindhoven, Nether- 
lands, as prototypes in the range of 10-500 hp 
(7.5-375 kW) per cylinder. After years of research 
on the Stirling engine, actual thermal efficiencies 
of 30-45% (depending on the specific output 
and temperature ratio) and a specific power of 
1.88 hp/in.* (85 kW/liter) swept volume of the pis- 
ton were obtained with the displacer type of engine 
equipped with rhombic drive, as shown in Fig. 5. 
In the figure the piston and the displacer drive 
concentric rods, which are coupled to the rhom- 
bic drive turning twin timing gears. The cooler, 
regenerator, and heater are arranged as annular sys- 
tems around the cylindrical working space. The 
heater tubes surround the combustion chamber. 
In the preheater the gas at 1470°F (800°C) from 
the heater is cooled to 300-390°F (150-200°C) 
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Fig. 4. Principle of the double-acting engine. 
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while heating the combustion air to about 1200°F 
(650°C). 

The rhombic drive mechanism allows complete 
balancing even of a single-cylinder engine and of a 
separate buffer space, thus avoiding heavy forces act- 
ing on the drive. The results of measurements on the 
first engine of this type, with hydrogen as the work- 
ing fluid, are shown in Fig. 6. An approximate heat 
balance is output, 40%; exhaust and radiation, 10%; 
and heat rejection by cooling water, 50%. Control 
of engine output is by regulation of the pressure of 
the working fluid in the engine, while the tempera- 
ture of the heater is kept constant by a thermostat; 
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Fig. 5. Philips Stirling engine in cross section. (Philips Gloeilampenfabrieken) 
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Fig. 6. Test measurements for the Stirling engine. (a) Measured shaft power P, and efficiency of a 40-hp (30-kW) single- 
cylinder test engine with rhombic drive, plotted as a function of the engine speed n, at different values of the maximum 
pressure of the working fluid in the engine (pmax). 1 hp = 0.75 kW; 1 kgf/cm? = 98 kPa = 14.2 Ibf/in.? (b) Power and 

efficiency of the 40-hp (30-kW) test engine, given as a function of the heater temperature and as a function of the inlet 
temperature of the cooling water. The curves apply to n = 1500 rpm and pmax = 140 kgf/cm? = 13.7 MPa = 1990 Ibf/in.?; °F = 


CC x 1.8) + 32. 


hence the efficiency shows little dependence on the 
load. 

The closed system of the Stirling engine endows 
this engine with many advantages and also some 
shortcomings. The continuous external heating of 
the closed system makes it possible to burn various 
kinds of liquid fuels and gases, without any mod- 
ification whatsoever. This multifuel facility can be 
demonstrated with a 10-hp or 7.5-kW (at 3000 rpm) 
generator set (Fig. 7). The engine can operate 
on alcohol, various lead-containing gasolines, diesel 
fuel, lubricating oil, olive oil, salad oil, crude oil, 
propane, butane, and natural gas. Furthermore, it al- 
lows combustion to take place in such a way that 
air pollution is some orders of magnitude less than 
that due to internal combustion engines. Through 
the intermediary of a suitable heat transport sys- 
tem (for example, heat pipes) any heat source at a 
sufficiently high temperature can be used for this 
engine—radioisotopes, a nuclear reactor, heat stor- 
age, solar heat, or even the burning of coal or 
wood. 

The almost sinusoidal cylinder pressure variation 


and continuous heating make the Stirling engine very 
quiet in operation. An engine having four or more 
cylinders gives a virtually constant torque per rev- 
olution, as well as a constant dynamometer torque 
over a wide speed range, which is particularly valu- 
able for traction purposes. The present configuration 
makes complete balancing possible, thus eliminating 
vibrations. There is no oil consumption and virtually 
no contamination because a new type of seal for the 
reciprocating rods shuts off the cycle hermetically 
from the drive mechanism. Figure 8 shows an en- 
gine of this configuration. 

Where direct or indirect air cooling is required, 
the closed cycle has the drawback that more heat 
has to be removed from the cooler than in compa- 
rable engines with open systems, where a greater 
quantity of heat inevitably escapes through the 
exhaust. 

If it is envisaged as someday taking the place of ex- 
isting engines, the Stirling might be ideal as a propul- 
sion engine in yachts and passenger ships, and in 
road vehicles, such as city buses, where a large ra- 
diator is acceptable. The system of continuous 
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Fig. 8. A 3-kW Stirling engine generator set. (General Motors Corp.) 
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Stishovite 


external heating is also able to open fields of ap- 
plication inaccessible to internal combustion en- 
gines. Roelof J. Meijer 

Bibliography. R. Darlington, Stirling and Hot Air 
Engines, 2005; T. Finkelstein and A. J. Organ, Air 
Engines: The History, Science, and Reality of the 
Perfect Engine, 2001; W. R. Martini, Stirling Engine 
Design Manual, 2004. 
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Stishovite 


Naturally occurring stishovite, SiO2, is a mineral 
formed under very high pressure with the silicon 
atom in sixfold, or octahedral, coordination instead 
of the usual fourfold, or tetrahedral, coordination. It 
was discovered in the same type of Coconino sand- 
stone from the Meteor Crater of Arizona. Stishovite 
has since been found in rocks from the Ries Crater 
of Germany. The presence of stishovite indicates for- 
mation pressures in excess of 10° Ib/in.? (7.5 gigapas- 
cals). The possibility of the existence of stishovite at 
great depths strongly influences the interpretations 
of geophysicists and solid-state physicists regarding 
the phase transitions of mineral matter, as well as the 
interpretation of seismic data in the study of such 
regions of the interior of the Earth. See SILICA MINER- 
ALS. 
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Comparison of the mean index of refraction and specific 
gravity of stishovite and other polymorphic modifications of 
silica. 


Natural occurrence. Stishovite occurs in submi- 
crometer size in very small amounts (less than 1% of 
the rock) in samples of Coconino sandstones from 
the Meteor Crater of Arizona, which contains up to 
10% of coesite, the other high-pressure polymorph 
of silica. Besides stishovite and coesite, these rocks 
consist mainly of quartz, with small amounts of sil- 
ica glass (lechatelierite) and traces of zircon and, in 
some cases, rutile. Because of its extremely fine grain 
size and because of the sparsity of this mineral in the 
rock, positive identification of the mineral is possible 
only by the x-ray diffraction method after chemical 
concentration. See COESITE; METEORITE. 

Properties. Stishovite is much more resistant to at- 
tack by concentrated hydrofluoric acid than coesite. 
It can therefore be extracted or concentrated by pro- 
longed treatment of the stishovite-bearing sandstone 
in hydrofluoric acid. Concentrates of stishovite, all of 
submicrometer size, are pale gray and have a roughly 
prismatic habit. Because of its fine-grained habit, only 
the mean index of refraction of the natural material 
can be measured. It is 1.80. 

Crystallographically, stishovite is SiO2 in the ru- 
tile structure, tetragonal with a = 0.41790 nanome- 
ter and c = 0.26649 nm (with a standard error of 
+0.00004 nm). The axial ratio is c/a is 0.6377, and 
the volume of the tetragonal unit cell is 0.046541 
nm?. The specific gravity of stishovite, calculated 
from the x-ray data, is 4.28, as reported by Chao and 
others, compared with the value of 4.35 for the syn- 
thetic material reported by Stishov and Popova. It is 
46% denser than coesite and much denser than other 
modifications of silica (see illus.). Edward C. T. Chao 

Bibliography. G. Beggerow, High-Pressure Proper- 
ties of Matter, 1980; E. C. T. Chao et al., Stishovite, 
SiOz, a very high pressure new mineral from Me- 
teor Crater, Arizona, J. Geophys. Res., 67(1):419- 
421, 1962; P J. Heaney, C. T. Prewitt, and G. V. 
Gibbs (eds.), Silica: Physical Behavior, Geochem- 
istry, and Materials Applications (Reviews in Min- 
eralogy, Vol. 29), 1994. 
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Stochastic control theory 


A branch of control theory which aims at predicting 
and minimizing the magnitudes and limits of the ran- 
dom deviations of a control system output through 
optimizing the design of the controller. Such devi- 
ations occur when random noise and disturbance 
processes are present in a control system, so that 
the system does not follow its prescribed course 
but deviates from the latter by a randomly varying 
amount. 

In contrast to deterministic signals, random signals 
cannot be described as given functions of time such 
as a step, a ramp, or a sine wave. The exact function 
is unknown to the system designer; only some of its 
statistical properties are known. 

A random signal may be generated by one of na- 
ture’s processes, for instance, radar noise and wind- 
or wave-induced forces and moments on a radar 


antenna or a ship. Alternatively, a random signal may 
be generated by human intelligence, for instance, the 
bearing of a zigzagging aircraft, or the contour to be 
followed by a duplicating machine. 

One outstanding experimental fact about nature’s 
random processes is that these signals are approxi- 
mately gaussian. The word gaussian refers to a mathe- 
matical concept which describes one or more signals 
ii, i2, ... , t, having the following properties: 

1. The amplitude of each signal is normally dis- 
tributed. 

2. The joint distribution function of any number 
of signals at the same or different times taken from 
the set is a multivariate normal distribution. This ex- 
perimental fact is not surprising in view of the fact 
that a random process of nature is usually the sum 
total of the effects of a large number of independent 
contributing factors. For instance, thermal noise is 
due to the thermal motions of billions of electrons 
and atoms. An ocean-wave height at any particular 
time and place is the sum of wind-generated waves 
at previous times over a large area. Gaussian sig- 
nals have the property that when passed through 
linear systems they remain gaussian. See DISTRIBU- 
TION (PROBABILITY); ELECTRICAL NOISE; STOCHASTIC 
PROCESS. 

In the following, the mathematical characteriza- 
tion of the random processes, their transmission 
properties, measurements and simulation, effects on 
the system output, and control system optimization 
will be introduced. 

Mathematical characterization. The random pro- 
cesses encountered in control work can be classi- 
fied as stationary and nonstationary processes. The 
underlying mechanism that generates a random pro- 
cess can usually be described in physical or mathe- 
matical terms. For instance, the underlying mecha- 
nism that generates shot-effect noise is thermionic 
emission; the underlying mechanism that generates 
ocean waves is essentially wind force in conjunc- 
tion with the Earth’s gravity; and the outcome of 
dice throwing is determined by a probability of 
1/6 for each face of each die independent of all 
others. 

If the generating mechanism does not change with 
time, any measured average property of the random 
process such as mean or variance is independent of 
the time of measurement aside from statistical fluc- 
tuations, and the random process is called station- 
ary. For instance, ocean-wave height in a given sea 
state, thermal noise, and a telegraphic signal of a 
certain language are examples of stationary random 
processes. If the generating mechanism does change, 
the random process is called nonstationary, in which 
case the generating mechanism may change in a pre- 
determined fashion or at random. An example of the 
former is found in missile launching, where the un- 
wanted fluctuations in vertical and lateral thrusts are 
generated by one or a group of rocket motors of 
which the operating condition is a predetermined 
function of time. Alternatively, the generating mech- 
anism may change at random, as in the course of 
flight of an aircraft, where the wind disturbance de- 
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pends on the local atmospheric condition which may 
be calm or turbulent. 

Statistical ensemble. The random processes or events 
generated by the same underlying mechanism form 
a statistical ensemble. The ensemble average is the 
averaged value over all possible events weighted ac- 
cording to the probabilities of such events. For a sta- 
tionary process there is little reason to distinguish 
between the ensemble average at any particular time 
and the time average of a single event. The life history 
of a single event can be chopped up into an infinite 
number of events. Since these events are generated 
by the same underlying mechanism which does not 
change with time, they can be thought of as arbitrar- 
ily selected from the entire ensemble, and therefore 
they represent the latter. The assumption of equiva- 
lence between time and ensemble averages is called 
the ergodic hypothesis. See STATISTICAL MECHANICS; 
TOPOLOGICAL DYNAMICS. 

The ensemble average of a random variable or 
function is referred to as its mean value. For station- 
ary systems, the mean value can be obtained by tak- 
ing either time or ensemble average. Let x denote the 
mean value of a random variable x, and X its random 
variation from the mean: x = X + X. As X can be ab- 
sorbed as part of the deterministic input, the random 
signals may be defined as variations about the mean 
values, and thus they have zero mean themselves. 

Correlation functions. The statistical properties of a set 
of random signals (which may include the random 
inputs and system variables in response to these in- 
puts) can be described by their correlation functions. 
The correlation function $,,(4;,2) is defined as the 
ensemble average of signal a(d) at time ¢; and b(@ at 
time f, Eq. (1). When the two signals a® and b@ 


dav, t2) = a(t )bC2) a) 


are different, a(tı,t2) is called the cross-correlation 
function. When a(@ and b(@® are one and the same, 
Pav(h,f2) is called the autocorrelation function. 

For stationary systems, the only significant inde- 
pendent variable is t — t4, denoted as t, Eq. (2). The 


Pab lT) = aDbE + T) (2) 


time interval t is called the correlation time. The 
useful properties of the correlation function given 
in Eqs. (3)-(7) follow from Eq. (2). Equation (6) 


baa) = act? (3) 

Pab =T) = boat) (4) 
Paal T) = baaCT) 6) 
Qab] < VPaaObon(O) ©) 
lhaa < aa) a” 


is obtained by taking the ensemble average of 
Eq. (8), yielding Eq. (9), and combining this with 
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Eq. (10). Equation (7) is a special case of Eq. (6) 
with b@® = aM. 
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Equations (3)-(7) show that the autocorrelation 
function is even in T, and its value at t = O is the 
mean-square value of the random signal itself. The 
magnitude of the cross-correlation function cannot 
exceed the product of the root-mean-square values 
of the two signals. 

Spectral-density functions. If the chopped signals a7) 
and b;(f are defined as the same random signals 
a@ and b(@, respectively, in the interval 0 < t < T 
but zero outside this interval, and A7(s) and B7(s) 
represent their Laplace transforms, then a function 
®,,(s) can be defined by Eq. (11). Equation (11) im- 
plies that Eqs. (12) and (13) are valid. Equations (12) 


Nee 
Pals) = äm parC OBr® anp 
DapGo) = Poal jw) a2) 
Daag Go) = Daa(—jo))0 a3) 


and (13) show that ®,,(jw) is the complex conju- 
gate of ®,,(jw) and that ®,,(jw) is a real positive 
even function of w. The function ®,,(s) is called the 
spectral-density function when a and b are the same, 
or the cross-spectral-density function when a() and 
b@® represent different signals. 

It is readily shown that ¢,,(t) and ®,,(s) are re- 
lated by Laplace transforms, Eqs. (14) and (15). The 


1 joo 
bart) = = f Pale” ds (14) 


2mf J joo 
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transform relationship between correlation func- 
tions and spectral densities as represented by Eqs. 
(14) and (15) is called Wiener’s theorem. 

Transmission properties. When stochastic pro- 
cesses are applied to a physical system, the random 
output variations are related to the inputs and to each 
other. These relationships are expressed in terms of 
the transmission properties. 

The illustration shows a multiple-input and 
multiple-output system where the vector x(4) de- 
notes inputs x,(), x2), ... , X,(@ and yŒ denotes 
outputs 11), 2, ... , ¥m@, through filters with 
transfer functions H,(s), Eq. (16). The filters are 
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Transmission of stochastic processes through the linear 
filter H. 


assumed to be stable (with all their poles in the left 
half of the s-plane). The spectral densities of compo- 
nents of y are expressed in terms of that of x by 
Eq. (17). Equation (17) is readily derived from 


Dyry GO) = X Y HJO) Pry GOH (jo) 7) 
k $ 


Eq. (11) and can be written in matrix form, Eq. (18), 


DGO) = H(— jo), (jo) HH’ (jo) a8) 


where ®,, ®,, and H are m x m,n x n, and m x n 
matrices with ®,,, ®,.;, Hy as their i—jth compo- 
nents respectively. Note that H(—jw) and How) are 
hermitian adjoints of each other. Equations (17) and 
(18) are perfectly general and can be used to obtain 
cross-spectral density between input and output of 
a filter as well. For instance, by regarding x as the 
m + 1- to m + n-th components of an enlarged out- 
put vector, Eq. (18) readily leads to Eq. (19), where 


Py Gw) = PGH Gw) (19) 


Phixy is ann X m matrix with Phi,y; as its ith 
component. See LINEAR SYSTEM ANALYSIS. 

If the process is nonstationary, frequency domain 
analysis cannot be used. Equation (18) is replaced by 
Eq. (20), where 4,¢,t) denotes the response of filter 
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Nz at time ż to unit impulse input at time T. 

White stochastic process. A stochastic process x 
with n components x1, X2, ... , Xn is said to be white 
if there is no correlation between any of its compo- 
nents at different times, Eq. (21). This includes auto- 
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correlations with 7= k. Equation (21) has an interest- 
ing implication: at t= /’, the autocorrelation function 
for a white process must have infinite instantaneous 
amplitude if it is to produce any physically significant 
effect. 


This can be proven as follows: A physical system 
can be represented by (or approximated as) a stable 
filter with finite impulse response h(¢,t). The mean- 
square value of its response a(t) to a white noise 
input x@ can be obtained from Eq. (20), yielding 
Eq. (22). If it is assumed to the contrary that x(/) is 
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finite, then ¢,,(t,t’) < M*. This condition, together 
with Eq. (21), gives the integral of Eq. (22) to be zero. 

Equation (21) and the requirement of finite physi- 
cal effect lead to the correlation function of Eq. (23) 


Piet, t') = WeSC — t’) (23) 


for the white noise process, where the matrix W 
with Wi(D as its elements is positive semidefinite 
for all ¢ (that is, v’Wv > 0 for any vector v with real 
components); and 5() is the Dirac function defined 
by Eqs. (24) and (25). For a stationary white process, 


sO = Oift £0 (24) 
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Wiener’s theorem and Eq. (23) give the spectral den- 
sity as Eq. (26).Equation (26) illustrates the origin of 
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the description “white” in the sense that the spectral 
density is independent of frequency. 

A white process is purely a mathematical concept 
as no physical process can have infinite amplitude or 
infinite bandwidth. It is nevertheless a very impor- 
tant concept in simulation of stochastic processes 
and in system optimization. A random process can 
be regarded as white in an approximate sense if 
Eq. (21) is satisfied for |t—¢’| > A, where A is a time 
interval much smaller than the response time of the 
control system under consideration. An equivalent 
condition is for Eq. (26) to hold for all w < wz, where 
wp is at least an order of magnitude larger than the 
system bandwidth. 

Simulation of stochastic processes. A white process 
can be simulated either digitally or by an analog 
method. Digitally it is simulated by a sequence of 
random numbers at a rate at least a decade higher 
than the system bandwidth. The numbers are then 
used directly in a simulation study or converted into 
an electrical signal in actual tests. In analog white 
noise generators, shot effect noise or thermal noise is 
amplified by a wide-band amplifier. Correlated white 
processes with matrix W are generated by using in- 
dependent white noise sources and a resistive net- 
work. The required number of independent sources 
is equal to the rank of W. 

Nonwhite stochastic processes can be simulated 
as outputs of filters with white noise sources as in- 
puts. Assuming Phi,(jw) to be the identity matrix, 
Eqs. (18) and (20) give respectively Eqs. (27) and 
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(28). The problem is then to find H(ja) so that $, (jw) 


p Gw) = H(—jo)H" (jo) (27) 
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is equal to the spectral density matrix of the stochas- 

tic processes to be simulated or to find b(t) for the 

nonstationary case. See SIMULATION. 

Regulator problem. The deterministic regulator 
problem is encountered in the optimal control of 
linear systems. Its state equation may be modified to 
include a white stochastic process wf), as shown 
in Eq. (29). Here x() is the state vector, u@ is the 


x = A®x+ BOu+ w (29) 
X(to) specified 


input vector, and w(® is a white stochastic process 
with correlation function given by Eq. (23). The per- 
formance index J, as defined by Eq. (30), where S; 


J =x" Sx G) 
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to 


and Q@ are symmetric, nonnegative definite matri- 
ces which penalize the system for large excursions, 
depends now on w). Therefore its expected value 
or ensemble average over all possible w(t) must be 
used, Eq. G1). 


Performance index = EV) Bl) 


See OPTIMAL CONTROL (LINEAR SYSTEMS). 

As wÇ) is white, its future value has no correlation 
with its past, and consequently has no correlation 
with x£). Furthermore, w(d), ¢ > to, is completely un- 
predictable and nothing can be done about it. The 
best that can be done is to minimize the contribu- 
tion to E(J) by x(@. Thus, the optimal control law of 
Eq. (2) with S defined by the Riccati differential 
equation (33) and the boundary condition S(@) = S; 


uH) = -ROB OSOA G2) 
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is still valid. This heuristic reasoning has been sub- 

stantiated by rigorous mathematical proof. 
However, the minimum value of E(/) is different 

from that given by Eq. (34) because of the added 
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disturbance wf). Straightforward calculations give 
Eq. 65), where Tr stands for the trace of the matrix 
inside the braces. 
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In practical control systems, the state vector X£) 
is not exactly known because of instrument noise 
and such, and the disturbance w(/) may not have 
white spectrum. However, with any given Phiy(jw) 
or dwt), w@® can be simulated as the output of 
linear filters with white noise processes v() as its 
inputs, as in Eqs. (36) and (37). A generalized vec- 


wt) = CH w+v (36) 


pt) = 6¢-0) (37) 


tor z(@ is defined by Eq. @8). If matrices D, E, and 
F are defined by Eqs. (39), where 1 and 0 denote 
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the identity matrix and null matrix of appropriate di- 
mensions, then Eqs. (29) and (36) can be combined 
into one, giving Eq. (40). Since Fv is a white distur- 
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bance, the problem is reduced to an optimal regula- 
tor problem with white disturbance. However, z(@ 
is not completely known. 

One of the most important results in estimator the- 
ory is that the residue error z® — 2), where 2 
is the optimum estimate of z(), cannot be inferred 
in any way from past measurements. Consequently 
the best that can be done is to apply regulatory ac- 
tion to the known part of z(), using Eq. (41), where 


ud) = —R 'ME'MOSOZY (41) 


S(@ is the solution to Riccati equation for the new 
problem defined by Eq. (40), and 2@ is the best or 
optimum estimate of z(H. Equation (41) states that 
the optimal overall controller is obtained by applying 
the optimal control law to the separately optimally 
estimated state Z(d). It is known as the separation 
theorem. 

For single-input, single-output, time-invariant sys- 
tems with infinite time interval, both the estimation 
problem and the control problem may be reformu- 
lated, leading to an integral equation known as the 
Wiener-Hopf equation. The solution to this equation 
has been obtained and is a classical mathematical re- 
sult in the theory of stochastic processes. Further, 
the method discussed above as well as the Wiener- 
Hopf approach may also be applied to discrete-time 
systems. See ESTIMATION THEORY. 

Structural uncertainty. In Eq. (29), the random vari- 
able w(/) is an additive term. Quite often the sys- 
tem structure is not completely known, and A@ or 
B@ has stochastic, uncertain, or unknown, compo- 
nents. These are known as structural uncertainty. 
If the structural uncertainty is white gaussian, the 
optimal control law remains the same, with one 


or more terms added to the Riccati equation. If 
the structural uncertainty is nonwhite, or constant 
but unknown, it can be predicted to some extent 
from past measurements. Adaptive controllers are 
generally used. If the structural uncertainty varies 
in some unknown manner, minimax, least-sensitive, 
or guaranteed-cost controllers are used. Minimax 
means minimizing the largest possible value of the 
performance index. Least-sensitive means designing 
the controller in such a way that the performance 
index is not sensitive to the structural uncertainty. 
Guaranteed-cost aims at simple robust design which 
guarantees that the performance index does not ex- 
ceed some given limit. See ADAPTIVE CONTROL; CON- 
TROL SYSTEMS; OPTIMAL CONTROL THEORY. 
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A physical stochastic process is any process gov- 
erned by probabilistic laws. Examples are (1) devel- 
opment of a population as controlled by mendelian 
genetics; (2) brownian motion of microscopic parti- 
cles subjected to molecular impacts or, on a different 
scale, the motion of stars in space; (3) succession of 
plays in a gambling house; and (4) passage of cars by 
a specified highway point. 

In each case, a probabilistic system is evolving; 
that is, its state is changing with time. Thus the state 
at time ¢ depends on chance: It is a random vari- 
able x). The parameter set of values of t involved is 
usually (and will always be in this article) either an 
interval (continuous parameter stochastic process) 
or a set of integers (discrete parameter stochastic 
process). Some authors, however, apply the term 
stochastic process only to the continuous parame- 
ter case. 

If the state of the system is described by a single 
number, x() is numerical-valued. In other cases, x(4) 
may be vector-valued or even more complicated. The 
discussion in this article will usually be restricted to 
the numerical case. As the state changes, its values 
determine a function of time, the sample function; 
and the probability laws governing the process de- 
termine the probabilities assigned to the various pos- 
sible properties of sample functions. 

A mathematical stochastic process is a mathemat- 
ical structure inspired by the concept of a physical 
stochastic process, and studied because it is a math- 
ematical model of a physical stochastic process or 


because of its intrinsic mathematical interest and its 
applications both in and outside the field of proba- 
bility. The mathematical stochastic process is defined 
simply as a family of random variables. That is, a pa- 
rameter set is specified, and to each parameter point 
ta random variable x(®) is specified. If one recalls that 
a random variable is itself a function, if one denotes a 
point of the domain of the random variable x() by w, 
and if one denotes the value of this random variable 
at w by x@,w), it results that the stochastic process is 
completely specified by the function of the pair (¢,w) 
just defined, together with the assignment of prob- 
abilities. If ¢ is fixed, this function of two variables 
defines a function of w, namely, the random variable 
denoted by x@. If w is fixed, this function of two 
variables defines a function of t, a sample function 
of the process. 

Probabilities are ordinarily assigned to a stochastic 
process by assigning joint probability distributions 
to its random variables. These joint distributions, to- 
gether with the probabilities derived from them, can 
be interpreted as probabilities of properties of sam- 
ple functions. For example, if tọ is a parameter value, 
the probability that a sample function is positive at 
time fo is the probability that the random variable 
x(t) has a positive value. The fundamental theorem 
at this level is that, to any self-consistent assignment 
of joint probability distributions, there corresponds 
a stochastic process. 

The concept of a stochastic process is so general 
that the study of stochastic processes includes all 
of probability theory. Although it would be impossi- 
ble to make a useful but more restrictive definition, 
what the probabilist usually has in mind in using the 
term stochastic process (unless he is interested in 
the mathematical foundations of the general theory) 
is a stochastic process whose random variables have 
some sort of interesting mutual relations. For exam- 
ple, one such relation is that of independence, and 
one type of stochastic process which was studied 
long before the term was invented is a sequence of in- 
dependent random variables. For historical reasons, 
such a sequence is not commonly thought of as a 
stochastic process (although its continuous parame- 
ter analog, a process with independent increments, 
to be discussed below, is). The rest of this article is 
devoted to a general discussion of specific types of 
stochastic processes which have received the most 
attention, because they are important in mathemati- 
cal and nonmathematical applications. 

Stationary processes. These are the stochastic pro- 
cesses for which the joint distribution of any fi- 
nite number of the random variables is unaffected 
by translations of the parameter; that is, the dis- 
tribution of xti + Þ), ..., X@, + 4) does not 
depend on b. For a more complete discussion see 
PROBABILITY 

Markov processes. A Markov process is a process 
for which, if the present is given, the future and 
past are independent of each other. More precisely, if 
t <- < t, are parameter values, and if 1 < j < n, 
then the sets of random variables [x@Œ), ..., XŒ-D] 
and [Xt), -- - X€n)] are mutually independent for 
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given x(t). Equivalently, the conditional probability 
distribution of x(t„) for given x(t), ..., XCy—1) de- 
pends only on the specified value of x(¢,_1) and is in 
fact the conditional probability distribution of x(,), 
given x(f,_1). An important and simple example is 
the Markov chain, in which the number of states 
is finite or denumerably infinite. (The terminology 
varies somewhat here.) One simple type of discrete- 
parameter Markov chain is the following. Let (pj) be 
a set of numbers, where 7 and j range over a finite 
or infinite set of integers. (In physical language, the 
number pj; will be the probability that some system 
has a transition from state 7 to state j in one step.) 
The numbers py are to satisfy relation (1). The ran- 
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dom variables of the associated Markov process are 
integral-valued, denoted by x(0), x(1), .... If fọ is pre- 
scribed as the initial state, that is, if x(O) is assigned 
the value 7p identically, the probability that x(k) has 
the value Zp, for k = 1, ..., N, is product (2). For 
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this process, py is the probability that x(@ + 1) has 
the value j if x) has the value 7. The number pj, is 
also (and this is the characteristic property of Markov 
processes) the probability that x(7 + 1) has the value 
j if x07) has the value 7 and if also x(m — 1) has any 
prescribed value a@,, x(7 — 2) the value az, and so on. 
What makes this a special Markov chain, aside from 
the fact that the states are denoted by integers, is 
that the conditional probability just described does 
not depend on n. The chain is therefore described as 
having stationary transition probabilities. If the initial 
state is given a distribution, say by prescribing that 
x(O) have the value 7 with probability p;, evaluation 
(2) becomes sum (3).Here the p,’s are any nonnega- 
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tive numbers with sum 1. If the initial distribution is 
chosen, as is not always possible, in such a way that 
the probability that (72) has the value j is p;, not only 
for n = 0 but for all values of n, the resulting process 
is stationary. 

In constructing the corresponding continuous- 
parameter Markov chain, it is supposed that, for each 
pair (7), there is a function p,(.), defined for strictly 
positive ¢, satisfying relations (4). The equations of 


Py = D0 di Py¥O =1 
J 
(4) 
Pi(S+D => Pu) PeylD 
k 


the system in the last line are known as the Chapman- 
Kolmogorov equations. A Markov stochastic process 
with continuous parameter ranging from 0 to oo can 
be constructed for which, if again p; is the proba- 
bility that x(0) has the value j, and if 0 < ft; <--- 
< tn, the probability that x(t) has the value 7, for 
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k=1,---, Nis given by sum (5). For this process, 
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if s > 0, the probability that x(u + s) has the value 
J, if x(u) has the value 7, is p,(s). The number p,(s) 
is also the probability (and this is the characteris- 
tic property of Markov processes) that x(u + s) has 
the value j if x(u) has the value 7; and if also x(u) 
has any specified value a1,x(u2), the value a2, and so 
on, where t, t2, ..., are any positive numbers less 
than u. This example is not the general continuous- 
parameter Markov chain because the transition prob- 
ability just described does not depend on u, that is, 
because the chain has stationary transition probabil- 
ities. The process is stationary if the probability that 
x(u) has the value j does not depend on u, for all j. 
The second line of relation (4) has a simple interpre- 
tation: The probability, if x(u) has the value ż, that 
x(u + s + ®) has the value f, is the sum over k of the 
probability that x(u + s) has the value k multiplied 
by the probability that, if x(u) has the value 7, and if 
x(u + $) has the value k, then x(u + s + D has the 
value j. Without the Markov property, the second fac- 
tor might depend on i. If the number of states is not 
finite or denumerably infinite, the preceding discus- 
sion is modified by replacing sums in notations (1), 
©), (£, and (5) by integrals. 

Typical applications. Typical questions that have been 
raised, and solved to a varying degree, about Markov 
processes with stationary transition probabilities are 
the following. They are phrased in the continuous- 
parameter case, for the Markov chain just described, 
and it is assumed that relation (6) holds. For con- 
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venience one defines p;(0) as 1 if i = j and as 0 
otherwise. 

1. Does p,@® have a limit when t > oo? In other 
words, for each j is there a limiting probability that 
the system is in state j as time passes? The answer 
is yes, and the limiting probability depends only on 
the end state j, not on the initial state 7, if transitions 
between all pairs of states are possible. 

2. What are the asymptotic properties of p@® as 
t — 0? The answer is that p',(0) always exists and 
is finite except possibly when 7 = j. Under further 
hypotheses on the transition probability functions, 
always satisfied if there are only finitely many states, 
pO) is finite and relations (7) hold. These equations 
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can be used to determine the transition probability 
functions in terms of assigned derivatives when t = 
0. For example, if c is a strictly positive constant, and 
if pj) is specified as c if j = i + 1, as —c if j = i, 
and as 0 otherwise, it is shown that pj = 0 ifj < i, 


and that otherwise relation (8) is true. The process 
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with these transition probabilities is known as the 
Poisson process. 

3. What are the properties of the sample func- 
tions? Under further restrictions on the process, the 
sample functions are constant on intervals, changing 
in jumps from one state to the next, and —p;(0)/p;:C(0) 
is the probability that, if the system is in state 7, 
its next jump will be into state j. If it is in the 7th 
state, the time the system remains in this state there- 
after is a random variable with density ge~'7, where 
q = —pikO). For example, in the case of the Pois- 
son process described above, it is shown that, under 
proper normalization, and if x(O0) = 0, the sample 
functions are integral-valued and monotone, increas- 
ing in unit jumps. This process is a mathematical 
model for the physical process of radioactive decay. 
That is, x(f) can be interpreted as the number of ra- 
dioactive disintegrations of a substance by time t. In 
other interpretations, x(4) is taken as the number of 
telephone calls initiated by time ¢, or the number 
of cars that have passed a given highway point by 
time ¢. The constant c is the rate at which these var- 
ious events occur. In fact, the expected value of 
x(u + b) — xu), that is, the expected number of 
events in a time interval of length ) (4 here is of 
course positive) is cb, and the probability that an 
event will occur in an interval of length 4, regardless 
of the past history of the process, is ch up to higher 
powers of b. 

There are many special types of Markov chains, for 
which more detailed questions become important. 
For example, consider the branching processes. In 
a system of particles, all of the same type, a particle 
will occasionally split, independently of its past his- 
tory and of the other particles, into j > 0 particles 
with probability qj. Ifa particle is observed at time ż, 
the probability that it will split by time £ + >» is ch, up 
to higher powers of h, where c is a strictly positive 
constant. Then the number of particles at time ż is 
a random variable x). The x(@ process is a Markov 
process, and p;(0) is easily determined in terms of 
q; and c. A more general branching process would 
permit particles of several types, and each particle 
would be allowed to split into particles of the various 
types. The rate c would depend on the particle type. 
In this case, x() is defined as a vector whose ith com- 
ponent is the number of particles of type 7 at time f. 
The x(@ stochastic process is a vector-valued Markov 
process. In studying branching processes, the most 
natural questions to ask are: What is the probability 
that the population will die out? If it does not die 
out, what is the asymptotic distribution of popula- 
tion as time passes? The answers are too technical to 
be given here. 

Transition probabilities. If the states of a Markov pro- 
cess comprise all real numbers, the character of 
the process may be similar to that of a chain 
but may also be quite different. For example, the 


sample functions of the process may be continuous. 
The most important examples of this type are the dif- 
fusion processes. Simplifying somewhat, but not as- 
suming stationary transition probabilities, consider 
a Markov process for which the probability distri- 
bution of the state at time t, given state £ at time 
s < t, has density p(s, £, t, n). Then the basic condi- 
tions satisfied by the transition density, correspond- 
ing to relation (1), are relations (9). Now suppose 
that limits (10) and (11) exist and have the indicated 
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values. These limit relations make m and o? the in- 
stantaneous rates of change of the displacement and 
its variance, given a specified time and state. If m 
and o are sufficiently regular, and if a further condi- 
tion is imposed which, roughly, makes improbable 
significant sample function changes in short times, 
the corresponding Markov process, when properly 
normalized, will have continuous sample functions. 
Moreover, the transition density will then satisfy 
backward-diffusion equation (12) and forward diffu- 
sion equation (13), also known as the Fokker-Planck 
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equation. Conversely, given a pair of coefficient func- 
tions m and 0 and o these second-order parabolic 
equations can be used to derive the corresponding 
transition densities. 

The simplest nontrivial example of a diffusion 
process corresponds to the specification m = 0 
and o a constant function. In this case, the diffu- 
sion process is the brownian motion process, or 
Wiener process: The increment x) — x(s) has a 
gaussian distribution with mean value 0 and variance 
o?°|t —s|. This is a mathematical model for the physi- 
cal brownian motion. That is, ifx(@) — x(s) represents 
the displacement in a given direction of a brownian 
particle between times s and ¢, this process is a good 
model for the actual motion. 

Martingales. A martingale is a stochastic process 
with the property that, if ti <--- < tn are param- 
eter values, the expected value of x(¢,), for given 
X), +++ , Xy_1D is equal to x(,_1). That is, the ex- 
pected future value, given present and past values, 


is equal to the present value. The interpretation that 
a martingale can be thought of as the fortune of a 
player after the successive plays of a fair gambling 
game is obvious. 

Typical results on martingales are the following. 
If a sequence of random variables is a martingale, 
it converges under weak conditions which impose 
certain “bounds” on the random variables, for exam- 
ple, if the expectation of the absolute value of the 
nth random variable is bounded independently of 
n. The sample functions of a properly normalized 
continuous-parameter martingale do not have oscil- 
latory discontinuities. 

The applications of martingale theory are too tech- 
nical to be given here, but one suggestive example 
will be given, which indicates at least how the the- 
ory can be usefully applied to information theory. 
Let y, X1, X2, ... be random variables, and let y,, be 
the expected value of y knowing x), ..., X,. Then 
Yn is a random variable which is a function of x, 
...,X, and the sequence yı, yz, ... is a martingale. 
That is, the expected values of a random variable, if 
one knows more and more, defines a sequence of 
random variables which is a martingale. 

Processes with independent increments. Such a pro- 
cess is a continuous-parameter process with the 
property that, if t; < --- <7, are parameter values, 
the successive increments in notation (14) are mutu- 

XE) = XE), -o o XE) — XEn_1) (14) 
ally notation (14) independent. If y® = x@® — xo), 
where t is fixed, the y@ process is then a Markov 
process. Both the Poisson and the brownian motion 
processes described above have independent incre- 
ments. 

Typical results on these processes include the fol- 
lowing. If such a process is properly normalized, its 
sample functions do not have oscillatory disconti- 
nuities. Moreover, the distribution of any increment 
x(@) — x(s) is infinitely divisible, and there is a stan- 
dard form for the characteristic function of any such 
distribution. See GAME THEORY; INFORMATION THE- 
ORY; LINEAR PROGRAMMING; OPERATIONS RESEARCH. 

Joseph L. Doob 
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Stoichiometry 

The field of chemistry that includes all chemical 
measurements, such as the measurements of atomic 
and molecular weights and sizes, gas volumes, vapor 
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densities, deviation from the gas laws, and the struc- 
ture of molecules. In the long struggle to determine 
the relative weights of the atoms, scientists relied 
upon combining ratios, specific heats, and measure- 
ments of gas volumes. All such measurements, and 
the calculations that relate them to each other, consti- 
tute the field of stoichiometry. Since measurements 
are expressed in mathematical terms, stoichiometry 
can be considered to be the mathematics of general 
chemistry. Thus, stoichiometry is not part of inor- 
ganic, organic, physical, or analytical chemistry, but 
is an essential part of all of them. Chemistry is an 
exact science, and it depends upon exact measure- 
ments of weights, lengths, and volumes, and on the 
amounts of energy which are absorbed or evolved in 
chemical reactions. 

In a more usual usage, the term stoichiometry 
refers to the relationships between the measured 
quantitites of substances or of energy involved in 
a chemical reaction; the calculations of these quan- 
tities include the assumption of the validity of the 
laws of definite proportions and of conservation of 
matter and energy. See CONSERVATION OF ENERGY; 
CONSERVATION OF MASS. 

Laboratory measurements are made in terms of a 
unit of weight (grams, ounces, and such), volume 
(milliliters and such) or energy (ergs, calories, and 
such), but since atoms and molecules of different 
substances have different weights, it is necessary to 
convert numbers of atoms or molecules into the mea- 
sured quantities. These quantities can be expressed 
in any convenient units. In laboratory work, weights 
are usually expressed in grams, but in chemical engi- 
neering, in pounds or tons. See ATOMIC MASS; GRAM- 
MOLECULAR WEIGHT; MOLECULAR WEIGHT. 

A typical stoichiometric problem involves predict- 
ing the weight of reactant needed to produce a de- 
sired amount of a product in a chemical reaction. For 
example, phosphorus can be extracted from calcium 
phosphate, Ca,;(POg)2, by a certain process with a 
90% yield (some calcium phosphate fails to react or 
some phosphorus is lost). In a specific problem, it 
might be necessary to determine the mass of calcium 
phosphate required to prepare 16.12 Ib of phospho- 
rus by this process. The balanced equation for the 
preparation is shown in reaction (1). In this reac- 


2Ca3(PO4)o + 10C + 6Si02 > 
6CaSi0z3 ++10CO+P, (1) 


tion, 2 moles of calcium phosphate are required to 
produce 1 mole of phosphorus. Two moles of cal- 
cium phosphate have a mass of 620 Ib, and 1 mole of 
phosphorus as P, has a mass of 124 lb. Using these 
relationships, calculation (2) is made. Since the yield 


2 moles Ca3(PO4) 620 Ib Ca3(PO4)2 
~ 124 Ib Py 

X lb C POH 
= 16.12 Ib Py 


X = 80.6 lb 





1 mole P4 


(2) 


of phosphorus is only 90%, extra Ca;(PO,4)2 must be 
used: 88.1 Ib is the mass of calcium phosphate re- 
quired to yield 16.12 Ib of phosphorus by this pro- 
cess. 

Calculations of this sort are important in chemical 
engineering processes, in which amounts and yields 
of products must be known. The same reasoning is 
used in calculations of energy generated or required. 
In this case, the energy involved in the reaction of 
a known weight of the material in question must be 
known or determined. There are many variations on 
this theme. For example, if it is necessary to deter- 
mine the composition of a mixture of copper and 
zinc powders, a given weight of the material can be 
dissolved in hydrochloric acid, which reacts with the 
zinc, but not with the copper [reaction (3)]. From 


Zn + 2HCI —> ZnCl + H2 6) 


the volume of hydrogen gas liberated, the amount of 
zinc dissolved can be calculated, and hence the com- 
position of the mixture. Again, if 5.0 g of sodium is 
placed in a vessel containing 5.0 g of chlorine, a vi- 
olent reaction takes place [reaction (4)]. However, 


2Na + Cl > 2NaCl (4) 


some of the sodium remains unattacked. It can be 
calculated that 5.0 g of chlorine can react with only 
3.2 g of sodium; 1.8 g of sodium will remain. The 
chlorine is said to be the limiting reagent—when it 
is used up, the reaction must stop. See AVOGADRO’S 
LAW; COMBINING VOLUMES, LAW OF; DALTON’S LAW; 
DEFINITE COMPOSITION, LAW OF; MULTIPLE PROPOR- 
TIONS, LAW OF; SPECIFIC HEAT. 

The calculations discussed in this article involve 
compounds in which the ratio of atoms is generally 
simple. For a discussion of compounds in which the 
relative number of atoms cannot be expressed as ra- 
tios of small whole numbers. See NONSTOICHIOMET- 
RIC COMPOUNDS.. John C. Bailar, Jr. 

Bibliography. J. A. Olmsted and G. M. Williams, 
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istry, 9th ed., 2005. 
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Stoker 


A mechanical means for feeding coal into, and for 
burning coal in, a furnace. There are three basic 
types of stokers. Chain or traveling-grate stokers have 
a moving grate on which the coal burns; they carry 
the coal from a hopper into the furnace and move the 
ash out. Spreader stokers mechanically or pneumati- 
cally distribute the coal from a hopper at the furnace 
front wall and move it onto the grate which usually 
moves continuously to dispose of the ash after the 
coal is burned. Underfeed stokers are arranged to 
force fresh coal from the hopper to the bottom of 
the burning coal bed, usually by means of a screw 
conveyor. The ash is forced off the edges of the retort 
peripherally to the ashpit or is removed by hand. See 
STEAM-GENERATING FURNACE. George W. Kessler 
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Stokes’ theorem The assertion that under certain light 
restrictions the surface integral of (A § x % ) - v over a surface 
patch S is equal to the line integral of § - t taken around C, the 
boundary curve of S, provided the sense of transcription of C is 
right-handed relative to v. This can be expresed as 


f (vx a)-vas = f -rds 


Here % is a vector function, v is one of the two unit normals to 
the two-sided surface S, s is arc length measured positively in 
the sense which is right-handed relative to v, and t is the unit 
tangent vector to C in the sense of increasing s. See CALCULUS 
OF VECTORS. [H.VC.] 


Stolonifera An order of the Alcyonaria which lacks coe- 
nenchyme. They form either simple or rather complex colonies. 
The polyp has a cylindrical body with a retractile oral portion 
which can withdraw into a solid anthostele or calyx protected 
by many calcareous spicules. The base of the mature polyp is 
attached to a creeping stolon which is a ribbonlike network or 
thin flat mat from which daughter polyps arise. Daughter polyps 
never bud from the wall of the primary polyp. Each polyp is 
connected by solenial tubes of the stolons, or by transverse plat- 
forms. See ALCYONARIA. [K.At.] 


Stomach The tubular or saccular abdominal organ of the 
digestive system adapted for temporary food storage and pre- 
liminary stages of food breakdown. 

In some primitive vertebrates the stomach may be little more 
than a simple tube quite similar to other portions of the gas- 
trointestinal tract. In other forms the stomach is a distinct, and 
frequently large, saclike structure of variable shape. Carnivo- 
rous forms typically have a better-developed stomach than her- 
bivores, probably reflecting the larger but less numerous feedings 
characteristic of the former, but exceptions are numerous. 

In birds the stomach consists of a proventriculus and a gizzard. 
The former is well supplied with glands which secrete softening 
and digestive enzymes; the latter is a strong, muscular grinding 
organ whose action is often enhanced by the ingestion of small 
stones. 

Mammals have stomachs which vary considerably in structure. 
Although a single chamber is most common, some mammals, 
such as cows and their relatives (ruminants), have as many as 
four. These chambers may have developed either from modifi- 
cations of the posterior portions of the esophagus or from alter- 
ations of the stomach itself. 

The human stomach is located beneath the diaphragm, 
through which the posterior, terminal end of the esophagus 
passes. The stomach appears as a dilated tube continuous with 
the distal end of the esophagus. The upper curvature of the stom- 
ach is usually above and to the left of the esophageal orifice. This 
expanded anterior portion is the fundus and is commonly filled 
with air or gas. The body (corpus) of the stomach is directed 
toward the attenuated right extremity or pyloric region and is 
subject to variations in size and shape, depending upon func- 
tional activities, habits, disease, and volume of the contents. The 
pyloric walls are marked by the heavy sphincter muscle which 
controls the passage of chyme (a semiliquid fluid produced by 
the mechanical and chemical changes of preliminary digestion) 
into the duodenum. 

The stomach of vertebrates is lined by a mucous membrane 
that is usually thrown into longitudinal folds called rugae. Most of 
the surface is covered with mucus-secreting epithelial cells, but 
scattered throughout the lining are many small glandular pits 
which are lined with one or more types of secretory cells. See 
DIGESTIVE SYSTEM. (T.S.P] 
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Stomatopoda The only order of the superorder Ho- 
plocarida belonging to the subclass Malacostraca of the class 
Crustacea. This order of the mantis shrimps contains four fami- 
lies: Lysioquillidae, Bathysquillidae, Squillidae, and Gonodactyl- 
idae. Stomatopoda are among the larger Crustacea; the size 
of an adult ranges from about 0.5 in. (12 mm) to over 12 in. 
(300 mm). 

The bodies of stomatopods are narrow and elongate, almost 
eruciform. Only part of the cephalic and thoracic somites are 
fused and covered by the dorsal shield, the carapace. Those 
cephalic somites which bear the antennulae and the eyes are 
free and visible anterior to the carapace, and the last four tho- 
racic somites are similarly exposed. The tail fan consists of a 
well-developed, sometimes peculiarly sculptured or deformed, 
median plate, the telson, and the two uropods. Anteriorly, the 
carapace bears a flattened movable plate, the rostrum. The eyes 
are large, stalked, and movable. Of eight pairs of thoracic ap- 
pendages the first is narrow, slender, and hairy, probably being 
used for cleaning purposes. The second thoracic leg is very strong 
and heavy. It has become a large raptorial claw that shows a great 
resemblance to that of the praying mantis, and for this reason 
the Stomatopoda are often given the name mantis shrimps. 

The Stomatopoda are marine animals rarely found in brackish 
water. Most species are confined to tropical and subtropical ar- 
eas, though some occur in the boreal and antiboreal regions. The 
majority of stomatopods live in the littoral and sublittoral zones, 
but a few species have been found in greater depths, down to 
2500 ft (760 m). See MALACOSTRACA. [L.B.Ho.] 
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Stone and stone products The term stone is applied 
to rock that is cut, shaped, broken, crushed, and otherwise phys- 
ically modified for commercial use. The two main divisions are 
dimension stone and crushed stone. Other descriptive terms may 
be used, for example, building stone, roofing stone, or precious 
stone. See GEM; ROCK. 

The term dimension stone is applied to blocks that are cut and 
milled to specified sizes, shapes, and surface finishes. The princi- 
pal uses are for building and ornamental applications. Granites, 
limestones, sandstones, and marbles are widely used; basalts, 
diabases, and other dark igneous rocks are used less extensively. 
Soapstone is used to some extent. Rock suitable for use as di- 
mension stone must be obtainable in large, sound blocks, free 
from incipient cracks, seams, and blemishes, and must be with- 
out mineral grains that might cause stains as a result of weath- 
ering. It must have an attractive color, and generally a uniform 
texture. 

Slate differs from other dimension stone because it can be 
split into thin sheets of any desired thickness. Commercial slate 
must be uniform in quality and texture and reasonably free from 
knots, streaks, or other imperfections, and have good splitting 
properties. Roofing slates are important products of most slate 
quarries. However, the roofing-slate industry has declined con- 
siderably because of competition from other types of roofing. 
Slate is also used for milled products such as blackboards, elec- 
trical panels, window and door sills and caps, baseboards, stair 
treads, and floor tile. See SLATE. 

Nearly all the principal types of stone—granite, diabase, 
basalt, limestone, dolomite, sandstone, and marble—may be 
used as sources of commercial crushed stone; limestone is by far 
the most important. Crushed stone is made from sound, hard 
stone, free from surface alteration by weathering. Stone that 
breaks in chunky, more or less cubical fragments is preferred. 
Commercial stone should be free from certain deleterious im- 
purities, such as opalescent quartz, and free from clay or silt. 
Crushed stone is used principally as concrete aggregate, as road 
stone, or as railway ballast. Other uses for limestone are as a 
fluxing material to remove impurities from ores smelted in metal- 
lurgical furnaces, in the manufacture of alkali chemicals, calcium 
carbide, glass, paper, paint, and sugar, and for filter beds and for 
making mineral wool. [R.L.B.] 
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Storage tank A container for storing liquids or gases. 
A tank may be constructed of ferrous or nonferrous metals or 
alloys, reinforced concrete, wood, or filament-wound plastics, 
depending upon its use. Tanks resting on the ground have flat 
bottoms: those supported on towers have either flat or curved 
bottoms. Standpipes, which are usually cylindrical shells of steel 
or reinforced concrete resting on the ground, are frequently of 
great height and comparatively small diameter. They are built to 
contain water for a distribution system, and height is required 
to maintain pressure in the system. Tanks for other liquids and 
for gases, where storage is more important than pressure, are 
generally lower and of greater diameter. [C.M.A.] 
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Storm An atmospheric disturbance involving perturbations 
of the prevailing pressure and wind fields on scales ranging 
from tornadoes (0.6 mi or 1 km across) to extratropical cyclones 
(1.2-1900 mi or 2-3000 km across); also, the associated weather 
(rain storm, blizzard, and the like). Storms influence human ac- 
tivity in such matters as agriculture, transportation, building con- 
struction, water impoundment and flood control, and the gen- 
eration, transmission, and consumption of electric energy. See 
WIND. 

The form assumed by a storm depends on the nature of 
its environment, especially the large-scale flow patterns and 
the horizontal and vertical variation of temperature; thus the 
storms most characteristic of a given region vary according to 
latitude, physiographic features, and season. Extratropical cy- 
clones and anticyclones are the chief disturbances over roughly 
half the Earth’s surface. Their circulations control the embed- 
ded smaller-scale storms. Large-scale disturbances of the tropics 
differ fundamentally from those of extratropical latitudes. See 
HURRICANE; SQUALL; THUNDERSTORM; TORNADO; TROPICAL METEO- 
ROLOGY. 

Cyclones form mainly in close proximity to the jet stream, that 
is, in strongly baroclinic regions where there is a large increase 
of wind with height. Weather patterns in cyclones are highly 
variable, depending on moisture content and thermodynamic 
stability of air masses drawn into their circulations. Warm and 
occluded fronts, east of and extending into the cyclone center, 
are regions of gradual upgliding motions, with widespread cloud 
and precipitation but usually no pronounced concentration of 
stormy conditions. Extensive cloudiness also is often present in 
the warm sector. Passage of the cold front is marked by a sud- 
den wind shift, often with the onset of gusty conditions, with 
a pronounced tendency for clearing because of general subsi- 
dence behind the front. Showers may be present in the cold air 
if it is moist and unstable because of heating from the surface. 
Thunderstorms, with accompanying squalls and heavy rain, are 
often set off by sudden lifting of warm, moist air at or near the 
cold front, and these frequently move eastward into the warm 
sector. See CYCLONE; JET STREAM; WEATHER. 

Extratropical cyclones alternate with high-pressure systems or 
anticyclones, whose circulation is generally opposite to that of 
the cyclone. The circulations of highs are not so intense as in 
well-developed cyclones, and winds are weak near their centers. 
In low levels the air spirals outward from a high; descent in up- 
per levels results in warming and drying aloft. Anticyclones fall 
into two main categories, the warm “subtropical” and the cold 
“polar” highs. 

Between the scales of ordinary air turbulence and of cy- 
clones, there exist a variety of circulations over a middle-scale or 
mesoscale range, loosely defined as from about one-half up to 
a few hundred miles. Alternatively, these are sometimes referred 
to as subsynoptic-scale disturbances because their dimensions 
are so small that they elude adequate description by the ordi- 
nary synoptic network of surface weather stations. Thus their 
detection often depends upon observation by indirect sensing 
systems. See METEOROLOGICAL SATELLITES; RADAR METEOROLOGY; 
STORM DETECTION. [C.WN.] 


Storm detection [Identifying storm formation, monitor- 
ing subsequent storm evolution, and assessing the potential for 
destruction of life and property through application of various 
methods and techniques. Doppler radars, satellite-borne instru- 
ments, lightning detection networks, and surface observing net- 
works are used to detect the genesis of storms, to diagnose their 
nature, and to issue warnings when a threat to life and property 
exists. See STORM. 

Radar surveillance. Radars emit pulses of electromagnetic 
radiation that are broadcast in a beam, whose angular resolu- 
tion is about 1° with a range resolution of about 0.5 km (0.3 mi). 
The radar beam may intercept precipitation particles in a storm 
that reflect a fraction of the transmitted energy to the transmitter 
site (generally called reflectivity or the scatter cross section per 
unit volume). As the transmitter sweeps out a volume by rotating 
and tilting the transmitting antenna, the reflectivity pattern of the 
precipitation particles embodied in the storm is defined. Doppler 
radars also can measure the velocities of precipitation particles 
along the beam (radial velocity). Reflectivity and velocity pat- 
terns of the storm hydrometeors then make it possible to diag- 
nose horizontal and vertical circulations that may arise within the 
storm, and to estimate the type and severity of weather elements 
attending the storm, such as rainfall, hail, damaging winds, and 
tornadoes. See DOPPLER RADAR; PRECIPITATION (METEOROLOGY); 
RADAR; RADAR METEOROLOGY; WIND. 

Satellite surveillance. Since the early 1960s, meteorologi- 
cal data from satellites have had an increasing impact on storm 
detection and monitoring. In December 1966 the first geosta- 
tionary Applications Technology Satellite (ATS 1) allowed fore- 
casters in the United States to observe storms in animation. A 
Geostationary Operational Environmental Satellite (GOES) pro- 
gram was initiated within the National Oceanic and Atmospheric 
Administration (NOAA) with the launch of GOES 1 in October 
1975. The visible and infrared spin scan radiometer (VISSR) pro- 
vided imagery, which significantly advanced the ability of mete- 
orologists to detect and observe storms by providing frequent- 
interval visible and infrared imagery of the Earth surface, cloud 
cover, and atmospheric moisture patterns. 

The first of NOAA’s next generation of geostationary satellites, 
GOES 8 was launched in the spring of 1994. GOES 8 introduced 
improved capabilities to detect and observe storms. The GOES 
8 system includes no conflict between imaging and sounding 
operation, multispectral imaging with improved resolution and 
better signal-to-noise in the infrared bands, and more accurate 
temperature and moisture soundings of the storm environment. 
The Earth’s atmosphere is observed nearly continuously. 

Derived-product images showing fog and stratus areas from 
GOES 8 are created by combining direct satellite measurements, 
such as by subtracting brightness temperatures at two different 
wavelengths. GOES 8 shows the fog and stratus much more 
clearly because of its improved resolution. This capability en- 
ables forecasters to detect boundaries between rain-cooled ar- 
eas having fog or low clouds, and clear areas. Such boundaries 
are frequently associated with future thunderstorm development. 
The sounder on GOES 8 is capable of fully supporting routine 
forecasting operations. This advanced sounding capability con- 
sists of better vertical resolution in both temperature and mois- 
ture, and improved coverage of soundings in and around cloudy 
weather systems. See CLOUD; FOG; METEOROLOGICAL SATELLITES; 
SATELLITE METEOROLOGY. 

Surface observing systems. Larger convective storm sys- 
tems such as squall lines and mesoscale convective systems 
can be detected (but not fully described) by the temperature, 
moisture, wind, and pressure patterns observed by appropri- 
ate surface instrumentation. Automatic observing systems pro- 
vide frequent data on pressure, temperature, humidity, wind, 
cloud base, and most precipitation types, intensity, and accu- 
mulation. Analyses of these data, combined with improved con- 
ceptual models of convective storm systems, enable forecasters 


to detect and monitor the intense mesoscale fluctuations in pres- 
sure and winds that often accompany the passage of convective 
weather systems such as bow echoes, derechos (strong, straight- 
line winds), and squall lines. A bow echo is a specific radar reflec- 
tivity pattern associated with a line of thunderstorms. The mid- 
dle portion of the thunderstorm line is observed to move faster 
than the adjacent portions, causing the line of storms to assume 
a bowed-out configuration. Other analyses of these mesoscale 
data fields aid the forecaster in detecting favorable areas for thun- 
derstorm cell regeneration, which may produce slowly moving 
mesoscale convective storms attended by heavy rains and flash 
floods. See SQUALL LINE; WEATHER OBSERVATIONS. 
Cloud-to-ground lightning detectors. Lightning location 
stations provide forecasters with the location, polarity, peak cur- 
rent, and number of strokes in a flash to ground within seconds 
of the flash occurrence. Useful applications have emerged with 
regard to the detection and tracking of thunderstorms, squall 
lines, other mesoscale convective systems, and the weather ac- 
tivity that accompany these phenomena, such as tornadoes 
and hail. See LIGHTNING; MESOMETEOROLOGY; SFERICS; THUNDER- 
STORM; WEATHER FORECASTING AND PREDICTION. [C.FCh.] 


Storm electricity Processes responsible for the separa- 
tion of positive and negative electric charges in the atmosphere 
during storms, including the spectacular manifestation of this 
charge separation: lightning discharges. Cloud electrification is 
almost invariably associated with convective activity and with the 
formation of precipitation in the form of liquid water (rain) and 
ice particles (graupel and hail). The most vigorous convection 
and active lightning occurs in the summertime, when the energy 
source for convection, water vapor, is most prevalent. Winter 
snowstorms can also be strongly electrified, but they produce 
far less lightning than summer storms. Electrified storm clouds 
occasionally occur in complete isolation; more commonly they 
are found in convective clusters or in lines that may extend hor- 
izontally for hundreds of kilometers. See PRECIPITATION (METEO- 
ROLOGY). 

Measurements of electric field at the ground and from instru- 
mented balloons within thunderclouds have disclosed an elec- 
trostatic structure that appears to be fairly systematic throughout 
the world. The measurements show that the principal variations 
in charge occur in the vertical and are affected by the tempera- 
ture of the cloud. The charge structure within a thundercloud is 
tripolar, with a region of dominant negative charge sandwiched 
between an upper region of positive charge and a subsidiary 
lower region of positive charge. In addition to the charge accu- 
mulations described within the cloud, electrical measurements 
disclose the existence of charge-screening layers at the upper 
cloud boundary and a layer of positive charge near the Earth’s 
surface beneath the cloud. These secondary charge accumula- 
tions arising from charge migration outside the cloud are caused 
by electrostatic forces of attraction set up by the charges within 
the cloud. 

Large differences of electric potential are associated with the 
distribution of charge maintained by active thunderclouds. These 
large differences in potential are maintained by charging currents 
that result from the motions of air and particles. The charging 
currents range from milliamperes in small clouds that are not 
producing lightning to several amperes for large storms with high 
rates of lightning. See CLOUD PHYSICS. 

In response to charge separation within a thundercloud, the 
electric field increases to a value of approximately 10° V/m 
(300,000 V/ft) at which point dielectric breakdown occurs and 
lightning is initiated. Most lightning extends through the cloud 
at speeds of 10,000-100,000 m/s (22,000-220,000 mi/h). The 
peak temperature of lightning, which is a highly ionized plasma, 
may exceed 30,000 K (54,000°F). The acoustic disturbance 
caused by the sudden heating of the atmosphere by lightning 
is thunder. See LIGHTNING; THUNDER. 
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Meteorologists have shown a growing interest in the large- 
scale display of real-time lightning activity, since lightning is one 
of the most sensitive indicators of convective activity. Research 
has expanded into relationships between lightning characteristics 
and the meteorological evolution of different types of storms. The 
discovery of the sensitive dependence of local lightning activity 
on the temperature of surface air has led to research focused on 
the use of the global electrical circuit as a diagnostic for global 
temperature change. See ATMOSPHERIC ELECTRICITY; STORM DE- 
TECTION; THUNDERSTORM. [E.Wi.] 


Storm surge An anomalous rise in water elevations 
caused by severe storms approaching the coast. A storm surge 
can be succinctly described as a large wave that moves with the 
storm that caused it. The surge is intensified in the nearshore, 
shallower regions where the surface stress caused by the strong 
onshore winds pile up water against the coast, generating an op- 
posing pressure head in the offshore direction. However, there 
are so many other forces at play in the dynamics of the storm 
surge phenomenon, such as bottom friction, Earth’s rotation, 
inertia, and interaction with the coastal geometry, that a sim- 
ple static model cannot explain all the complexities involved. 
Scientists and engineers have dedicated many years in the de- 
velopment and application of sophisticated computer models to 
accurately predict the effects of storm surges. 

The intensity and dimension of the storm causing a surge, 
and thus the severity of the ensuing surge elevations, depend 
on the origin and atmospheric characteristics of the storm itself. 
Hurricanes and severe extratropical storms are the cause of most 
significant surges. In general, hurricanes are more frequent in low 
to middle latitudes, and extratropical storms are more frequent 
in middle to high latitudes. See HURRICANE; STORM. [S.R.S.] 


Straight-line mechanism A mechanism that pro- 
duces a straight-line (or nearly so) output motion from an input 
element that rotates, oscillates, or moves in a straight line. Com- 
mon machine elements, such as linkages, gears, and cams, are 
often used in ingenious ways to produce the required controlled 
motion. The more elegant designs use the properties of special 
points on one of the links of a four-bar linkage. See MECHANISM. 

Four-bar linkages that generate approximate straight lines are 
not new. In 1784 James Watt applied the concept to the vertical- 
cylinder beam engine. By selecting the appropriate link lengths, 
the designer can easily develop a mechanism with a high-quality 
approximate straight line. Contemporary kinematicians have 
contributed to more comprehensive studies of the properties of 
the mechanisms that generate approximate straight lines. The 
work not only describes the various classical mechanisms, but 
also provides design information on the quality (the amount of 
deviation from a straight line) and the length of the straight-line 
output. See FOUR-BAR LINKAGE; LINKAGE (MECHANISM). 

Gears can also be used to generate straight-line motions. The 
most common combination would be a rack-and-pinion gear. 
See GEAR. 

Cam mechanisms are generally not classified as straight-line 
motion generators, but translating followers easily fall into the 
classical definition. See CAM MECHANISM. [J.A.Sm.] 


Strain gage A device which measures mechanical defor- 
mation (strain). Normally it is attached to a structural element, 
and uses the change of electrical resistance of a wire or semicon- 
ductor under tension. Capacity, inductance, and reluctance are 
also used. 

The strain gage converts a small mechanical motion to an 
electrical signal by virtue of the fact that when a metal (wire 
or foil) or semiconductor is stretched, its resistance is increased. 
The change in resistance is a measure of the mechanical motion. 
In addition to their use in strain measurement, these gages are 
used in sensors for measuring the load on a mechanical member, 
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forces due to acceleration on a mass, or stress on a diaphragm 
or bellows. See STRAIN. [J.H.Z.] 


Strake The slender forward extension of the inboard region 
of the wing of a combat aircraft used to provide increased lift 
in the high angle-of-attack maneuvering condition. In contrast 
to the normal attached-flow design principles, these strakes 
are built to allow the flow to separate along the leading edge 
in the high angle-of-attack range and to roll up into strong 
leading-edge vortices. The illustration shows a typical strake in- 
stallation and the vortices generated. These are highly stable, and 


strake ——_ \ 


vortex — 


Strake and resulting vortex flow. 


their strong swirling motion creates a lower pressure area on the 
strake upper surface, resulting in a large incremental increase in 
lift force known as vortex lift. This vortex lift increases rapidly in 
the high angle-of-attack range and is less susceptible to the nor- 
mal stalling characteristics encountered with conventional lifting 
surfaces. See AIRFOIL; SUBSONIC FLIGHT; VORTEX; WING. __[E.C.Po.] 


Strand line = The line that marks the separation of land and 
water along the margin of a pond, lake, sea, or ocean; also 
called the shoreline. The strand line is very dynamic. It changes 
with the tides, storms, and seasons, and as long-term sea-level 
changes take place. The sediments on the beach respond to 
these changes, as do the organisms that live in this dynamic 
environment. On a beach organisms move with the tides, and 
on a rocky coast they tend to be organized relative to the strand 
line because of special limitations or adaptations to exposure. 
Geologists who study ancient coastal environments com- 
monly try to establish where the strand line might be in the rock 
strata. This can sometimes be determined by a combination of 
the nature and geometry of individual laminations in the rock, 
by identifying sedimentary structures that occur at or near the 
strand line. The most indicative of these structures are swash 
marks, which are very thin accumulations of sand grains that 
mark the landward uprush of a wave on the beach. See FAcIES 
(GEOLOGY); PALEOGEOGRAPHY; STRATIGRAPHY. [R.A.D.] 


Strange particles Bound states of quarks, in which at 
least one of these constituents is of the strange (s) type. Strange 
quarks are heavier than the up (u) and down (d) quarks, which 
form the neutrons and protons in the atomic nucleus. Neutrons 
(udd) and protons (uud) are the lightest examples of a fam- 
ily of particles composed of three quarks, known as baryons. 
These and other composite particles which interact dominantly 
through the strong (nuclear) force are known as hadrons. The 
first strange hadron discovered (in cosmic rays in 1947) was 
named the lambda baryon, A; it is made of the three-quark 
combination uds. A baryon containing a strange quark is also 
called a hyperon. Although strange particles interact through the 
strong (nuclear) force, the strange quark itself can decay only by 
conversion to a quark of different type (such as u or d) through 
the weak interaction. For this reason, strange particles have very 
long lifetimes, of the order of 107! s, compared to the lifetimes 


of the order of 107% s for particles which decay directly through 
the strong interaction. This long lifetime was the origin of the 
term strange particles. See BARYON; HADRON; NEUTRON; PROTON; 
STRONG NUCLEAR INTERACTIONS. 

In addition to strange baryons, strange mesons occur. The 
lightest of these are the kaons (K* = uš and K? = d5 ) and the 


antikaons (K° = sd andK~ = su _). Kaons and their antiparticles 
have been very important in the study of the weak interaction 
and in the detection of the very weak CP violation, which causes 
a slow transition between neutral kaons and neutral antikaons. 
See ELEMENTARY PARTICLE; MESON; QUARKS. [TBar.] 


Strangles A highly contagious disease of the upper respi- 
ratory tract of horses and other members of the family Equidae, 
characterized by inflammation of the pharynx and abscess for- 
mation in lymph nodes. This disease occurs in horses of all ages 
throughout the world. The causative agent is Streptococcus equi, 
a clonal pathogen apparently derived from an ancestral strain of 
S. zooepidemicus. It is an obligate parasite of horses, donkeys, 
and mules. See STREPTOCOCCUS. 

Strangles is most common and most severe in young horses, 
and is very prevalent on breeding farms. The causative agent has 
been reported to survive for 7 weeks in pus but dies in a week or 
two on pasture. Transmission is either direct by nose or mouth 
contact or aerosol, or indirect by flies, drinking buckets, pasture, 
and feed. The disease is highly contagious under conditions of 
crowding, exposure to severe climatic conditions such as rain and 
cold, and prolonged transportation. Carrier animals, although of 
rare occurrence, are critical in maintenance of the streptococcus 
and in initiation of outbreaks. 

The mean incubation period is about 10 days, with a range of 
3-14 days. The animal becomes quieter, has fever of 39-40.5°C 
(102-105°F), nasal discharge, loss of appetite, and swelling of 
one or more lymph nodes of the mouth. Pressure of a lymph 
node on the airway may cause respiratory difficulty. Abscesses 
in affected lymph nodes rupture in 7-14 days, and rapid clinical 
improvement and recovery then ensues. Recovery is associated 
with formation of protective antibodies in the nasopharynx and 
in the serum. See ANTIBODY. 

Streptococcus equi is easily demonstrated in smears of pus 
from abscesses and in culture of pus or nasal swabs on colistin— 
nalidixic acid blood agar. Acutely affected animals also show 
elevated white blood cell counts and elevated fibrinogen. 

Commercially available vaccines are injected in a schedule of 
two or three primary inoculations followed by annual boosters. 
However, the clinical attack rate may be reduced by only 50%, a 
level of protection much lower than that following the naturally 
occurring disease. See IMMUNITY. 

Procaine penicillin G is the antibiotic of choice and quickly 
brings about reduction of fever and lymph node enlargement. 
See BIOLOGICALS. (J.ET] 


Stratigraphy A discipline involving the description and in- 
terpretation of layered sediments and rocks, and especially their 
correlation and dating. Correlation is a procedure for determin- 
ing the relative age of one deposit with respect to another. The 
term “dating” refers to any technique employed to obtain a nu- 
merical age, for example, by making use of the decay of ra- 
dioactive isotopes found in some minerals in sedimentary rocks 
or, more commonly, in associated igneous rocks. To a large ex- 
tent, layered rocks are ones that accumulated through sedimen- 
tary processes beneath the sea, within lakes, or by the action of 
rivers, the wind, or glaciers; but in places such deposits contain 
significant amounts of volcanic material emplaced as lava flows 
or as ash ejected from volcanoes during explosive eruptions. 
See DATING METHODS; IGNEOUS ROCKS; ROCK AGE DETERMINATION; 
SEDIMENTARY ROCKS. 

Sedimentary successions are locally many thousands of me- 
ters thick owing to subsidence of the Earth’scrust over millions of 


years. Sedimentary basins therefore provide the best available 
record of Earth history over nearly 4 billion years. That record 
includes information about surficial processes and the varying 
environment at the Earth’s surface, and about climate, changing 
sea level, the history of life, variations in ocean chemistry, and 
reversals of the Earth’s magnetic field. Sediments also provide a 
record of crustal deformation (folding and faulting) and of large- 
scale horizontal motions of the Earth’s lithospheric plates (con- 
tinental drift). Stratigraphy applies not only to strata that have 
remained flat-lying and little altered since their time of deposi- 
tion, but also to rocks that may have been strongly deformed or 
recrystallized (metamorphosed) at great depths within the Earth’s 
crust, and subsequently exposed at the Earth’ssurface as a result 
of uplift and erosion. As long as original depositional layers can 
be identified, some form of stratigraphy can be undertaken. See 
BASIN; CONTINENTAL DRIFT; FAULT AND FAULT STRUCTURES; SEDIMEN- 
TOLOGY. 

An important idea first articulated by the Danish naturalist 
Nicolaus Steno in 1669 is that in any succession of strata the 
oldest layer must have accumulated at the bottom, and succes- 
sively younger layers above. It is not necessary to rely on the 
present orientation of layers to determine their relative ages be- 
cause most sediments and sedimentary rocks contain numerous 
features, such as current-deposited ripples, minor erosion sur- 
faces, or fossils of organisms in growth position, that have a 
well-defined polarity with respect to the up direction at the time 
of deposition (so-called geopetal indicators). This principle of 
superposition therefore applies equally well to tilted and even 
overturned strata. Only where a succession is cut by a fault is 
a simple interpretation of stratigraphic relations not necessar- 
ily possible, and in some cases older rocks may overlie younger 
rocks structurally. See DEPOSITIONAL SYSTEMS AND ENVIRONMENTS. 

The very existence of layers with well-defined boundaries im- 
plies that the sedimentary record is fundamentally discontinu- 
ous. Discontinuities are present in the stratigraphic record at a 
broad range of scales, from that of a single layer or bed to phys- 
ical surfaces that can be traced laterally for many hundreds of 
kilometers. Large-scale surfaces of erosion or nondeposition are 
known as unconformities, and they can be identified on the basis 
of both physical and paleontological criteria. See PALEONTOLOGY; 
UNCONFORMITY. 

Most stratal discontinuities possess time-stratigraphic signifi- 
cance because strata below a discontinuity tend to be every- 
where older than strata above. To the extent that unconformities 
can be recognized and traced widely within a sedimentary basin, 
it is possible to analyze sedimentary rocks in a genetic frame- 
work, that is, with reference to the way they accumulated. This 
is the basis for the modern discipline of sequence stratigraphy, 
so named because intervals bounded by unconformities have 
come to be called sequences. 

Traditional stratigraphic analysis has focused on variations in 
the intrinsic character or properties of sediments and rocks— 
properties such as composition, texture, and included fossils 
(lithostratigraphy and biostratigraphy)—and on the lateral trac- 
ing of distinctive marker beds such as those composed of ash 
from a single volcanic eruption (tephrostratigraphy). The tech- 
niques of magnetostratigraphy and chemostratigraphy are also 
based on intrinsic characteristics, although these techniques re- 
quire sophisticated laboratory analysis. Sequence stratigraphy 
attempts to integrate these approaches in the context of stratal 
geometry, thereby providing a unifying framework in which to 
investigate the time relations between sediment and rock bod- 
ies as well as to measure their numerical ages (chronostratigra- 
phy and geochronology). Seismic stratigraphy is a variant of the 
technique of sequence stratigraphy in which unconformities are 
identified and traced in seismic reflection profiles on the basis 
of reflection geometry. See GEOCHRONOMETRY; SEISMIC STRATIG- 
RAPHY; SEISMOLOGY. 

Conventional stratigraphy currently recognizes two kinds of 
stratigraphic unit: material units, distinguished on the basis of 
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some specified property or properties or physical limits; and tem- 
poral or time-related units. A common example of a material unit 
is the formation, a lithostratigraphic unit defined on the basis of 
lithic characteristics and position within a stratigraphic succes- 
sion. Each formation is referred to a section or locality where 
it is well developed (a type section), and assigned an appro- 
priate geographic name combined with the word formation or 
a descriptive lithic term such as limestone, sandstone, or shale 
(for example, Tapeats Sandstone). Some formations are divisible 
into two or more smaller-scale units called members and beds. 
In other cases, formations of similar lithic character or related 
genesis are combined into composite units called groups and 
supergroups. 

Sequence stratigraphy differs from conventional stratigraphy 
in two important respects. The first is that basic units (sequences) 
are defined on the basis of bounding unconformities and cor- 
relative conformities rather than material characteristics or age. 
The second is that sequence stratigraphy is fundamentally not a 
system for stratigraphic classification, but a procedure for deter- 
mining how sediments accumulate. See SEQUENCE STRATIGRAPHY. 

[N.C.B.] 


Stratosphere 


Stratosphere The atmospheric layer that is immediately 
above the troposphere and contains most of the Earth’s ozone. 
Here temperature increases upward because of absorption of 
solar ultraviolet light by ozone. Since ozone is created in sunlight 
from oxygen, a by-product of photosynthesis, the stratosphere 
exists because of life on Earth. In turn, the ozone layer allows life 
to thrive by absorbing harmful solar ultraviolet radiation. The 
mixing ratio of ozone is largest (10 parts per million by volume) 
near an altitude of 30 km (18 mi) over the Equator. The dis- 
tribution of ozone is controlled by solar radiation, temperature, 
wind, reactive trace chemicals, and volcanic aerosols. See AT- 
MOSPHERE; TROPOSPHERE. 

The heating that results from absorption of ultraviolet radiation 
by ozone causes temperatures generally to increase from the 
bottom of the stratosphere (tropopause) to the top (stratopause) 
near 50 km (30 mi), reaching 280 K (45°F) over the summer 
pole. This temperature inversion limits vertical mixing, so that 
air typically spends months to years in the stratosphere. See 
TEMPERATURE INVERSION; TROPOPAUSE. 

The lower stratosphere contains a layer of small liquid droplets. 
Typically less than 1 micrometer in diameter, they are made pri- 
marily of sulfuric acid and water. Occasional large volcanic erup- 
tions maintain this aerosol layer by injecting sulfur dioxide into 
the stratosphere, which is converted to sulfuric acid and incorpo- 
rated into droplets. Enhanced aerosol amounts from an eruption 
can last several years. By reflecting sunlight, the aerosol layer can 
alter the climate at the Earth’s surface. By absorbing upwelling 
infrared radiation from the Earth’s surface, the aerosol layer can 
warm the stratosphere. The aerosols also provide surfaces for 
a special set of chemical reactions that affect the ozone layer. 
Liquid droplets and frozen particles generally convert chlorine- 
bearing compounds to forms that can destroy ozone. They also 
tend to take up nitric acid and water and to fall slowly, thereby re- 
moving nitrogen and water from the stratosphere. The eruption 
of Mount Pinatubo (Philippines) in June 1991 is believed to have 
disturbed the Earth system for several years, raising stratospheric 
temperatures by more than 1 K (1.8°F) and reducing global sur- 
face temperatures by about 0.5 K (0.9°F). See AEROSOL. 

Ozone production is balanced by losses due to reactions with 
chemicals in the nitrogen, chlorine, hydrogen, and bromine fami- 
lies. Reaction rates are governed by temperature, which depends 
on amounts of radiatively important species such as carbon 
dioxide. Human activities are increasing the amounts of these 
molecules and are thereby affecting the ozone layer. Evidence 
for anthropogenic ozone loss has been found in the Antarctic 
lower stratosphere. Near polar stratospheric clouds, chlorine and 
bromine compounds are converted to species that, when the Sun 
comes up in the southern spring, are broken apart by ultraviolet 
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radiation and rapidly destroy ozone. This sudden loss of ozone is 
known as the anthropogenic Antarctic ozone hole. See STRATO- 
SPHERIC OZONE. [M.H.H.] 


Stratospheric ozone While ozone is found in trace 
quantities throughout the atmosphere, the largest concentrations 
are located in the lower stratosphere in a layer between 9 and 
18 mi (15 and 30 km). Atmospheric ozone plays a critical role for 
the biosphere by absorbing the ultraviolet radiation with wave- 
length (A) 240-320 nanometers. This radiation is lethal to sim- 
ple unicellular organisms (algae, bacteria, protozoa) and to the 
surface cells of higher plants and animals. It also damages the 
genetic material of cells and is responsible for sunburn in human 
skin. The incidence of skin cancer has been statistically corre- 
lated with the observed surface intensity of ultraviolet wave- 
length 290-320 nm, which is not totally absorbed by the ozone 
layer. See OZONE; STRATOSPHERE; ULTRAVIOLET RADIATION (BIOL- 
ocy). 

Ozone also plays an important role in photochemical smog 
and in the purging of trace species from the lower atmosphere. 
Furthermore, it heats the upper atmosphere by absorbing solar 
ultraviolet and visible radiation (A < 710 nm) and thermal in- 
frared radiation (A ~ 9.6 micrometers). As a consequence, the 
temperature increases steadily from about —60°F (220 k) at the 
tropopause (5-10 mi or 8-16 km altitude) to about 45°F (280 k) 
at the stratopause (30 mi or 50 km altitude). This ozone heating 
provides the major energy source for driving the circulation of the 
upper stratosphere and mesosphere. See ATMOSPHERIC GENERAL 
CIRCULATION; TROPOPAUSE. 

Above about 19 mi (30 km), molecular oxygen (Oz) is disso- 
ciated to free oxygen atoms (O) during the daytime by ultravi- 
olet photons, (hv), as shown in reaction (1). The oxygen atoms 
produced then form ozone (O3) by reaction (2), where M is an 


02 +h —> 0+0 A < 242 nm (1) 


O +0 +M — 03 +M (2) 


arbitrary molecule required to conserve energy and momentum 
in the reaction. Ozone has a short lifetime during the day be- 
cause of photodissociation, as shown in reaction (3). However, 


03 + w => 02 +0 A < 710 nm (3) 


except above 54 mi (90 km), where O» begins to become a mi- 
nor component of the atmosphere, reaction (3) does not lead to 
a net destruction of ozone. Instead the O is almost exclusively 
converted back to O; by reaction (2). If the odd oxygen concen- 
tration is defined as the sum of the O3 and O concentrations, 
then odd oxygen is produced by reaction (1). It can be seen that 
reactions (2) and (3) do not affect the odd oxygen concentra- 
tions but merely define the ratio of O to O3. Because the rate 
of reaction (2) decreases with altitude while that for reaction (3) 
increases, most of the odd oxygen below 36 mi (60 km) is in the 
form of O3 while above 36 mi (60 km) it is in the form of O. The 
reaction that is responsible for a small fraction of the odd oxy- 
gen removal rate is shown as reaction (4). A significant fraction 


0+ 03 > 02 + 0» (4) 


of the removal is caused by the presence of chemical radicals 
[such as nitric oxide (NO), chlorine (Cl), bromine (Br), hydro- 
gen (H), or hydroxyl (OH)], which serve to catalyze reaction (4) 
(see illustration). 

The discovery in the mid-1980s of an ozone hole over Antarc- 
tica, which could not be explained by the classic theory of ozone 
and had not been predicted by earlier chemical models, led to 
many speculations concerning the causes of this event, which 
can be observed each year in September and October. As sug- 
gested by experimental and observational evidence, heteroge- 
neous reactions on the surface of liquid or solid particles that 
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Principal chemical cycles in the stratosphere. The destruc- 
tion of ozone is affected by the presence of radicals which are 
produced by photolysis or oxidation of source gases. Chem- 
ical reservoirs are relatively stable but are removed from the 
stratosphere by transport toward the troposphere and rain- 
out. 


produce Cl, HOCI, and CINO; gas, and the subsequent rapid 
photolysis of these molecules, produces chlorine radicals (Cl, 
ClO) which in turn lead to the destruction of ozone in the lower 
stratosphere by a catalytic cycle [reactions (5)-(7)]. Solar radia- 


Cl+ 03 > ClO + 02 (5) 
CIO + CIO > Cly05 (6) 
(07107 + hv — 2Cl + 02 (7) 





tion is needed for these processes to occur. 

Sites on which the reactions producing Clz, HOCI, and CINO2 
can occur are provided by the surface of ice crystals in polar 
stratospheric clouds (PSCs). These clouds are formed between 
8 and 14 mi (12 and 22 km) when the temperature drops below 
approximately —123°F (187 K). Other types of particles are ob- 
served at temperatures above the frost point of —123°F (187 k). 
These particles provide additional surface area for these reac- 
tions to occur. Clouds are observed at high latitudes in winter. 
Because the winter temperatures are typically 20-30°F (10-15 Kk) 
colder in the Antarctic than in the Arctic, their frequency of oc- 
currence is highest in the Southern Hemisphere. Thus, the for- 
mation of the springtime ozone hole over Antarctica is explained 
by the activation of chlorine and the catalytic destruction of O3 
which takes place during September, when the polar regions are 
sunlit but the air is still cold and isolated from midlatitude air 
by a strong polar vortex. Satellite observations made since the 
1970s suggest that total ozone in the Arctic has been abnormally 
low during the 1990s, probably in relation to the exceptionally 
cold winter tempratures in the Arctic lower stratosphere recorded 
during that decade. [G.PB.; R.G.Pr.] 


Strawberry Low-growing perennials, spreading by 
stolons, with fruit consisting of a fleshy receptacle, and “seeds” 
in pits or nearly superficial on the receptacle. The strawberry 
in the United States is derived from two species: Fragaria chi- 
hensis, which grows along the Pacific Coast of North and South 
America, and F uirginiana, the eastern meadow strawberry, 
both members of the order Resales. See ROSALES. 

The strawberry is the most universally grown of the small fruits, 
both in the home garden and in commercial plantings. Home 
garden production is possible in nearly all of the states, pro- 
vided water can be supplied where rainfall is insufficient. Com- 
mercial production is important in probably three-fourths of the 
states. The following states are large producers: Oregon, Cali- 
fornia, Tennessee, Michigan, Louisiana, Washington, Arkansas, 
Kentucky, and New York. See FRUIT. (J.H.C.] 


Stream function In fluid mechanics, a mathematical 
idea which satisfies identically, and therefore eliminates com- 
pletely, the equation of mass conservation. If the flow field con- 
sists of only two space coordinates, for example, x and y, a single 
and very useful stream function y(x, y) will arise. If there are three 
space coordinates, such as (x, y, z), multiple stream functions are 
needed, and the idea becomes much less useful and is much less 
widely employed. 

The stream function not only is mathematically useful but also 
has a vivid physical meaning. Lines of constant w are streamlines 
of the flow; that is, they are everywhere parallel to the local 
velocity vector. No flow can exist normal to a streamline; thus, 
selected y lines can be interpreted as solid boundaries of the 
flow. 

Further, yw is also quantitatively useful. In plane flow, for any 
two points in the flow field, the difference in their stream func- 
tion values represents the volume flow between the points. See 
CREEPING FLOW; FLUID FLOW. [EM.W.] 


Stream gaging The measurement of water discharge in 
streams. Discharge is the rate of movement of the stream’s water 
volume. It is the product of water velocity times cross-sectional 
area of the stream channel. Several techniques have been de- 
veloped for measuring stream discharge; selection of the gaging 
method usually depends on the size of the stream. The most 
accurate methods for measuring stream discharge make use of 
in-stream structures through which the water can be routed, such 
as flumes and weirs. 

A flume is a constructed channel that constricts the flow 
through a control section, the exact dimensions of which are 
known. Through careful hydraulic design and calibration by lab- 
oratory experiments, stream discharge through a flume can be 
determined by simply measuring the water depth (stage) in the 
inlet or constricted sections. Appropriate formulas relate stage to 
discharge for the type of flume used. 

A weir is used in conjunction with a dam in the streambed. 
The weir itself is usually a steel plate attached to the dam that 
has a triangular, rectangular, or trapezoidal notch over which 
the water flows. Hydraulic design and experimentation has led 
to calibration curves and appropriate formulas for many different 
weir designs. To calculate stream discharge through a weir, only 
the water stage in the reservoir created by the dam needs to 
be measured. Stream discharge can be calculated by using the 
appropriate formula that relates stage to discharge for the type 
of weir used. See HYDROLOGY; SURFACE WATER. IT.C.Wi.] 


Stream pollution Biological, or bacteriological, pollu- 
tion in a stream indicated by the presence of the coliform group of 
organisms. While nonpathogenic itself, this group is a measure 
of the potential presence of contaminating organisms. Because 
of temperature, food supply, and predators, the environment 
provided by natural bodies of water is not favorable to the growth 
of pathogenic and coliform organisms. Physical factors, such as 
flocculation and sedimentation, also help remove bacteria. Any 
combination of these factors provides the basis for the biological 
self-purification capacity of natural water bodies. 

Nonpolluted natural waters are usually saturated with dis- 
solved oxygen. They may even be supersaturated because of 
the oxygen released by green water plants under the influence of 
sunlight. When an organic waste is discharged into a stream, the 
dissolved oxygen is utilized by the bacteria in their metabolic pro- 
cesses to oxidize the organic matter. The oxygen is replaced by 
reaeration through the water surface exposed to the atmosphere. 
This replenishment permits the bacteria to continue the oxida- 
tive process in an aerobic environment. In this state, reasonably 
clean appearance, freedom from odors, and normal animal and 
plant life are maintained. 

An increase in the concentration of organic matter stimulates 
the growth of bacteria and increases the rates of oxidation and 
oxygen utilization. If the concentration of the organic pollutant 
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is so great that the bacteria use oxygen more rapidly than it can 
be replaced, only anaerobic bacteria can survive and the sta- 
bilization of organic matter is accomplished in the absence of 
oxygen. Under these conditions, the water becomes unsightly 
and malodorous, and the normal flora and fauna are destroyed. 
Furthermore, anaerobic decomposition proceeds at a slower rate 
than aerobic. For maintenance of satisfactory conditions, mini- 
mal dissolved oxygen concentrations in receiving streams are of 
primary importance. 

Municipal sewage and industrial wastes affect the oxygen con- 
tent of a stream. Cooling water, used in some industrial pro- 
cesses, is characterized by high temperatures, which reduce the 
capacity of water to hold oxygen in solution. Municipal sewage 
requires oxygen for its stabilization by bacteria. Oxygen is uti- 
lized more rapidly than it is replaced by reaeration, resulting in 
the death of the normal aquatic life. Further downstream, as the 
oxygen demands are satisfied, reaeration replenishes the oxygen 
supply. 

Polluted waters are deprived of oxygen by the exertion of the 
biochemical oxygen demand, which is defined as the quantity 
of oxygen required by the bacteria to oxidize the organic mat- 
ter. Factors such as the turbulence of the stream flow, biological 
growths on the stream bed, insufficient nutrients, and inadequate 
bacteria in the river water influence the rate of oxidation in the 
stream as well as the removal of organic matter. 

When a significant portion of the waste is in the suspended 
state, settling of the solids in a slow-moving stream is probable. 
The organic fraction of the sludge deposits decomposes anaer- 
obically, except for the thin surface layer which is subjected to 
aerobic decomposition due to the dissolved oxygen in the overly- 
ing waters. In warm weather, when the anaerobic decomposition 
proceeds at a more rapid rate, gaseous end products, usually car- 
bon dioxide and methane, rise through the supernatant waters. 
The evolution of the gas bubbles may raise sludge particles to 
the water surface. Although this phenomenon may occur while 
the water contains some dissolved oxygen, the more intense ac- 
tion during the summer usually results in depletion of dissolved 
oxygen. 

Water may absorb oxygen from the atmosphere when the 
oxygen in solution falls below saturation. Dissolved oxygen for 
receiving waters is also derived from two other sources: that in 
the receiving water and the waste flow at the point of discharge, 
and that given off by green plants. 

Unpolluted water maintains in solution the maximum quantity 
of dissolved oxygen. The saturation value is a function of tem- 
perature and the concentration of dissolved substances, such as 
chlorides. When oxygen is removed from solution, the deficiency 
is made up by the atmospheric oxygen, which is absorbed at the 
water surface and passes into solution. The oxygen balance ina 
stream is determined by the concentration of organic matter and 
its rate of oxidation, and by the dissolved oxygen concentration 
and the rate of reaeration. See ESTUARINE OCEANOGRAPHY; FRESH- 
WATER ECOSYSTEM; SEWAGE DISPOSAL; WATER POLLUTION. —[D.J.0.] 


Streaming potential 


Stream transport and deposition The sediment 
debris load of streams is a natural corollary to the degradation 
of the landscape by weathering and erosion. Eroded material 
reaches stream channels through rills and minor tributaries, be- 
ing carried by the transporting power of running water and by 
mass movement, that is, by slippage, slides, or creep. The size 
represented may vary from clay to boulders. At any place in the 
stream system the material furnished from places upstream ei- 
ther is carried away or, if there is insufficient transporting ability, 
is accumulated as a depositional feature. The accumulation of 
deposited debris tends toward increased ease of movement, and 
this tends eventually to bring into balance the transporting ability 
of the stream and the debris load to be transported. IL.B.L.] 


Streaming potential The potential which is produced 
when a liquid is forced to flow through a capillary or a porous 
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solid. The streaming potential is one of four related electroki- 
netic phenomena which depend upon the presence of an electri- 
cal double layer at a solid-liquid interface. This electrical double 
layer is made up of ions of one charge type which are fixed to the 
surface of the solid and an equal number of mobile ions of the 
opposite charge which are distributed through the neighboring 
region of the liquid phase. In such a system the movement of 
liquid over the surface of the solid produces an electric current, 
because the flow of liquid causes a displacement of the mobile 
counterions with respect to the fixed charges on the solid sur- 
face. The applied potential necessary to reduce the net flow of 
electricity to zero is the streaming potential. [Q.VW.] 


Streamlining 


Streamlining The contouring of a body to reduce its re- 
sistance (drag) to motion through a fluid. 

For fluids with relatively low viscosity such as water and air, 
effects of viscous friction are confined to a thin layer of fluid on 
the surface termed the boundary layer. Under the influence of 
an increasing pressure, the flow within the boundary layer tends 
to reverse and flow in an upstream direction. Viscosity tends to 
cause the flow to separate from the body surface with consequent 
formation of a region of swirling or eddy flow (termed the body 
wake; illus. a). This eddy formation leads to a reduction in the 
downstream pressure on the body and hence gives rise to a force 
opposite to the body motion, known as pressure drag. See WAKE 
FLOW. 

In general, streamlining in subsonic flow involves the contour- 
ing of the body in such a manner that the wake is reduced and 
hence the pressure drag is reduced. The contouring should pro- 
vide for gradual deceleration to avoid flow separation, that is, 
reduced adverse pressure gradients. These considerations lead 
to the following general rules for subsonic streamlining: The for- 
ward portion of the body should be well rounded, and the body 
should curve back gradually from the forward section to a taper- 
ing aftersection with the avoidance of sharp corners along the 
body surface. These conditions are well illustrated by teardrop 
shapes (illus. b). 

At supersonic speeds the airflow can accommodate sudden 
changes in direction by being compressed or expanded. Where 
this change in direction occurs at the nose of the body, a com- 
pression wave is created, the strength of which depends upon the 
magnitude of the change in flow direction. Lowering the body- 
induced flow angle weakens this compression shock wave. When 
the flow changes direction again at the midpoint of the body, the 
air will expand to follow the shape of the body. This change in 
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Flow about bodies in uniform subsonic flow. (a) Blunt body. 
(b) Streamlined body. 


direction creates expansion waves. At the tail of the body the 
direction changes again, creating another compression or shock 
wave. At each of these shock waves, changes in pressure, den- 
sity, and velocity occur, and in this process energy is lost. This 
loss results in a retarding force known as wave drag. See SHOCK 
WAVE. 

Bodies which are streamlined for supersonic speeds are char- 
acterized by a sharp nose and small flow deflection angles. Be- 
cause the intensity of the shock wave and the drag level is 
dependent upon the magnitude of the change in flow direc- 
tion, the width or thickness of the body should be minimal. See 
BOUNDARY-LAYER FLOW. [A.G.H.; D.M.Bu.] 


Strength of materials A branch of applied mechan- 
ics concerned with the behavior of materials under load, rela- 
tionships between externally applied loads and internal resisting 
forces, and associated deformations. Knowledge of the proper- 
ties of materials and analysis of the forces involved are funda- 
mental to the investigation and design of structures and machine 
elements. See MACHINE DESIGN; STRUCTURAL MATERIALS. 

Investigation of the resistance of a member, dealing with in- 
ternal forces, is called free-body analysis. Determination of the 
distribution and intensity of the internal forces and the associated 
deformations is called stress analysis. See STRESS AND STRAIN. 

A material offers resistance to external load only insofar as the 
component elements can furnish cohesive strength, resistance to 
compaction, and resistance to sliding. The relations developed in 
strength of materials analysis evaluate the tensile, compressive, 
and shear stresses that a material is called upon to resist. The 
most important factors in determining the suitability of a struc- 
tural or machine element for a particular application are strength 
and stiffness. See SHEAR. [J.B.S.] 


Strepsiptera An order of twisted-wing insects that spend 
most of their life cycle as internal parasitoids of other insects. 
The adult male is only a few millimeters in wing span, is free 
living, and has only one pair of full flight wings (the posterior, or 
metathoracic, pair). The front (mesothoracic) wings are reduced 
to narrow, clublike organs that may function as halteres, or flight 
balancers. The male eyes are coarsely faceted and berrylike, and 
the antennae have four to seven segments, with some segments 
having finger- or bladelike extensions. Most adult females are 
immobile, blind, and larviform, and live inside the insect host. 
Rarely, the female is free living, with legs and eyes but no wings. 
More than 600 species of Strepsiptera are known, many of them 
not yet formally described and named. The order’s relationship 
to other insect orders remains uncertain. 

Both male and female begin larval life as mobile first-stage or- 
ganisms with eyes and three pairs of functional legs. These triun- 
guloids eventually attack the immature or adult forms of the host 
and enter their bodies, where they molt to an apodous instar. Two 
subsequent molts result in the divergence of the sexes and the 
differentiation of a hardened forebody that eventually protrudes 
through the host integument; the molted larval integuments are 
not shed but are incorporated in the general puparial wall arid 
the neotenous adult female capsule. Males (and some adult fe- 
males) emerge from the puparium and begin the search for a 
mate, probably through the mediation of airborne pheromones. 

Strepsiptera are found worldwide in temperate and tropical 
habitats. They parasitize insects, mostly of orders Hemiptera and 
Hymenoptera, but also some cockroaches, mantids, orthopter- 
ans, flies, and silverfish. Their effect on the host is variable, 
ranging from reproductive failure to death. See ENDOPTERYGOTA; 
PHEROMONE. [W.L.Bro.] 


Streptococcus A large genus of spherical or ovoid bac- 
teria that are characteristically arranged in pairs or in chains 
resembling strings of beads. Many of the streptococci that con- 
stitute part of the normal flora of the mouth, throat, intestine, 


and skin are harmless commensal forms; other streptococci are 
highly pathogenic. The cells are gram-positive and can grow 
either anaerobically or aerobically, although they cannot uti- 
lize oxygen for metabolic reactions. Glucose and other carbo- 
hydrates serve as sources of carbon and energy for growth. All 
members of the genus lack the enzyme catalase. Streptococci 
can be isolated from humans and other animals. 

Streptococcus pyogenes is well known for its participation in 
many serious infections. It is a common cause of throat infec- 
tion, which may be followed by more serious complications such 
as rheumatic fever, glomerulonephritis, and scarlet fever. Other 
beta-hemolytic streptococci participate in similar types of infec- 
tion, but they are usually not associated with rheumatic fever 
and glomerulonephritis. Group B streptococci, which are usually 
beta-hemolytic, cause serious infections in newborns (such as 
meningitis) as well as in adults. Among the alpha-hemolytic and 
nonhemolytic streptococci, S. pneumoniae is an important cause 
of pneumonia and other respiratory infections. Vaccines that pro- 
tect against infection by the most prevalent capsular serotypes 
are available. The viridans streptococci comprise a number of 
species commonly isolated from the mouth and throat. Although 
normally of low virulence, these streptococci are capable of caus- 
ing serious infections (endocarditis, abcesses). See PNEUMONIA. 

[K.Ru.] 


Streptothricosis An acute or chronic infection of the 
epidermis, caused by the bacterium Dermatophilus congolensis. 
which results in an oozing dermatitis with scab formation. Strep- 
tothricosis includes dermatophilosis, mycotic dermatitis, lumpy 
wool, strawberry foot-rot, and cutaneous streptothricosis— 
diseases having a worldwide distribution and affecting a wide 
variety of species, including humans. 

The infectious form of the bacterium is a coccoid, motile 
zoospore that is released when the skin becomes wet. Thus, 
the disease is closely associated with rainy seasons and wet 
summers. Zoospores lodge on the skin of susceptible animals 
and germinate by producing filaments which penetrate to the 
living epidermis, where the organism proliferates by branching 
mycelial growth. 

Early cutaneous lesion of dermatophilosis in cattle reveals 
small vesicles, papules, and pus formation under hair plaques. 
An oozing dermatitis then appears as the disease progresses and 
exudates coalesce to form scabs, which change to hard crusts 
firmly adherent to the skin. The crusts enlarge and harden, and 
are often devoid of hair. Lesions occur on most areas of sheep, 
but the characteristic lesions in the wooled areas occur as numer- 
ous hard masses of crust or scab scattered irregularly over the 
back, flanks, and upper surface of the neck. Lesion resolution 
has been found to correlate with the presence of immunoglobu- 
lin A-containing plasma cells in the dermis and with the antibody 
levels to D. congolensis at the skin surface of infected sheep and 
cattle. 

No single treatment is considered specific for dermatophilo- 
sis. Some observers claim that topical agents are successful for 
sheep and cattle. A number of systemic antibiotics are effec- 
tive in treating the disease. A combination of streptomycin and 
penicillin has given good therapeutic results in both bovine and 
equine infections. [S.S.S.] 


Stress (psychology) Generally, environmental events 
of a challenging sort as well as the body’s response to such 
events. Of particular interest has been the relationship between 
stress and the body’s adaptation to it on the one hand and the 
body’s susceptibility to disease on the other. Both outcomes in- 
volve behavioral and brain changes as well as psychosomatic 
events, that is, changes in body function arising from the abil- 
ity of the brain to control such function through neural output 
as well as hormones. One problem is that both environmental 
events and bodily responses have been referred to interchange- 
ably as stress. It is preferable to refer to the former as the stressor 


2127 


and the latter as the stress response. The stress response consists 
of a cascade of neural and hormonal events that have short- 
and long-lasting consequences for brain and body alike. A more 
serious issue is how an event is determined to be a stressor. One 
view is to define a stressor as an environmental event causing 
a negative outcome, such as a disease. Another approach is to 
view stressors as virtually any challenge to homeostasis and to 
regard disease processes as a failure of the normal operation 
of adapative mechanisms, which are part of the stress response. 
With either view, it is necessary to include psychological stressors, 
such as fear, that contain implied threats to homeostasis and that 
evoke psychosomatic reactions. These are reactions that involve 
changes in neural and hormonal output caused by psychological 
stress. Psychosomatic reactions may lead to adaptive responses, 
or they may exacerbate disease processes. Whether the empha- 
sis is on adaptation or disease, it is essential to understand the 
processes in the brain that are activated by stressors and that in- 
fluence functions in the body. See HOMEOSTASIS; PSYCHOSOMATIC 
DISORDERS. 

Among the many neurotransmitter systems activated by stress 
is noradrenaline, produced by neurons with cell bodies in the 
brainstem that have vast projections up to the forebrain and 
down the spinal cord. Stressful experiences activate the nora- 
drenergic system and promote release of noradrenaline; severe 
stress leads to depletion of noradrenaline in brain areas such as 
the hypothalamus. This release and depletion of noradrenaline 
stores results in changes at two levels of neuronal function: phos- 
phorylation is triggered by the second-messenger cyclic AMP 
and occurs in the presynaptic and postsynaptic sites where no- 
radrenaline is released and where it also acts; synthesis of new 
protein is induced via actions on the genome. Both processes 
enhance the ability of the brain to form noradrenaline when 
the organism is once again confronted with a stressful situation. 
Other neurotransmitter systems may also show similar adaptive 
changes in response to stressors. See NORADRENERGIC SYSTEM. 

Stress also activates the neurally mediated discharge of 
adrenaline from the adrenal medulla and of hypothalamic hor- 
mones that initiate the neuroendocrine cascade, culminating 
in glucocorticoid release from the adrenal cortex. Thus, the 
activity of neurons triggered by stressful experiences, physical 
trauma, fear, or anger leads to hormone secretion that has effects 
throughout the body. Virtually every organ of the body is affected 
by stress hormones. The hypothalamic hormone (corticotrophin- 
releasing hormone) that triggers the neuroendocrine cascade di- 
rectly stimulates the pituitary to secrete ACTH. In response to 
certain stressors, the hypothalamus also secretes vasopressin and 
oxytocin, which act synergistically with corticotrophin-releasing 
hormone on the pituitary to potentiate the secretion of ACTH. 
Various stressors differ in their ability to promote output of va- 
sopressin and oxytocin, but all stressors stimulate release of 
corticotrophin-releasing hormone. Other hormones involved in 
the stress response include prolactin and thyroid hormone; the 
metabolic hormones insulin, epinephrine, and glucagon; and 
the endogenous opiates endorphin and enkephalin. See ENDor- 
PHINS. 

Of all the hormones in the endocrine cascade initiated by 
stress, the glucocorticoids are the most important because of their 
widespread effects throughout the body and in the brain. The 
brain contains target cells for adrenal glucocorticoids secreted in 
stress, and receptors in these cells are proteins that interact with 
the genome to affect expression of genetic information. Thus, 
the impact of stress-induced activation of the endocrine cascade 
that culminates in glucocorticoid release is the feedback of gluco- 
corticoids on target brain cells. The effect is to alter the structure 
and function of the brain cells over a period of time ranging from 
hours to days. 

In the case of noradrenaline, glucocorticoids have several 
types of feedback effects that modify how the noradrenergic sys- 
tem responds to stress. Glucocorticoids inhibit noradrenaline re- 
lease, and they reduce the second-messenger response of brain 
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structures such as the cerebral cortex to noradrenaline. Gluco- 
corticoid feedback also affects the serotonin system, facilitating 
serotonin formation during stress but at the same time alter- 
ing the levels of several types of serotonin receptors in different 
brain regions, which has the net effect of shifting the balance 
within the serotonergic system. Taken together, evidence points 
to a role of glucocorticoid secretion in leading to restoration of 
homeostatic balance by counteracting the acute neural events 
such as increased activity of noradrenaline and serotonin, which 
are turned on by stressful experiences. Other neurotransmitter 
systems may also respond to glucocorticoid action. Moreover, 
the other hormones activated by stress have effects on the brain 
and body that must be considered. See SEROTONIN. 

In general, stress hormones are protective and adaptive in the 
immediate aftermath of stress, and the organism is more vul- 
nerable to many conditions without them. However, the same 
hormones can promote damage and accelerate pathophysiolog- 
ical changes, such as bone mineral loss, obesity, and cognitive 
impairment, when they are overproduced or not turned off. This 
wear-and-tear on the body has been called allostatic load. It is 
based upon the notion that allostasis is the active process of 
maintaining stability, or homeostasis, through change, and al- 
lostatic load is the almost inevitable cost to the body of doing 
so. 

Stress hormone actions have important effects outside the 
brain on such systems as the immune response. Glucocorticoids 
and catecholamines from sympathetic nerves and the adrenal 
medulla participate in the mobilization and enhancement of im- 
mune function in the aftermath of acute stress. These effects 
improve the body’s defense against pathogens but can exacer- 
bate autoimmune reactions. When they are secreted chronically, 
the stress-related hormones are generally immunosuppressive; 
such effects can be beneficial in the case of an autoimmune 
disease but may compromise defense against a virus or bacte- 
rial infections. At the same time, glucocorticoids are important 
agents for containing the acute-phase response to an infection or 
autoimmune disturbance. In the absence of such containment, 
the organism may die because of the excessive inflammatory 
response. See IMMUNOLOGY. 

Besides affecting the immune response, stressors are believed 
to exacerbate endogenous depressive illness in susceptible in- 
dividuals. Major depressive illness frequently results in elevated 
levels of cortisol in the blood. It is not clear whether the elevated 
cortisol is a cause or strictly a result of the brain changes involved 
in depressive illness. See AFFECTIVE DISORDERS. [B.McE.] 


Stress and strain Related terms defining the intensity 
of internal reactive forces in a deformed body and associated 
unit changes of dimension, shape, or volume caused by exter- 
nally applied forces. Stress is a measure of the internal reaction 
between elementary particles of a material in resisting separa- 
tion, compaction, or sliding that tend to be induced by external 
forces. Total internal resisting forces are resultants of continuously 
distributed normal and parallel forces that are of varying magni- 
tude and direction and are acting on elementary areas through- 
out the material. These forces may be distributed uniformly or 
nonuniformly. Stresses are identified as tensile, compressive, or 
shearing, according to the straining action. 

Strain is a measure of deformation such as (1) linear strain, the 
change of length per unit of linear dimensions; (2) shear strain, 
the angular rotation in radians of an element undergoing change 
of shape by shearing forces; or (3) volumetric strain, the change 
of volume per unit of volume. The strains associated with stress 
are characteristic of the material. Strains completely recoverable 
on removal of stress are called elastic strains. Above a critical 
stress, both elastic and plastic strains exist, and that part remain- 
ing after unloading represents plastic deformation called inelastic 
strain. Inelastic strain reflects internal changes in the crystalline 
structure of the metal. Increase of resistance to continued plastic 
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Stress-strain diagram for a low-carbon steel. AS = change 
in stress; Ae = change in strain; P = force; Ap = area of 
cross section. 


deformation due to more favorable rearrangement of the atomic 
structure is strain hardening. 

A stress-strain diagram is a graphical representation of simul- 
taneous values of stress and strain observed in tests and indicates 
material properties associated with both elastic and inelastic be- 
havior (see illustration). It indicates significant values of stress- 
accompanying changes produced in the internal structure. See 
ELASTICITY; STRENGTH OF MATERIALS. [J.B.S.] 


Strigiformes The owls, an order of nocturnal, worldwide 
predacious birds that are probably most closely related to the 
goatsuckers. The strigiforms are arranged in four families: Ogy- 
goptyngidae (fossil), Protostrigidae (fossil), Tytonidae (barn owls; 
11 species), and Strigidae (typical owls; 135 species). The bay 
owls (Phodilus), occurring in southern Asia and Africa, are some- 
what intermediate between the Tytonidae and the Strigidae and 
are sometimes placed in a separate family; here they are treated 
as a subfamily, Phodilinae, of the Tytonidae. 

Owls are small to medium in size, with soft plumage of somber 
colors. The large head has a facial disk that covers the feathered 
parabolic reflectors of the bird’s acute directional hearing system. 
The eyes are large and capable of sight in very dim light, and 
the bill is strong and hooked. The wings are long and rounded, 
and the flight feathers are fringed for silent flight. The ulna has 
a unique bony arch on the shaft. Their strong legs are short to 
medium in length and terminate in strong feet. Owls are excel- 
lent fliers but walk poorly. Of the four toes on each foot, two 
point forward and two backward and bear strong claws. Prey is 
detected by acute night vision or by directional hearing; owls can 
locate and catch their prey in total darkness. Owls are generally 
nonmigratory and solitary, but some species live in small flocks. 
Courtship takes place at night with a male hooting to a female, 
which answers. A strong pair bond exists between the monoga- 
mous male and female, which usually build their nest in a tree 
cavity. Some species nest on the ground, on cliff ledges, or in 
abandoned crow or hawk nests. The clutch of up to seven eggs 
is incubated by both sexes, and after hatching, the young stay in 
the nest and are cared for by both parents. See AVES. [WJ.B.] 


Stripping The removal of volatile component from a liquid 
by vaporization. The stripping operation is an important step 
in many industrial processes which employ absorption to purify 
gases and to recover valuable components from the vapor phase. 
In such processes, the rich solution from the absorption step must 
be stripped in order to permit recovery of the absorbed solute 
and recycle of the solvent. See GAS ABSORPTION OPERATIONS. 
Stripping may be accomplished by pressure reduction, the 
application of heat, or the use of an inert gas (stripping va- 
por). Many processes employ a combination of all three; that 
is, after absorption at elevated pressure, the solvent is flashed to 


atmospheric pressure, heated, and admitted into a stripping col- 
umn which is provided with a bottom heater (reboiler). Solvent 
vapor generated in the rebolier or inert gas injected at the bot- 
tom of the column serves as stripping vapor which rises coun- 
tercurrently to the downflowing solvent. When steam is used as 
stripping vapor for a system not miscible with water, the process 
is called steam stripping. 

In addition to its use in conjunction with gas absorption, the 
term stripping is also used quite generally in technical fields to 
denote the removal of one or more components from a mixed 
system. Such usage covers (1) the distillation operation which 
takes place in a distilling column in the zone below the feed 
point, (2) the extraction of one or more components from a liquid 
by contact with a solvent liquid, (3) the removal of organic or 
metal coatings from solid surfaces, and (4) the removal of color 
from dyed fabrics. See DISTILLATION; ELECTROPLATING OF METALS; 
SOLVENT EXTRACTION. [ALK] 


Stroboscope An instrument for observing moving bodies 
by making them visible intermittently and thereby giving them 
the optical illusion of being stationary. A stroboscope may op- 
erate by illuminating the object with brilliant flashes of light or 
by imposing an intermittent shutter between the viewer and the 
object. 

Stroboscopes are used to measure the speed of rotation or fre- 
quency of vibration of a mechanical part or system. They have 
the advantage over other instruments of not loading or disturb- 
ing the equipment under test. Mechanical equipment may be 
observed under actual operating conditions with the aid of stro- 
boscopes. Parasitic oscillations, flaws, and unwanted distortion 
at high speeds are readily detected. 

The flashing-light stroboscopes employ gas discharge tubes 
to provide a brilliant light source of very short duration. Tubes 
may vary from neon glow lamps, when very little light output 
is required, to special stroboscope tubes capable of producing 
flashes of several hundred thousand candlepower with a dura- 
tion of only a few millionths of a second. See NEON GLOW LAMP; 
VAPOR LAMP. [A.R.E.] 


Stroboscopic photography Sitroboscopic or “stro- 
be” photography generally refers to pictures of both single and 
multiple exposure taken by flashes of light from electrical dis- 
charges. Originally the term referred to multiple-exposed pho- 
tographs made with a Stroboscopic disk as a shutter. One es- 
sential feature of modern Stroboscopic photography is a short 
exposure time, usually much shorter than can be obtained by a 
mechanical shutter. 

High-speed photography with Stroboscopic light has proved 
to be one of the most powerful research tools for observing fast 
motions in engineering and in science. Likewise, the electrical 
system of producing flashes of light in xenon-filled flash lamps 
is of great utility for studio, candid, and press photography. See 
PHOTOGRAPHY; STROBOSCOPE. [H.E.E.] 


Stromatolite A laminated, microbial structure in carbon- 
ate rocks (limestone and dolomite). Stromatolites are the oldest 
macroscopic evidence of life on Earth, at least 2.5 billion years 
old, and they are still forming in the seas. During the 1.5 bil- 
lion years of Earth history before marine invertebrates appeared, 
stromatolites were the most obvious evidence of life, and they 
occur sporadically throughout the remainder of the geologic 
record. In Missouri and Africa, stromatolite reefs have major ac- 
cumulations of lead, zinc, or copper; and in Montana, New Mex- 
ico, and Oman, stromatolites occur within oil and gas reservoirs. 
For geologists, the shapes of stromatolites are useful indications 
of their environmental conditions, and variations in form and mi- 
crostructure of the laminations may be age-diagnostic in those 
most ancient sedimentary rocks that lack invertebrate fossils. See 
DOLOMITE; LIMESTONE; REFF. 
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Stromatolites are readily recognizable in outcrops by their 
characteristic convex-upward laminated structure. Individual, 
crescent-shaped laminations, which are generally about a mil- 
limeter thick, are grouped together to produce an enormous 
range of shapes and sizes. 

The tiny, filamentous cyanobacteria (blue-green algae) that 
make present-day stromatolites, and similar filaments associated 
with the oldest stromatolites known, are considered one of the 
most successful organisms on Earth. Living stromatolites in the 
Bahamas and Western Australia possess laminations that record 
the episodic trapping and binding of sediment particles by the 
microbial mat. In the modern oceans, stromatolites develop al- 
most exclusively in extreme marine conditions that exclude or 
deter browsing invertebrates and fish from destroying the mi- 
crobial mats and inhibit colonization by competing algae. See 
ALGAE; CYANOBACTERIA. [R.Gi.] 


Strong nuclear interactions 


Stromatoporoidea An extinct order thought to be 
sponges because of its close similarity to the class Sclerospon- 
giae. Generally accepted stromatoporoids first appeared at the 
beginning of the Middle Ordovician and died out at the end of 
the Devonian. The Stromatoporoidea are worldwide in distri- 
bution, most commonly associated with bedded limestones and 
fossil reefs. 

The stromatoporoid skeleton is called a coenosteum, which 
begins development as a thin, encrusting layer of irregular rods 
and plates called the peritheca. Most coenostea are hemispher- 
ical, or thick laminar or lens-shaped structures. However, some 
stromatoporoids are thin and irregularly undulatory and others 
are cylindrical, cone-shaped, branching, or anastomosing. See 
SCLEROSPONGIAE. [J.St.J.] 


Strong nuclear interactions One ofthe fundamen- 
tal physical interactions, which acts between a pair of hadrons. 
Hadrons include the nucleons, that is, neutrons and protons; 
the strange baryons, such as lambda (A^) and sigma (£); the 
mesons, such as pion (x) and rho (p); and the strange meson, 
kaon (K). The nature of the interaction is determined principally 
through observations of the collision of a hadron pair. From 
this it is found that the interaction has a short range of about 
1071 m (107! in.) and is by far the dominant force within this 
range, being much larger than the electromagnetic interaction, 
which is next in magnitude. The strong interaction conserves 
parity and is time-reversal-invariant. See BARYON; HADRON; ME- 
SON; NUCLEON; PARITY (QUANTUM MECHANICS); STRANGE PARTICLES; 
SYMMETRY LAWS (PHYSICS); TIME REVERSAL INVARIANCE. 

The interaction between baryons for distances greater than 
1071 m arises from the exchange of mesons. At relatively large 
distances, single-pion exchange dominates (illus. a). At shorter 
separation distances, the two-pion systems such as the p become 
important (illus. b). The interaction between the strange baryons, 
and between the strange baryons and the nucleons, is moderated 
by kaon exchange. To summarize this description, the interaction 
between baryons in the SU(3) multiplet is the consequence of 
the exchange of SU(3) spin 0 and spin 1 bosons. In a second 
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Interaction between nucleons (a) from exchange of single 
pion, (b) from exchange of p-meson, a two-pion system, 
(c) from exchange of two separate pions with formation of 
excited state of nucleon, and (d) without formation of excited 
state. 
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approximation, the exchange of two pions (illus. c, d), or more 
generally, the exchange of two members of the SU(3) spin 0 and 
spin 1 multiplet, is responsible for a component of the strong 
nuclear interaction. See UNITARY SYMMETRY. 

The range of the interaction generated by these exchanges can 
be calculated by using the formula below, where m is the mass 
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of the exchanged particles, f is Planck’s constant divided by 27, 
and c is the speed of light. According to the above equation, 
the range of the interaction developed when a single pion is 
exchanged (illus. 1a) is equal to 1.4 x 10-4 m (5.5 x 10714 in.), 
while that due to two-pion exchange (illus. 1d) is 0.7 x 1071 m 
(2.8 x 10-14 in.). 

At short separation distances the quark-gluon structure of the 
baryons must be taken into account. The interaction must be 
considered as a property of the six-quark-plus-gluon system. The 
decisive elements are the Pauli principle obeyed by the quarks, 
and the mismatch between the six-quark wave function and the 
two-baryon wave function. Thus, at short distances the interac- 
tion is effectively repulsive or more generally independent of the 
kinetic energy of the baryons at infinite separations. See ELEMEN- 
TARY PARTICLE; EXCLUSION PRINCIPLE; FUNDAMENTAL INTERACTIONS; 
GLUONS; QUARKS. [H.E] 


Strongylida An order of nematodes in which the cephalic 
region may be adorned with three or six labia or the labia may be 
replaced by a corona radiata. All strongylid nematodes are para- 
sitic. The order embraces eight superfamilies: Strongyloidea, Di- 
aphanocephaloidea, Ancylostomatoidea, Trichostrongyloidea, 
Metastrongyloidea, Cosmocercoidea, Oxyuroidea, and Heter- 
akoidea. 

Strongyloidea. The Strongyloidea contain important para- 
sites of reptiles, birds, and mammals. The early larval stages may 
be free-living microbivores, but the adults are always parasitic. 
Three species are important parasites of horses, Strongylus vul- 
garis, S. equinus, and S. edentatus. All three undergo direct life 
cycles; that is, infestations are acquired by ingestion of contam- 
inated food. 

Trichostrongyloidea. The Trichostrongyloidea comprise 
obligate parasites of all vertebrates but fishes. Normally they 
are intestinal parasites, but some are found in the lungs. The 
species are important parasites of sheep, cattle, and goats. The 
adult females lay eggs in the intestinal tract, which are passed 
out with the feces. In the presence of oxygen the eggs hatch 
in a few days. When the larvae are ingested by an appropriate 
host, their protective sheath is lost, and they proceed through 
the fourth larval stage to adulthood in the intestinal tract, where 
they may enter the mucosa. No migration takes place outside 
the gastrointestinal tract. 

Metastrongyloidea. The Metastrongyloidea comprise obli- 
gate parasites of terrestrial and marine mammals, found com- 
monly in the respiratory tract. In their life cycle they utilize both 
paratenic and intermediate hosts, among them a variety of in- 
vertebrates, including earthworms and mollusks. Two impor- 
tant species are Metastrongylus apri (swine lungworm) and An- 
giostrongylus cantonensis (rodent lungworm). 

Heterakoidea. The Heterakoidea are capable of parasitizing 
almost any warm-blooded vertebrate as well as reptiles and am- 
phibians. The species Ascaridia galli is the largest known nema- 
tode parasite of poultry; males are 2-3 in. (50-76 mm) long, and 
females 3-4.5 in. (75-116 mm). [A.R.M.] 

Oxyuroidea. The Oxyuroidea constitute a large group of the 
phylum Nemata. Hosts include terrestrial mammals, birds, rep- 
tiles, amphibians, fishes, insects, and other arthropods. 

The species are small to medium sized and thin bodied. With 
one exception, known life cycles are direct. Typically the eggs 
pass out of the host’s alimentary tract onto the ground, where 
they become fully embryonated and infective. Normally the in- 


fective egg does not hatch until a susceptible animal ingests it. 
The cecum and colon of the host are the typical locations of 
these parasites. Larvae in all stages of development and adults 
occur in the gut. 

The human pinworm, Enterobius vermicularis, is probably the 
most contagious of all helminthic diseases. It is estimated that 
10% of the world’s population suffer from this parasite, the ma- 
jority being children. Indeed, incidence among schoolchildren in 
the cool regions of the world often approaches 100%. Infection 
occurs when eggs are inhaled or ingested. The most common 
method of transmission is from anus to mouth. Because of the 
aerial transmission, this disease is highly contagious. Though 
the infection is seldom serious, the behavioral symptoms are 
disturbing: nail biting, teeth grinding, anal scratching, insomnia, 
nightmares, and even convulsions. Several medical treatments 
are available, but there is often the danger of reinfestation from 
contaminated objects within the household or institution. See 
NEMATA. (J.T.L.] 


Strontianite The mineral form of strontium carbonate, 
usually with some calcium replacing strontium. It characteristi- 
cally occurs in veins with barite or celestite or as masses in certain 
sedimentary rocks. Strontianite is normally prismatic, but it may 
also be massive. It may be colorless or gray with yellow, green, 


or brownish tints. The hardness is 3 on Mohs scale, and the 
specific gravity of 3.76. It occurs at Strontian, Scotland, and in 
Germany, Austria, Mexico, and India and, in the United States, 
in the Strontium Hills of Calfornia. See CARBONATE MINERALS; 
STRONTIUM. [R.LHa.] 


Strontium A chemical element, Sr, atomic number 38, and 
atomic weight 87.62. Strontium is the least abundant of the 
alkaline-earth metals. The crust of the Earth is 0.042% stron- 
tium, making this element as abundant as chlorine and sulfur. 
The main ores are celestite, SrSOxq, and strontianite, SrCO3. See 
ALKALINE-EARTH METALS; PERIODIC TABLE; STRONTIANITE. 

Strontium nitrate is used in pyrotechnics, railroad flares, and 
tracer bullet formulations. Strontium hydroxide forms soaps and 
greases with a number of organic acids which are structurally 
stable, resistant to oxidation and breakdown over a wide tem- 
perature range. 


Properties of strontium 


Property Value 
Atomic number 38 
Atomic weight 87.62 
Isotopes (stable) 84, 86, 87, 88, 90 
Boiling point, °C 1638(?) 
Melting point, °C 704(?) 
Density, g/cm3 at 20°C 2.6 


Strontium is divalent in all its compounds which are, aside 
from the hydroxide, fluoride, and sulfate, quite soluble. Stron- 
tium is a weaker complex former than calcium, giving a few weak 
oxy complexes with tartrates, citrates, and so on. Some physical 
properties of the element are given in the table. [R.ER.] 


Strophanthus A genus of woody climbers of the dogbane 
family (Apocynaceae), natives of tropical Asia and Africa. They 
are the source of arrow poisons. The dried, greenish, ripe seeds 
of Strophanthus hispidus and S. kombe contain the glucoside 
strophanthin, which is much used in treating heart ailments. Stro- 
phanthin acts directly on heart muscle, increasing muscular force. 
It causes the heart to beat more regularly and decreases the pulse 
rate. Strophanthin is a precursor of cortisone, which is used in 
the treatment of arthritis. See GENTIANALES. [PD.St.; E.L.C.] 


Strophomenida Ap extinct order of brachiopods, in the 
subphylum Rhynchonelliformea, that inhabited shallow shelf 
seas of the Paleozoic Era. It contains the greatest taxonomic di- 
versity of all rhynchonelliform brachiopod orders, and its mem- 
bers exhibit a diverse range of morphologies and modes of life. 

Strophomenida contains four suborders: Strophomenidina, 
Chonetidina, Productidina, and Strophalosiidina. A fifth subor- 
der, Orthotetidina, shares characteristics with both strophome- 
nids and primitive orthids. Strophomenids are distinguished 
from all other articulate brachiopods by their concavo-convex 
or plano-convex lateral profiles and the lack of a functional 
pedicle in adults. They also possess strophic (straight) hinge 
lines and variable ornamentation from fine-ribbed early Paleo- 
zoic forms to more elaborate spine-bearing late Paleozoic forms. 
Adult strophomenids were mostly free-living, sessile, epifaunal 
suspension feeders. The concavo-convex lateral profile and elab- 
orate spines were adaptations for living on soft substrates. See 
BRACHIOPODA; RHYNCHONELLIFORMEA. [M.E.P] 


Structural analysis A detailed evaluation intended to 
assure that, for any structure, the deformations will be suffi- 
ciently below allowable values that structural failure will not 
occur. The deformations may be elastic (fully recoverable) or 
inelastic (permanent). They may be small, with an associated 
structural failure that is cosmetic; for example, the deflection of 
a beam supporting a ceiling may cause cracking of the plas- 
ter. They may be large, with an associated structural failure that 
is catastrophic; for example, the buckling of a column or the 
fracture of a tension member causes complete collapse of the 
structure. 

Structural analysis may be performed by tests on the actual 
structure, on a physical model of the structure to some scale, 
or through the use of a mathematical model. Tests on an ac- 
tual structure are performed in those cases where many similar 
structures will be produced, for example, automobile frames, or 
where the cost of a test is justified by the importance and diffi- 
culty of the project, for example, a lunar lander. Physical models 
are sometimes used where subassemblages of major structures 
are to be investigated. The vast majority of analyses, however, 
are on mathematical models, particularly in the field of struc- 
tural engineering which is concerned with large structures such 
as bridges, buildings, and dams. See BRIDGE; BUILDINGS; Dam; 
STRUCTURE (ENGINEERING). 

The advent of the digital computer made it possible to cre- 
ate mathematical models of great sophistication, and almost all 
complex structures are now so analyzed. Programs of such gen- 
erality have been written as to permit the analysis of any struc- 
ture. These programs permit the model of the structure to be 
two- or three-dimensional, elastic or inelastic, and determine the 
response to forces that are static or dynamic. Most of the pro- 
grams utilize the stiffness method, in which the stiffnesses of 
the individual elements are assembled into a stiffness matrix for 
the entire structure, and analysis is performed in which all be- 
havior is assumed to be linearly elastic. See DIGITAL COMPUTER; 
ELASTICITY. 

The structural engineer’s function continues to require train- 
ing and experience in conceptualizing the structure, choosing 
the appropriate model, estimating the loads that will be of im- 
portance, coding the information for the program, and interpret- 
ing the results. The analyst usually enters the process after the 
conceptualization. Most structures consist of assemblies of mem- 
bers connected at joints. While all real members transmit axial, 
torsional, and bending actions, the majority of buildings and 
bridges are analyzed as trusses, beams, and frames with either 
axial or bending forces predominant. See BEAM; ENGINEERING 
DESIGN; STRESS AND STRAIN; STRUCTURAL DESIGN; TRUSS. 

Whether the model selected is detailed or simplified, one ex- 
tremely important part of the analysis consists of the estimate of 
the loads to be resisted. For bridges and buildings, the primary 
vertical loads are gravity loads. These include the weight of the 
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structure itself, and such appurtenances as will be permanent 
in nature. These are referred to as dead loads. The loads to be 
carried, the live loads, may consist of concentrated loads (heavy 
objects occupying little space, for example, a printing press), or 
loads distributed over relatively large areas (such as floor and 
deck coverings). Horizontal loads on buildings are produced by 
wind and by the inertia forces created during earthquakes. In 
seismic analysis, computers are used to simulate the dynamic 
characteristics of the structure. The accelerations actually mea- 
sured during earthquakes are then used to determine the re- 
sponse of the structure. See LOADS, DYNAMIC; LOADS, TRANSVERSE; 
SEISMIC RISK. [G.D.B.R.] 


Structural chemistry 


Structural chemistry Much of chemistry is explain- 
able in terms of the structures of chemical compounds. The 
understanding of these structures hinges very strongly on under- 
standing the electronic configurations of the elements. The union 
of atoms, and therefore the formation of compounds from the 
elements, is associated with interactions among the extranuclear 
electrons of the individual atoms. Electronic interactions among 
atoms may occur in two ways: Electrons may be transferred from 
one atom to another, or they may be shared by two (or more) 
atoms. The first type of interaction is called electrovalence and 
results in the formation of electrically charged monatomic ions. 
The second, covalence, leads to the formation of molecules and 
complex ions. See CHEMICAL BONDING. [D.H.B.] 

In considering structures more complex than those derived 
from simple monoatomic ions, the logical step is to consider sin- 
gle polyhedral aggregates of atoms. In its most precise sense, 
structure is used to denote a knowledge of the bonding dis- 
tances and angles between atoms in chemical compounds and, 
in turn, the geometrical arrangements which they form. These 
atomic arrangements and the associated distances and angles 
serve uniquely as “fingerprints” of these atom spatial config- 
urations, and depend very much on the electronic configura- 
tions around atoms. The chemical combination of neutral atoms 
to produce uncharged species results in molecule formation, 
whereas the similar combination of atoms or ions possessing 
a net charge results in the formation of complex ions. A ba- 
sic understanding of the species formed involves the concept 
of the coordination polyhedron, which allows a simple classi- 
fication of the structures of many polyatomic molecules and 
ions. This type of classification is particularly useful because 
it conveniently explains the packing together of simple chem- 
ical molecules or ions in terms of highly symmetrical polyhe- 
dra. There is an obvious connection between polyhedra and the 
structures found in crystalline solids formed from them. Crys- 
tal formation often involves the linking of convex polyhedra 
by the sharing of corners, edges, or faces, ultimately forming 
space-filling assemblies in which all faces of each polyhedron 
are in contact with faces of other polyhedra. The most important 
simple polyhedrons are the tetrahedron, the trigonal bipyramid, 
the octahedron, the pentagonal bipyramid, and the cube. The 
most commonly observed of these polyhedral configurations 
are the tetrahedron (four faces) and the octahedron (six faces). 

The simplest correlative device which accurately summarizes 
a very large number of structures and enables the chemist to 
predict, with a good chance of success, the geometric array of 
the atoms in a compound of known composition, is based on 
an extreme electrostatic model. This model, or theory, repre- 
sents the bonds in a purely formal way. The central atom is 
considered to be a positive ion having a charge equal to its 
oxidation state. The groups attached to the central atom (the 
ligands) are then treated either as negative ions or as neutral 
dipolar molecules. The principal justification for this approach 
lies in its successful correlation of a vast amount of information. 

A number of significant observations can be made with 
regard to these formulations. There are series of ions, or 
ions and molecules, having the same type of composition, 
differing only in the nature of the central ion and the net charge 
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on the aggregate. Examples are found in the series: NO; , 
CO, BO;*-, CIO3°, S037, PO337; ClO,, S022, PO,°-, 
SiO,*-; AIF,2-, SiF.2-, PFg~. The numbers of atomic nuclei 
and of electrons are the same for all the members of each 
series; consequently, these are called isoelectronic series. Not 
only are the several chemical entities in such series isoelec- 
tronic, but they are usually identical in geometrical structure 
(isostructural). 

It may also be observed that corresponding ions from a 
given vertical family of the periodic table commonly vary in 
coordination number. A useful example is found in N°* and 
P+ which form NO3~ and PO,°-, respectively. In addition, 
some neutral molecules expand their coordination numbers to 
form stable anionic halo complexes, whereas others do not. 
Thus, SiF, reacts with fluoride ion to form SiF,2~, whereas CF, 
does not form a similar complex ion. The most satisfactory 
explanation of these and many related observations is conve- 
niently formulated in terms of the electrostatic model chosen 
here. 

The necessary condition for stability of the coordination poly- 
hedron MA,, requires that the anions A are each in contact with 
the central atom M. As a consequence of this condition, the limit 
of stability of the structure arises in those cases where the anions 
are also mutually in contact. Larger ligands, or anions, would 
not be in contact with the central ion. This relationship is usu- 
ally summarized in terms of the limiting ratio of the radius of the 
cation, ry, to that of the anion, ra, below which the anions would 
no longer be in contact with the cation. 

According to the valence-bond theory, the principal re- 
quirements for the formation of a covalent bond are a pair of 
electrons and suitably oriented electron orbitals on each of the 
atoms being bonded. The geometry of the atoms in the resulting 
coordination polyhedron is correlated with the orientation of 
the orbitals on the central atom. The orbitals used depend on 
the energies of the electrons in them. In general, the order 
of increasing energy of the electron orbitals is (n — 1)d < ns 
< np < nd. It is concluded that a nontransition atom having 
one valence electron will form a covalent bond utilizing an 
s orbital. In those cases where an unshared pair of electrons 
may be assigned to the ns orbital, as many as three equivalent 
bonds may be formed by utilizing the three np orbitals of the 
central atom. Because of the orientation of these p orbitals 
with respect to each other, the three resulting bonds should 
be at 90° to each other. This expectation is nearly realized in 
PH3. In order to account for four or six equivalent bonds, or 
for that matter in order to account for all the remaining poly- 
hedral and polygonal structures, except the angular structure 
for a coordination number of 2 (with two unshared pairs of 
electrons on the central atom), an additional assumption is 
necessary. It is assumed that s and p, s and d, or s, p, and d 
orbitals, are replaced by new orbitals, called hybridized orbitals. 
These hybridized orbitals are derived from the original orbitals 
(mathematically) in such a way that the required number of 
equivalent bonds may be formed. In the simplest case, it is 
shown that s and p may be combined to form two equivalent 
sp hybridized orbitals directed at 180° to each other. Other 
sets of hybridized orbitals have been shown to be appropriate 
to describe the bonding in other structures. See LIGAND FIELD 
THEORY. 

Among inert-gas ions of the first row of eight elements in the 
periodic table, there are four orbitals available for covalent bond 
formation, one 2s and three 2p. Consequently, a maximum of 
four bonds may be formed. This is in general agreement with the 
existence of the tetrahedron as the limiting coordination polyhe- 
dron among these elements, for example, BeFy?~, BFa~, CCla, 
NH,*. Although only Li* deviates from this pattern, having a 
coordination number of 6 in its crystalline halides, these com- 
pounds are best treated as simple electrovalent salts. In keeping 
with the limitation of only four orbitals, the formation of dou- 
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ble or triple bonds between atoms of these elements reduces 
the coordination number of the central atom. Thus, the high- 
est coordination number of a first-row element forming one 
double bond is 3. This is illustrated by the structures below. 
:Cl :Cl :0 
fy ls ang ae . ” 
B=Cl: c=0: N=O: 
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In these and similar examples, the geometric array is determined 
by the formation of three single bonds utilizing sp? hybridized or- 
bitals on the central atom and p orbitals on the ligand. In general, 
the bonds determining the geometry of a molecule or ion (in this 
way) are called o bonds. The double bond results from the su- 
perposition of a second bond, a x bond, between two atoms. In 
this example the formation of a x bond reduces the number of o 
bonds from four to three, thus changing the geometries of the cor- 
responding molecules or ions from tetrahedral to trigonal planar. 
The formation of a second z bond (a triple bond or two 
double bonds) reduces the coordination number of the atom 
in question still further, resulting in the linear sp set of hybridized 

orbitals being utilized in o-bond formation. See VALENCE. 
With regard to the nature of doubly bonded compounds, an- 
other problem arises when such structures are viewed from the 
standpoint of valence-bond theory. In the species BCl, COCH, 
NO-Cl, and many similar substances, nonequivalent bonds are 
predicted. The doubly bonded oxygen should be closer to the 
central carbon atom than the singly bonded ones. This is not 
found to be true experimentally so long as the similar atoms are 
otherwise equivalent. There is only one observable C—O dis- 
tance in carbonate, one N—O distance in nitrate, and so on. To 
account for such facts as these, the concept of resonance must 
be introduced. If the z bond exists, it must exist equally between 
the central atom and all the equivalent oxygen atoms. The res- 
onance method of describing this situation is to say that one of 
the pictorial structures is inadequate to describe the substance 
properly, but that enough pictorial structures (resonance struc- 
tures) should be considered to permute the double bond about 
all the equivalent bonds. The true structure is assumed to be 
something intermediate to all the resonance structures and more 
stable than any of them because it exists in preference to any one 
of them. The resonance structures for CO;~ are the following: 
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See CONJUGATION AND HYPERCONJUGATION; RESONANCE (MOLE- 
CULAR STRUCTURE). 

The classic homologous series of compounds in organic chem- 
istry provide useful examples involving the condensation of poly- 
hedrons containing the same central element in the individual 
units. The general formula, C,H», + 2, represents a large num- 
ber of compounds extending from the lowest member, methane, 
CHg, to polyethylene, a plastic of economic importance in 
which n is a very large number. Two ways exist for the linking 
together of these tetrahedrons. This gives rise to two molec- 
ular forms, both of which are stable, well-known compounds. It is 
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Fig. 1. Geometric structure of propane. 
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Fig. 2. Cis-trans isomerism among olefins. 
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Fig. 3. Benzene molecule. (a) Structural formula. (b) Two 
forms in resonance. 


an essential part of these structures that each —C link is lin- 
ear (because it is merely a o-bond); however, when two car- 
bon atoms are linked to a third, the C—C—C angle is essen- 
tially determined by the bond angle of the central carbon atom 
(that is, the other carbons may be treated as ligands to the 
first (Fig. 1). The other familiar homologous series of organic 
chemistry differ from the saturated hydrocarbons in having at 
least one unique coordination polyhedron of a different type. 
The olefins contain two doubly bonded, or unsaturated, car- 
bon atoms, whose polyhedral structures are trigonal planar, and 
the remainder of the carbons are tetrahedral. As in the case 
of the aliphatic hydrocarbons, the olefins exhibit an isomerism 
which is associated with the branching of the chain structure. 
In addition, the presence of two linked trigonal planar carbon 
atoms and the fact that the polyhedrons cannot rotate about the 
double bond give rise to a different kind of isomerism, called 
cis-trans isomerism (Fig. 2). See BOND ANGLE AND DISTANCE. 
The existence of a predicted isomerism provides one of the 
most important confirmations of the theories of chemical struc- 
ture. In general, the polyhedral view of molecular structure, as 
described here, has been thoroughly verified by the discov- 
ery of the many types of predicted isomerism. The first really 
convincing proof of the tetrahedral structures of saturated car- 
bon atoms involved optical isomerism. See OPTICAL ACTIVITY. 
The aromatic hydrocarbons are characterized by cyclic ar- 
rangements of trigonal planar carbon atoms (Fig. 3a). The highly 
symmetrical nature of the benzene molecule is not fully repre- 
sented by such a structure. The figure indicates the presence 
of three double and three single bonds in the ring. It has been 
shown that the C—C bonds are all the same and, consequently, 
the true structure of the substance must be represented by two 
resonance structures which interchange the single and double 
bonds (Fig. 3b). See BENZENE. [J.M.Wi.] 


Structural connections Methods of joining the indi- 
vidual members of a structure to form a complete assembly. The 
connections furnish supporting reactions and transfer loads from 
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one member to another. Loads are transferred by fasteners (riv- 
ets, bolts) or welding supplemented by suitable arrangements 
of plates, angles, or other structural shapes. When the end of a 
member must be free to rotate, a pinned connection is used. 

The suitability of a connection depends on its deformational 
characteristics as well as its strength. Rotational flexibility or com- 
plete rigidity must be provided according to the degree of end 
restraint assumed in the design. A rigid connection maintains 
the original angles between connected members virtually un- 
changed after loading. Flexible or nonrestraining connections 
permit rotation approximately equal to that at the ends of a sim- 
ply supported beam. Intermediate degrees of restraint are called 
semirigid. 

A commonly used form of connection for rolled-beam sec- 
tions, called a web connection, consists of two angles which are 
attached to opposite sides of a member and which are in turn 
connected to the web of a supporting beam, girder, column, or 
framing at right angles. A shelf angle may be added to facilitate 
erection (Fig. 1). 

A bracket or seat on which the end of the beam rests is a seat 
connection; it is intended to furnish the end reaction of the sup- 
ported beam. Two general types are used: The unstiffened seat 
provides bearing for the beam by a projecting plate or angle leg 
which offers resistance only by its own flexural strength (Fig. 2); 
the stiffened seat is supported by a vertical plate or angle which 
transfers the reaction force to the supporting member without 
flexural distortion of the outstanding seat. 

When the action line of a transferred force does not pass 
through the centroid of the connecting fastener group or welds, 
the connection is subjected to rotational moment which pro- 
duces additional shearing stresses in the connectors. The load 
transmitted by diagonal bracing to a supporting column flange 
through a gusset plate is eccentric with reference to the connect- 
ing fastener group. 


Structural connections 
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Fig. 1. Riveted or bolted web connections. (a) Beam-to- 
ginder, elevation view. (b) Plan view. (c) Bending of angle 
legs. 
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Fig. 2. Unstiffened seat connections. (a) Fastened angle 
seat. (b) Welded angle seat. R = reaction load; e = distance 
to reaction from column face. 





2134 Structural deflections 


In beam-to-column connections and stiffened seat connec- 
tions or when members transfer loads to columns by a gusset 
plate or a bracket, the fasteners are subjected to tension forces 
caused by the eccentric connection. Although there are initial ten- 
sions in the fasteners, the final tension is not appreciably greater 
than the initial tension. 

Rigidity and moment resistance are necessary at the ends of 
beams forming part of a continuous framework which must resist 
lateral and vertical loads. Wind pressures tend to distort a build- 
ing frame, producing bending in the beams and columns which 
must be suitably connected to transfer moment and shear. The 
resisting moment can be furnished by various forms of angle T 
for fasteners or welded or bracket connections. 

Where appreciably angular change between members is ex- 
pected, and in special cases where a hinge support without mo- 
ment resistance is desired, connections are pinned. Many bridge 
trusses and large girder spans have pin supports. See BOLT; JOINT 
(STRUCTURES); RIVET. [J.B.S.] 


Structural deflections The deformations or move- 
ments of a structure and its components, such as beams and 
trusses, from their original positions. It is as important for the 
designer to determine deflections and strains as it is to know the 
stresses caused by loads. See STRESS AND STRAIN. 

Deflections may be computed by any of several methods. Gen- 
erally the computation is based on the assumption that stress is 
proportional to strain. As a result, deflection equations involve 
the modulus of elasticity E, which is a measure of the stiffness of 
a material. 

The relation between deflections at different parts of a structure 
is indicated by Maxwell’slaw of reciprocal deflections. This states 
that if a load P is applied at any point A in any direction a 
and causes a shift of another point B in direction b, the same 
load applied at B in direction b will cause an equal shift of A 
in direction a (see illustration). The law is used in a number of 
ways such as in simplifying deflection calculations, checking the 
accuracy of computations, and producing influence lines. See 
STRUCTURAL ANALYSIS. 
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Example of Maxwell’s law of reciprocal deflections. 


Beam and truss deflections usually are computed by similar 
methods, except that integration is used for equations and sum- 
mation for trusses. Beam deflection equations involve bending 
moments and moments of inertia. Truss deflection equations are 
based on the stresses and cross-sectional areas of chords and web 
members. Deflections may also be determined graphically. See 
BEAM; TRUSS. [J.B.S.] 


Structural design The selection of materials and mem- 
ber type, size, and configuration to carry loads in a safe and 
serviceable fashion. In general, structural design implies the en- 
gineering of stationary objects such as buildings and bridges, or 
objects that may be mobile but have a rigid shape such as ship 
hulls and aircraft frames. Devices with parts planned to move 
with relation to each other (linkages) are generally assigned to 
the area of mechanical design. 

Structural design involves at least five distinct phases of work: 
project requirements, materials, structural scheme, analysis, and 
design. For unusual structures or materials a sixth phase, test- 
ing, should be included. These phases do not proceed in a rigid 
progression, since different materials can be most effective in dif- 


ferent schemes, testing can result in changes to a design, and a 
final design is often reached by starting with a rough estimated 
design, then looping through several cycles of analysis and re- 
design. Often, several alternative designs will prove quite close in 
cost, strength, and serviceability. The structural engineer, owner, 
or end user would then make a selection based on other consid- 
erations. 

Before starting design, the structural engineer must determine 
the criteria for acceptable performance. The loads or forces to 
be resisted must be provided. For specialized structures this may 
be given directly, as when supporting a known piece of machin- 
ery, or a crane of known capacity. For conventional buildings, 
building codes adopted on a municipal, county, or state level 
provide minimum design requirements for live loads (occupants 
and furnishings, snow on roofs, and so on). The engineer will cal- 
culate dead loads (structure and known, permanent intallations) 
during the design process. For the structure to be serviceable 
or useful, deflections must also be kept within limits, since it is 
possible for safe structures to be uncomfortably “bouncy.” Very 
tight deflection limits are set on supports for machinery, since 
beam sag can cause driveshafts to bend, bearings to burn out, 
parts to misalign, and overhead cranes to stall. Beam stiffness 
also affects floor “bounciness,” which can be annoying if not 
controlled. In addition, lateral deflection, sway, or drift of tall 
buildings is often held within approximately height/500 (1/500 
of the building height) to minimize the likelihood of motion dis- 
comfort in occupants of upper floors on windy days. See LOADS, 
DYNAMIC; LOADS, TRANSVERSE. 

Technological advances have created many novel materials 
such as carbon fiber- and boron fiber-reinforced composites, 
which have excellent strength, stiffness, and strenth-to-weight 
properties. However, because of the high cost and difficult or 
unusual fabrication techniques required, glass-reinforced com- 
posites such as fiberglass are more common, but are limited to 
lightly loaded applications. The main materials used in structural 
design are more prosaic and include steel, aluminum, reinforced 
concrete, wood, and masonry. See COMPOSITE MATERIAL; MA- 
SONRY; PRECAST CONCRETE; PRESTRESSED CONCRETE; REINFORCED 
CONCRETE; STRUCTURAL MATERIALS. 

In an actual structure, various forces are experienced by struc- 
tural members, including tension, compression, flexure (bend- 
ing), shear, and torsion (twist). However, the structural scheme 
selected will influence which of these forces occurs most fre- 
quently, and this will influence the process of material selection. 
See SHEAR; TORSION. 

Analysis of structures is required to ensure stability (static equi- 
librium), find the member forces to be resisted, and determine 
deflections. It requires that member configuration, approximate 
member sizes, and material properties be known or assumed. As- 
pects of analysis include: equilibrium; stress, strain, and elastic 
modulus; linearity; plasticity; and curvature and plane sections. 
Various methods are used to complete the analysis. 

Once a structure has been analyzed (by using geometry alone 
if the analysis is determinate, or geometry plus assumed member 
sizes and materials if indeterminate), final design can proceed. 
Deflections and allowable stresses or ultimate strength must be 
checked against criteria provided either by the owner or by the 
governing building codes. Safety at working loads must be cal- 
culated. Several methods are available, and the choice depends 
on the types of materials that will be used. Once a satisfactory 
scheme has been analyzed and designed to be within project 
criteria, the information must be presented for fabrication and 
construction. This is commonly done through drawings, which 
indicate all basic dimensions, materials, member sizes, the antic- 
ipated loads used in design, and anticipated forces to be carried 
through connections. [R.L.T.; L.M.J.] 


Structural geology The branch of geology that deals 
with study and interpretation of deformation of the Earth’s 
crust. Deformation brings about changes in size (dilation), shape 


(distortion), position (translation), or orientation (rotation). Ev- 
idence for the changes caused by deformation are commonly 
implanted into geologic bodies in the form of recognizable struc- 
tures, such as faults and joints, folds and cleavage, and foliation 
and lineation. The geologic record of structures and structural 
relations is best developed and most complicated in mountain 
belts, the most intensely deformed parts of the Earth’s crust. See 
MOUNTAIN SYSTEMS. 

The discipline of structural geology harnesses three interre- 
lated strategies of analysis: descriptive analysis, kinematic anal- 
ysis, and dynamic analysis. Descriptive analysis is concerned 
with recognizing and describing structures and measuring their 
orientations. Kinematic analysis focuses on interpreting the de- 
formational movements responsible for the structures. Dynamic 
analysis interprets deformational movements in terms of forces, 
stresses, and mechanics. The ultimate goal of these interdepen- 
dent approaches is to interpret the physical evolution of crustal 
structures, that is, tectonic analysis. A major emphasis in modern 
structural geology is strain analysis, the quantitative analysis of 
changes in size and shape of geologic bodies, regardless of scale. 

There are many significant practical applications of structural 
geology. An understanding of the descriptive and geometric 
properties of folds and faults, as well as mechanisms of fold- 
ing and faulting, is of vital interest to exploration geologists in 
the petroleum industry. Ore deposits commonly are structurally 
controlled, or structurally disturbed, and for these reasons de- 
tailed structural geologic mapping is an essential component of 
mining exploration. Other applications of structural geology in- 
clude the evaluation of proposals for the disposal of radioactive 
waste in the subsurface, and the targeting of safe sites for dams, 
hospitals, and the like in regions marked by active faulting. See 
FAULT AND FAULT STRUCTURES; FOLD AND FOLD SYSTEMS; ORE AND 
MINERAL DEPOSITS; PETROLEUM GEOLOGY. [GHD] 


Structural materials Construction materials which, 
because of their ability to withstand external forces, are con- 
sidered in the design of a structural framework. 

Brick is the oldest of all artificial building materials. It is clas- 
sified as face brick, common brick, and glazed brick. Face brick 
is used on the exterior of a wall and varies in color, texture, 
and mechanical perfection. Common brick consists of the kiln 
run of brick and is used behind whatever facing material is em- 
ployed providing necessary wall thickness and additional struc- 
tural strength. Glazed brick is employed largely for interiors 
where beauty, ease of cleaning, and sanitation are primary con- 
siderations. See BRICK. 

Structural clay tiles are burned-clay masonry units having inte- 
rior hollow spaces termed cells. Such tile is widely used because 
of its strength, light weight, and insulating and fire-protection 
qualities. See TILE. 

Architectural terra-cotta is a burned-clay material used for 
decorative purposes. The shapes are molded either by hand in 
plaster-of-paris molds or by machine, using the stiff-mud process. 

Building stones generally used are limestone, sandstone, gran- 
ite, and marble. Until the advent of steel and concrete, stone was 
the most important building material. Its principal use now is asa 
decorative material because of its beauty, dignity, and durability. 
See GRANITE; LIMESTONE; MARBLE; SANDSTONE; STONE AND STONE 
PRODUCTS. 

Concrete is a mixture of cement, mineral aggregate, and water, 
which, if combined in proper proportions, form a plastic mixture 
capable of being placed in forms and of hardening through the 
hydration of the cement. See CONCRETE; PRESTRESSED CONCRETE; 
REINFORCED CONCRETE. 

The cellular structure of wood is largely responsible for its basic 
characteristics, unique among the common structural materials. 
When cut into lumber, a tree provides a wide range of material 
which is classified according to use as yard lumber, factory or 
shop lumber, and structural lumber. Laminated lumber is used 
for beams, columns, arch ribs, chord members, and other struc- 
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tural members. Plywood is generally used as a replacement for 
sheathing, or as form lumber for reinforced concrete structures. 
See LUMBER; PLYwooD; WooD ANATOMY; WOOD PRODUCTS. 
Important structural metals are the structural steels, steel cast- 
ings, aluminum alloys, magnesium alloys, and cast and wrought 
iron. Steel castings are used for rocker bearings under the ends of 
large bridges. Shoes and bearing plates are usually cast in carbon 
steel, but rollers are often cast in stainless steel. Aluminum alloys 
are strong, lightweight, and resistant to corrosion. The alloys 
most frequently used are comparable with the structural steels 
in strength. Magnesium alloys are produced as extruded shapes, 
rolled plate, and forgings. The principal structural applications 
are in aircraft, truck bodies, and portable scaffolding. Gray cast 
iron is used as a structural material for columns and column 
bases, bearing plates, stair treads, and railings. Malleable cast 
iron has few structural applications. Wrought iron is used exten- 
sively because of its ability to resist corrosion. It is used for blast 
plates to protect bridges, for solid decks to support ballasted 
roadways, and for trash racks for dams. See ALUMINUM; CAST 
IRON; MAGNESIUM ALLOYS; STEEL; STRUCTURAL STEEL; WROUGHT 
IRON. [C.M.A.] 
Composite materials are engineered materials that contain a 
load-bearing material housed in a relatively weak protective ma- 
trix. A composite material results when two or more materials, 
each having its own, usually different characteristics, are com- 
bined, producing a material with properties superior to its com- 
ponents. The matrix material (metallic, ceramic, or polymeric) 
bonds together the reinforcing materials (whiskers, laminated 
fibers, or woven fabric) and distributes the loading between 
them. See COMPOSITE MATERIAL; CRYSTAL WHISKERS. 
Fiber-reinforced polymers (FRP) are a broad group of com- 
posite materials made of fibers embedded in a polymeric ma- 
trix. Compared to metals, they generally have relatively high 
strength-to-weight ratios and excellent corrosion resistance. They 
can be formed into virtually any shape and size. Glass is by far 
the most used fiber in FRP (glass-FRP), although carbon fiber 
(carbon-FRP) is finding greater application. Although complete 
FRP shapes and structures are possible, the most promising ap- 
plication of FRP in civil engineering is for repairing structures 
or infrastructure. FRP can be used to repair beams, walls, slabs, 
and columns. See POLYMERIC COMPOSITE. [M.Sc.] 


Structural plate 


Structural petrology The study of the structural as- 
pects of rocks, as distinct from the purely chemical and miner- 
alogical studies that are generally emphasized in other branches 
of petrology. The term was originally used synonymously with 
petrofabric analysis, but is sometimes restricted to denote the 
analysis of only microscopic structural and textural features. See 
PETROFABRIC ANALYSIS; PETROGRAPHY. [J.M.Ch.] 


Structural plate A simple rolled steel section used as 
an isolated structural element, as a support of other structural 
elements, or as part of other structural elements. When isolated 
plates are extremely thick, their design is controlled by shear; 
plates of moderate thickness are controlled by bending (with 
some torsion), and very thin plates carry their loads principally 
by tensile membrane action. Although stresses of all types exist 
in all plates, it is usually sufficient to deal with only the most 
significant. Bending is the most common design criterion. 
Plates are commonly used as cover plates on wide-flange 
beams, as the flanges and webs of plate girders, and as the sides 
of tube-shaped beams and columns. In all these cases, serious 
consideration must be given to the fact that the plate may buckle 
when compressed. Fortunately, the plates have edge supports 
in the direction of the stress, so they function as panels rather 
than as beams. Their ratios of length to width are large enough 
that the resistance to local buckling of the plate element de- 
pends upon its width-thickness ratio, practically independent of 
its length. (The length of the overall section is still significant in 
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determining the member’s capacity.) See BEAM; COLUMN; JOINT 
(STRUCTURES); LOADS, TRANSVERSE; STRUCTURAL STEEL. [G.D.Br.] 


Structural steel Steel used in engineering structures, 
usually manufactured by either the open-hearth or the electric- 
furnace process. The exception is carbon-steel plates and shapes 
whose thickness is 7/16 in. (11 mm) or less and which are used 
in structures subject to static loads only. These products may 
be made from acid-Bessemer steel. The physical properties and 
chemical composition are governed by standard specifications 
of the American Society for Testing and Materials (ASTM). Struc- 
tural steel can be fabricated into numerous shapes for various 
construction purposes. See STEEL. [W.G.B.] 


Structure (engineering) An arrangement of de- 
signed components that provides strength and stiffness to a built 
artifact such as a building, bridge, dam, automobile, airplane, or 
missile. The artifact itself is often referred to as a structure, even 
though its primary function is not to support but, for example, to 
house people, contain water, or transport goods. See AIRPLANE; 
AUTOMOBILE; BRIDGE; BUILDINGS; DAM. 

The primary requirements for structures are safety, strength, 
economy, stiffness, durability, robustness, esthetics, and ductility. 
The safety of the structure is paramount, and it is achieved by 
adhering to rules of design contained in standards and codes, 
as well as in exercising strict quality control over all phases of 
planning, design, and construction. The structure is designed to 
be strong enough to support loads due to its own weight, to 
human activity, and to the environment (such as wind, snow, 
earthquakes, ice, or floods). The ability to support loads during 
its intended lifetime ensures that the rate of failure is insignificant 
for practical purposes. The design should provide an economi- 
cal structure within the constraints of all other requirements. The 
structure is designed to be stiff so that under everyday conditions 
of loading and usage it will not deflect or vibrate to an extent that 
is annoying to the occupants or detrimental to its function. The 
materials and details of construction have durability, such that 
the structure will not corrode, deteriorate, or break under the 
effects of weathering and normal usage during its lifetime. A 
structure should be robust enough to withstand intentional or 
unintentional misuse (for example, fire, gas explosion, or colli- 
sion with a vehicle) without totally collapsing. A structural design 
takes into consideration the community’s esthetic sensibilities. 
Ductility is necessary to absorb the energy imparted to the struc- 
ture from dynamic loads such as earthquakes and blasts. See 
CONSTRUCTION ENGINEERING; ENGINEERING DESIGN. 

Common structural materials are wood, masonry, steel, re- 
inforced concrete, aluminum, and fiber-reinforced composites. 
Structures are classified into the categories of frames, plates, and 
shells, frequently incorporating combinations of these. Frames 
consist of “stick” members arranged to form the skeleton on 
which the remainder of the structure is placed. Plated structures 
include roof and floor slabs, vertical shear walls in a multistory 
building, or girders in a bridge. Shells are often used as water 
or gas containers, in roofs of arenas, or in vehicles that trans- 
port gases and liquids. The connections between the various 
elements of a structure are made by bolting, welding or riveting. 
See COMPOSITE MATERIAL; CONCRETE; STEEL; STRUCTURAL MATERI- 
ALS. [T.V.G.] 


Struthioniformes A small order of weak-flying, par- 
tridgelike birds and giant flightless ratite birds found in the south- 
ern continents. Their relationship to other birds is unknown. The 
struthioniforms are characterized by a palaeognathous palate 
which provides freedom of movement between segments of the 
palate and the jaw. For this reason they are frequently placed ina 
separate superorder, the Palaeognathae. However, that distinc- 
tion places too much emphasis on their separation from other 
birds. Contrary to common opinion, no evidence supports the 


concept that the palaeognathous birds are primitive among living 
birds. 

The order Struthioniformes can be divided into three subor- 
ders, Lithornithi, Tinami, and Struthioni, each of which includes 
both fossil and extant representatives. 

The struthioniforms are medium-sized to giant birds. The os- 
triches are the largest extant birds, but some of the moas, ele- 
phant birds, and dromornithids were even larger. The head is 
small; a medium flattened bill is a common feature except in the 
kiwis, which have a long bill used for probing into the ground for 
worms. The wings are reduced in all forms except the lithionithids 
and tinamous, which can still fly, and the plumage is soft. Their 
legs are strong, and all forms run well. The struthioniforms eat 
a variety of foods, especially large fruits and other large food 
items. Breeding is polygamous, with two or more females laying 
eggs in a single nest. The males are responsible for incubation, 
and they assume the major or sole role in caring for the downy 
young, which leave the nest after hatching. See AVES; RATITES. 

[W.J.B.] 


Strychnine alkaloids Alkaloid substances derived 
from the seeds and bark of plants of the genus Strychnos 
(family Loganiaceae). This genus serves as the source of poi- 
sonous, nitrogen-containing plant materials, such as strychnine 
(see structure; R = H). The seeds of the Asian species of Strych- 
nos contain 2-3% alkaloids, of which about half is strychnine 





and the rest is closely related materials; for example, brucine 
(see structure; R = OCHS) is a more highly oxygenated relative. 
Strychnine and brucine are isolated by extraction of basified 
plant residue with chloroform and then, from the chloroform so- 
lution, by dilute sulfuric acid. Precipitation from the dilute acid 
is accomplished with ammonium hydroxide. Strychnine is sepa- 
rated from brucine by fractional crystallization from ethanol. See 
CRYSTALLIZATION; STRYCHNOS. 

At one time strychnine was used as a tonic and a central ner- 
vous system stimulant, but because of its high toxicity (5 mg/kg 
is a lethal dose in the rat) and the availability of more effective 
substances, it no longer has a place in human medicine. [D.D.] 


Strychnos A genus of tropical trees and shrubs belonging 
to the Logania family (Loganiaceae). Strychnos nux-vomica, a 
native of India and Ceylon, is the source of strychnine. The alka- 
loid, strychnine, has been used medicinally in the treatment of 
certain nervous disorders and paralysis. Curare, used by the In- 
dians to poison arrows, is obtained from S. toxifera and S. castel- 
naei in Guiana and Amazonas and from S. tieute in the Sunda 
Islands. Curare paralyzes the motor nerve endings in striated 
muscles and is used in medical practice in cases in which a state 
of extreme muscular relaxation or even immobility is desirable. 
It has become an important drug in the field of anesthesiology. 
See GENTIANALES; STRYCHNINE ALKALOIDS. [PD.St.; E.L.C.] 


Sturgeon Any of 10 species of large fish which comprise 
the family Acipenseridae in the order Acipenseriformes. These 
fish are found in North Temperate Zone waters, where they are 
almost exclusively bottom-living and feed on organisms such as 
mollusks, worms, and larvae. The body has five rows of bony 
plates, of which one is situated dorsally, two laterally, and two 
ventrally. The snout is elongate and there are four barbels on its 
lower surface; the mouth is ventrally located, and in the adult 





Two species of sturgeons found in United States coastal 
waters. (a) Atlantic sturgeon (Acipenser oxyrhynchus). 
(b) Short-nosed sturgeon (A. brevirostrus). 


the jaws lack teeth (see illustration). The skeleton is mainly car- 
tilaginous. 

Sturgeons are known primarily for the roe, which is processed 
as caviar. A single female can produce millions of eggs, which 
may be removed from dead fish or stripped from living fish. The 
smoked flesh of the sturgeon is a delicacy. See ACIPENSERIFORMES. 

[C.B.C.] 


Stylasterina An order of the class Hydrozoa of the phy- 
lum Coelenterata, including several brightly colored branching 
or encrusting “corals” of warm seas. (True corals belong to a dif- 
ferent class, the Anthozoa). The calcareous skeleton is covered 
by living tissue and is penetrated by ramifying tubes. Nutritive 
polyps, the gastrozooids, lie in cups on one surface or along cer- 
tain edges of the skeletal substance. A spine, or style, at the base 
of each cup gives the order its name. 

Some authorities combine the Stylasterina with the Millepo- 
rina in a single order, the Hydrocorallina. See ANTHOZOA; Hy- 
DROZOA. [S.Cr.] 


Stylolites Irregular surfaces occurring in certain rocks, 
mostly parallel to bedding planes, in which small toothlike pro- 
jections on one side of the surface fit into cavities of like shape 
on the other side (see illustration). Stylolites are most common 

















Stylolite in limestone. 


in limestones and dolomites but are also present in many other 
kinds of rock, including sandstones, gypsum beds, and cherts. 
See DOLOMITE; LIMESTONE; SEDIMENTARY ROCKS. [R.Si.] 


Stylommatophora A superorder of the molluscan sub- 
class Pulmonata containing about 20,500 species that are 
grouped into 56 families. Nearly all land snails without an oper- 
culum are stylommatophorans. They have eyespots on the tips 
of a pair of retractile tentacles, hermaphroditic reproduction with 
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partial fusion of the male and female system, and, normally, a 
second pair of retractile tentacles that function as chemorecep- 
tors. All are air-breathing and most are truly terrestrial. Three 
orders, based on excretory structures, are recognized. The more 
primitive Orthurethra and Mesurethra have less efficient water 
conservation devices than do the more specialized members of 
the order Sigmurethra, in which a closed ureter functions in water 
conservation. The latter specialization apparently is a prerequi- 
site for evolution toward a sluglike structure, since all 16 families 
with slugs or sluglike taxa belong to the Sigmurethra. See PuL- 
MONATA. [G.A.S.] 


Subduction zones 


Styrene A colorless, liquid hydrocarbon with the formula 
CsH5sCH=CH3. It boils at 145.2°C (293.4°F) and freezes at 
—30.6°C (—23.1°F). The ethylenic linkage of styrene readily un- 
dergoes addition reactions and under the influence of light, heat, 
or catalysts undergoes self-addition or polymerization to yield 
polystyrene. 

The majority of the styrene used is converted into polystyrene, 
but other thermoplastic or even thermosetting resins are 
prepared from styrene by copolymerization with suitable 
comonomers. A smaller quantity of styrene goes into the manu- 
facture of elastomers or synthetic rubbers. 

Styrene is a skin irritant. Prolonged breathing of air containing 
more than 400 ppm of styrene vapor may be injurious to health. 
See POLYMERIZATION; POLYSTYRENE RESIN. [C.K.B.] 


Subduction zones Regions where portions of the 
Earth’s tectonic plates are diving beneath other plates, into the 
Earth’s interior. Subduction zones are defined by deep oceanic 
trenches, lines of volcanoes parallel to the trenches, and zones 
of large earthquakes that extend from the trenches landward. 

Plate tectonic theory recognizes that the Earth’ssurface is com- 
posed of a mosaic of interacting lithospheric plates, with the litho- 
sphere consisting of the crust (continental or oceanic) and asso- 
ciated underlying mantle, for a total thickness of about 100 km 
(60 mi). Oceanic lithosphere is created by sea-floor spreading at 
mid-ocean ridges (divergent, or accretionary, plate boundaries) 
and destroyed at subduction zones (at convergent, or destructive, 
plate boundaries). At subduction zones, the oceanic lithosphere 
dives beneath another plate, which may be either oceanic or 
continental. Part of the material on the subducted plate is recy- 
cled back to the surface (by being scraped off the subducting 
plate and accreted to the overriding plate, or by melting and ris- 
ing as magma), and the remainder is mixed back into the Earth’s 
deeper mantle. This process balances the creation of lithosphere 
that occurs at the mid-ocean ridge system. The convergence of 
two plates occurs at rates of 1-10 cm/yr (0.44 in./yr) or 10- 
100 km (6-60 mi) per million years (see illustration). 

During subduction, stress and phase changes in the upper 
part of the cold descending plate produce large earthquakes 
in the upper portion of the plate, in a narrow band called the 
Wadati-Benioff zone that can extend as deep as 700 km (420 mi). 
The plate is heated as it descends, and the resulting release of 
water leads to melting of the overlying mantle. This melt rises 
to produce the linear volcanic chains that are one of the most 
striking features of subduction zones. See LITHOSPHERE. 

Subduction zones can be divided in two ways, based either 
on the nature of the crust in the overriding plate or on the age 
of the subducting plate. The first classification yields two broad 
categories: those beneath an oceanic plate, as in the Mariana or 
Tonga trenches, and those beneath a continental plate, as along 
the west coast of South America (see illustration). The first type is 
known as an intraoceanic convergent margin, and the second is 
known as an Andean-type convergent margin. See MID-OCEANIC 
RIDGE; PLATE TECTONICS. 

Active volcanoes are highly visible features of subduction 
zones. The volcanoes that have developed above subduc- 
tion zones in East Asia, Australasia, and the western Ameri- 
cas surround the Pacific Ocean in a so-called Ring of Fire. At 
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Principal subduction zones. Plate names are in all-capitals. Numbers at ar- 
rows indicate velocity of plate movement in kilometers (miles) per million 
years. (After J. Gill, Orogenic Andesites and Plate Tectonics, Springer- 


Verlag, 1981) 


intraoceanic convergent margins, volcanoes may be the only 
component above sea level, leading to the name “island arc.” 
The more general term “volcanic arc” refers to volcanoes built 
on either oceanic or continental crust. See VOLCANO. 

An eventual consequence of subduction is orogeny, or moun- 
tain building. Subduction zones are constantly building new crust 
by the production of volcanic material or the accretion of oceanic 
sediments. However, the development of the greatest mountain 
ranges—the Alps or the Himalayas—occurs not during “normal” 
subduction but during the death of a subduction zone, when it 
becomes clogged with a large continent or volcanic arc. See 
OROGENY. [R.J.Ste.; S.H.B.] 


Subgiant star An evolving star of luminosity class IV. 
Such a star is brighter than the main-sequence dwarfs and fainter 
than the true giants in its spectral class, lying between the two on 
the Hertzsprung-Russell diagram. The classic subgiants fall in a 
small region from class F to K (with effective temperatures rang- 
ing from 7000 to 4000 K or 12,000 to 7000°F). In class G they 
lie at absolute visual magnitude +3 with luminosities about five 
times the solar luminosity. Classic subgiants have masses around 
1.3 times that of the Sun and violate the mass-luminosity rela- 
tion as too bright for their masses. The concept is extended to 
the hot part of the Hertzsprung-Russell diagram from class F to 
O, in which the distinctions between subgiants and neighboring 
dwarfs and giants are much smaller, only a magnitude or less. 
Main-sequence dwarfs run on the fusion of hydrogen to helium 
in their cores. The subgiant stage begins when the core hydrogen 
mass fraction drops to around 0.1, and then continues as the 
fraction goes to zero and the star evolves toward, but not onto, 
the red-giant branch with a contracting helium core. See DWARF 
STAR; GIANT STAR; HERTZSPRUNG-RUSSELL DIAGRAM; SPECTRAL TYPE; 
STAR; STELLAR EVOLUTION. [J.B.Ka.] 


Sublimation The process by which solids are transformed 
directly to the vapor state without passing through the liquid 
phase. Sublimation is of considerable importance in the purifi- 
cation of certain substances such as iodine, naphthalene, and 
sulfur. 

Sublimation is a universal phenomenon exhibited by all solids 
at temperatures below their triple points. For example, it is a com- 
mon experience to observe the disappearance of snow from the 
ground even though the temperature is below the freezing point 
and liquid water is never present. The rate of disappearance 
is low, of course, because the vapor pressure of ice is low be- 
low its triple point. Sublimation is a scientifically and technically 
useful phenomenon, therefore, only when the vapor pressure of 
the solid phase is high enough for the rate of vaporization to 
be rapid. See PHASE EQUILIBRIUM; TRIPLE POINT; VAPOR PRESSURE. 

[N.H.N.] 


Submarine A ship which can operate completely sub- 
merged in the water. The term formerly applied to any ship 
capable of operating completely underwater, but now usually de- 
scribes a ship built for military purposes. The term “submersible” 
usually is applied to small, underwater vehicles that are built for 
research, rescue, commercial work, or pleasure. 

By the end of World War II, antisubmarine warfare had pro- 
gressed significantly by exploiting the limited underwater en- 
durance and speed of the diesel-electric designs of that era. 
The application of nuclear power to submarines after World 
War II reestablished the near-invulnerability of the submarine to 
antisubmarine warfare from surface ships and aircraft. Nuclear 
power depends on nuclear fission rather than the oxidation of 
fossil fuels and thus requires no oxygen source as do diesel en- 
gines, allowing the submarine to operate submerged for very 
long periods. However, advances in submarine technology and 


nonnuclear propulsion cause the nonnuclear submarine to re- 
main highly attractive to the navies of many nations. See SHIP 
NUCLEAR PROPULSION. 

Submarines can be classified by their primary military mis- 
sions. Attack submarines are fast, long-range ships equipped 
with torpedo tubes or cruise missile launch tubes. They carry 
sensitive underwater sound receivers and transmitters (sonar) 
used to detect enemy submarines. They may be armed with tor- 
pedoes of various kinds, cruise missiles, mines, and equipment 
for deployment of small units of clandestine troops. 

Ballistic-missile submarines carry long-range missiles fitted 
with nuclear warheads that can be launched while submerged. 
The submarine can remain submerged and undetected for many 
days and, on command, launch missiles on any target within 
range. The missiles are stowed in and launched from vertical 
tubes. See MISSILE. 

Experimental submarines are occasionally built to test new 
designs of hull shape, deeper depth capability, power plants, or 
controls. 

Submersibles are usually small, deep-diving vehicles. Their 
use is for exploration and study of the ocean depths, develop- 
ment of equipment, rescue, or commercial work. Some designs 
take advantage of the forces of gravity and buoyancy for ver- 
tical motion. Other designs use vertically oriented propellers to 
propel the craft up and down. Movement is restricted to short 
distances and slow speed because of small size and small battery 
capacity. See UNDERWATER VEHICLE. 

Compared with surface ships, the submarine has features that 
enable it to submerge and resist great sea pressure. Submarines 
have a pressure hull and a nonpressure hull. The pressure hull 
is the watertight, pressure-proof envelope in which equipment 
operates and the officers and crew live. In certain areas of the 
submarine there is a nonpressure hull of lighter structure, forming 
the main ballast tanks. A nonwatertight superstructure provides 
a smooth, fair envelope to cover pipes, valves, and fittings on 
top of the hull. Above the superstructure the fairwater similarly 
encloses the bridge, the periscope, and multiple mast supports. 
See PERISCOPE; SHIP DESIGN. 

The principal means of detecting the presence of a submerged 
submarine is to listen for sounds which may have been gener- 
ated on board or by its movement through the water. Very small 
amounts of acoustic energy can be detected by sophisticated 
sonars. Therefore, modern submarines are designed with multi- 
ple features to greatly reduce the amount of noise they generate. 

[M.S.F; J.H.W.] 


Submarine cable A cable, primarily for communica- 
tions, laid on the ocean floor to provide international links. This 
term is sometimes applied to power cables in water, but these 
are not usually of great length. 

Coaxial cables with 1840 circuits (each having 3-kHz band- 
width) are used extensively for shorter cables. Cables using 
digital transmission over optical fibers (lightguide transmission) 
offer a significant cost-per-channel advantage over coaxial sys- 
tems. Voice signals are transmitted by laser-generated digital light 
pulses over single-mode fiber pairs. See COAXIAL CABLE; COMMU- 
NICATIONS CABLE; MULTIPLEXING AND MULTIPLE ACCESS; TRANSMIS- 
SION LINES. 

SL280 digital lightguide transatlantic submarine cables utilize 
semiconductor laser diodes operating at a wavelength of 1.3- 
micrometers and carries 280 megabits per second on six hair- 
thin fibers, of which two pairs are active and the third is available 
as standby. Each fiber is a glass strand 125 wm in diameter, 
plastic-coated to 250 um, and embedded in an elastomer that is 
part of the unit fiber structure. The fibers are helically wrapped 
around a central kingwire for strength during manufacture and 
an outer sheath of nylon to ensure dimensional stability. This 
structure is then helically wrapped with steel armor wires for 
strength, ensheathed in a copper conductor, and coated with a 
final protective layer of polyethylene insulation (see illustration). 


Submarine canyon 2139 


~ steel king wire 










8 ultrahigh-strength 


2 Pa steel wires 


16 ultrahigh-strength 
J < steel wires 


copper sheath 
polyethylene insulation 


polyethylene insulation 


tar-covered galvanized 


~~ steel wires 


tar-soaked 
nylon yarn 


Ds tar-soaked 


nylon yarn 


Cutaway view of double-armored SL cable. 


The fiber-optic digital repeaters regenerate the signal in its 
original form, unlike the earlier analog repeaters which were 
signal amplifiers. The repeaters detect the optical signal entering 
the repeater, transform it into an electrical signal and amplify it, 
regenerate it by putting it through a high-speed clocked decision 
circuit, and then convert the regenerated electrical signal into 
an optical one by using a 1.3-um laser transmitter. Transmission 
rates of 560 megabits per second are provided in the subsequent 
SL560 design, utilizing lasers operating at 1.5 wm. 

Virtually all analog (coaxial) submarine cable systems were 
configured as a single line from one landing point to another. 
Traffic requirements, however, are not identical for all countries, 
and it was realized to be advantageous to implement a tree con- 
figuration, where the various branches go to different countries. 
Digital fiber-optic technology makes branching possible by ded- 
icating certain fiber pairs to certain subsystems. 

With the increased volume and capacity of fiber-optic sys- 
tems, customers and system owners cannot afford or support 
the potential loss of business and revenue associated with a ca- 
ble failure. To reduce this risk, many systems employ a ring- 
network design architecture. These systems use a self-healing 
loop network with built-in backup and redundancy, which pro- 
vide fiber-on-fiber restoration and connectivity. In the unlikely 
event of a cable failure of any nature, communications traffic 
can be shifted from one fiber cable to another so that digital 
voice, data, or video communications can continue without dis- 
ruption. See LASER; OPTICAL COMMUNICATIONS; OPTICAL FIBERS; 
PULSE MODULATION. [GJS] 


Submarine canyon A steep-sided valley developed on 
the sea floor of the continental slope. Submarine canyons serve 
as major conduits for sediment transport from land and the con- 
tinental shelf to the deep sea. Modern canyons are relatively 
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narrow, deeply incised, steeply walled, often sinuous valleys with 
predominantly V-shaped cross sections. Most canyons originate 
near the continental-shelf break, and generally extend to the base 
of the continental slope. Canyons often occur off the mouths of 
large rivers, such as the Hudson River or the Mississippi River, 
although many other canyons, such as the Bering Canyon in the 
southern Bering Sea, are developed along structural trends. See 
CONTINENTAL MARGIN. 

Modern submarine canyons vary considerably in their dimen- 
sions. The average lengths of canyons has been estimated to 
be about 34 mi (55 km); although the Bering Canyon is more 
than 680 mi (1100 km) long and is the world’s longest subma- 
rine canyon. The shortest canyons are those off the Hawaiian 
Islands, and average about 6 mi (10 km) in length. Submarine 
canyons are characterized by relatively steep gradients. The av- 
erage slope of canyon floors is 309 ft/mi (58 m/km). In general, 
shorter canyons tend to have higher gradients. For example, 
shorter canyons of the Hawaiian group have an average gradi- 
ent of 766 ft/mi (144 m/km), whereas the Bering Canyon has a 
slope of only 42 ft/mi (7.9 m/km). 

In comparison to modern canyons, dimensions of ancient 
canyons are considerably smaller. Deposits of ancient canyons 
are good hydrocarbon reservoirs. This is because submarine 
canyons and channels are often filled with sand that has the 
potential to hold oil and gas. Examples of hydrocarbon-bearing 
canyon-channel reservoirs are present in the subsurface in Cali- 
fornia, Louisiana, and Texas. 

Physical and biological processes that are common in subma- 
rine canyons are mass wasting, turbidity currents, bottom cur- 
rents, and bioerosion, mass wasting and turbidity currents being 
the more important. Mass wasting is a general term used for the 
failure, dislodgement, and downslope movement of sediment 
under the influence of gravity. Common examples of mass wast- 
ing are slides, slumps, and debris flows. Major slumping events 
can lead to formation of submarine canyons. The Mississippi 
Canyon in the Gulf of Mexico is believed to have been formed 
by retrogressive slumping during the late Pleistocene fall in sea 
level and has been partially infilled during the Holocene rise in 
sea level. Turbidity currents are one of the most important ero- 
sional and depositional processes in submarine canyons. There 
is considerable evidence to suggest that turbidity currents flow at 
velocities of 11-110 in./s (28-280 cm/s) in submarine canyons. 
Therefore, turbidity currents play a major role in the erosion of 
canyons. See DEPOSITIONAL SYSTEMS AND ENVIRONMENTS; MARINE 
GEOLOGY; MARINE SEDIMENTS; MASS WASTING; TURBIDITY CURRENT. 

[G.Sh.] 


Submillimeter astronomy 


Submillimeter astronomy Investigation of the uni- 
verse by probing the electromagnetic spectrum at wavelengths 
from approximately 0.3 to 1.0 millimeter: the submillimeter 
waveband. This waveband is bounded at longer wavelengths by 
the millimeter waveband (1-10 mm), and at shorter wavelengths 
by the far-infrared waveband (20-300 micrometers). Astronom- 
ical objects with temperatures between about 10 kelvins and 
several hundred kelvins, typically in the interstellar medium of 
galaxies, emit radiation strongly in the submillimeter waveband. 
See ELECTROMAGNETIC RADIATION. 

The submillimeter waveband is one of the last parts of the elec- 
tromagnetic spectrum to be investigated. This is due both to the 
technical challenge of making sensitive detectors, which must be 
held at temperatures close to absolute zero, and to the effects of 
the Earth’s atmosphere. Atmospheric molecules, primarily water 
vapor, both absorb (dim) the signal from astronomical sources 
and emit their own radiation that acts to mask the astronomical 
signals. The effects are most severe at the shortest wavelengths, 
and only high mountain sites and air or space-borne platforms 
can be used for submillimeter astronomy. From high mountains, 
observations are possible in only three atmospheric windows, at 
wavelengths of about 0.35, 0.45, and 0.85 mm. Radiation of 
0.85 mm penetrates down to about 2 km (1.2 mi) above sea 





James Clerk Maxwell Telescope, 49-ft-diameter (15-m) sub- 
millimeter telescope situated at Mauna Kea, Hawaii. 


level; while on the driest nights, only about one-half of photons 
with wavelengths of 0.35 or 0.45 ym that enter the upper at- 
mosphere can be detected at the 4200-m (14,000-ft) summit of 
Mauna Kea, Hawaii. 

Several submillimeter telescopes operate on high mountain 
tops, including the 15-m (49-ft) James Clerk Maxwell Telescope 
(JCMT; see illustration) and the 10.4-m (34-ft) Caltech Submil- 
limeter Observatory (CSO) telescope, both on Mauna Kea, and 
the 15-m (49-ft) Swedish-ESO Submillimeter Telescope (SEST) 
in Chile. These telescopes look like satellite antennas, but their 
surfaces are smooth and precisely shaped to an accuracy of or- 
der 10 xm. These ground-based telescopes are subject to more 
atmospheric absorption than airborne and space telescopes; but 
because they have larger apertures, they can collect more radi- 
ation, boosting their sensitivity. See TELESCOPE. 

In 2005 the Stratospheric Observatory for Infrared Astronomy 
(SOFIA), a 2.5-m-aperture (8.2-ft) telescope in a 747 aircraft, 
was expected to be operating. Like the former Kuiper Airborne 
Observatory (KAO), SOFIA will fly at up to 14 km (9 mi) altitude. 
Stratospheric balloons, flying at altitudes of about 40 km (25 mi), 
can also be used to carry submillimeter telescopes; an example is 
the BOOMERANG cosmic microwave background experiment 
reported in 2000. See INFRARED ASTRONOMY. 

The signals from several submillimeter telescopes can be com- 
bined together in an interferometer to provide a telescope with 
resolution as fine as one with an aperture as large as the greatest 
separation between the individual telescopes. Examples of this 
type of telescope are the Smithsonian Millimeter Array (SMA) 
on Mauna Kea, and the United States-Europe-Japan Atacama 
Large Millimeter Array (ALMA) under development in Chile. See 
RADIO TELESCOPE. 

Submillimeter observations are used by astronomers with a 
wide range of interests to study matter in the universe at tempera- 
tures of about 10-100 K and the cosmic microwave background 
radiation. Interstellar chemistry in clouds of gas and dust, the 
formation of protostars embedded in stellar nurseries within the 
Milky Way, the properties of the interstellar medium in even 
the most distant galaxies, and the evolution of galaxies are all 
studied in the submillimeter waveband. Submillimeter observa- 
tions are especially important for observations of regions that are 
rich in dust and gas. These are opaque to visible and infrared 
light, but submillimeter light can still escape, allowing these re- 
gions to be studied. [ABL] 


Submillimeter-wave technology The generation, 
detection, and application of radiation in the submillimeter-wave 
region of the electromagnetic spectrum. In practical terms, this 
region corresponds to wavelengths between about 30 microm- 
eters and 3 millimeters (see illustration). At its extremes the re- 
gion overlaps with the microwave and infrared, and a unique 
feature is that the distinct propagation methods of these adja- 
cent regions, guided-wave and free-space propagation, are used 
throughout it. See ELECTROMAGNETIC RADIATION; INFRARED RADIA- 
TION; MICROWAVE. 
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Electromagnetic spectrum in and near the submillimeter- 
wave region, showing its frequency, wavelength, wave num- 
ber, and photon energy equivalents. 


Widespread use of the submillimeter-wave region did not oc- 
cur until two revolutionary developments in the late 1950s and 
early 1960s: (1) the practical realization of Fourier transform 
spectroscopy, which led to significant improvements in the qual- 
ity of broad band submillimeter-wave spectroscopy; and (2) the 
discovery of two reasonably intense submillimeter-wave laser 
sources, the hydrogen cyanide and the water-vapor electrically 
excited lasers. See INFRARED SPECTROSCOPY; LASER. 

Radiation sources. The main application of incoherent ra- 
diation sources is as broad band sources in Fourier transform 
spectrometers. The two most commonly used sources are the 
mercury-vapor lamp and the globar. See MERCURY-VAPOR LAMP; 
MICROWAVE TUBE. 

There are several categories of coherent submillimeter-wave 
radiation sources: (1) nonrelativistic electron-beam tubes such 
as klystrons, extended interaction oscillators, orotrons, leda- 
trons, and carcinotrons or backward-wave oscillators, which os- 
cillate with varying amounts of tunability and output power, 
(2) relativistic electron-beam devices such as gyrotrons, free- 
electron lasers, and synchrotrons; and (3) solid-state devices 
such as Gunn and IMPATT oscillators. With the exception of the 
free-electron laser, these are long-wave devices. The final cat- 
egory, gas lasers, covers the entire submillimeter-wave region. 
See GYROTRON; KLYSTRON; MICROWAVE SOLID-STATE DEVICES; SYN- 
CHROTRON RADIATION. 

Detectors. Most submillimeter-wave detectors are either 
thermal, rectifying, or photon detectors. The first respond to 
heat generated by absorbed radiation, the second by rectifi- 
cation of radiation-field-induced currents, the third by absorp- 
tion of individual photons. They are commonly used in direct 
and heterodyne detection systems. Thermal detectors include 
room-temperature thermopile, Golay cell, and pyroelectric de- 
tectors, while more sensitive semiconductor bolometers operate 
at liquid-helium temperatures. Rectifying devices include metal- 
semiconductor and metal-metal point contacts and Schottky bar- 
rier diodes, all room-temperature devices, and Josephson-type 
devices operated at about 4.2 K (—452°F). Submillimeter-wave 
photon detectors are based on semiconductors and rely on the 
absorption of a photon changing the free-carrier density or mo- 
bility and hence the conductivity of the detector. See BOLOMETER; 
PYROELECTRICITY; RADIOMETRY; SUPERCONDUCTING DEVICES. 

Atmospheric transparency. Between the visible and mi- 
crowave regions the atmosphere is effectively opaque, with the 
exception of two windows in the near-infrared and three in the 
submillimeter-wave region, at 94, 140, and 220 GHz. The sig- 
nificant feature of the submillimeter-wave windows is that the 
levels of attenuation involved are not as seriously increased by 
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adverse atmospheric conditions as those of the near-infrared. 
This makes them attractive for many applications. The attenu- 
ation associated with these windows is, nonetheless, quite high 
and means that ground-level low-frequency submillimeter-wave 
systems have a relatively limited range. This can be advanta- 
geous for some applications. 

Applications. In the past, applications of radiation in the 
submillimeter-wave region were distributed evenly over the 
region and largely concerned research activities. Now, how- 
ever, as a result of developments toward integrated, inexpen- 
sive transmitter and receiver systems, applications include many 
radar- and telecommunications-related activities centered on the 
94-GHz atmospheric window. These applications are mainly mil- 
itary, but spin-offs into the civil area are expected. 

Photon energies in the submillimeter-wave region correspond 
to activation energies of many physical, chemical, and bioloical 
phenomena, and the spectroscopic techniques of this region 
have been widely applied to their study. Other spectroscopic 
applications have concerned the development of radiometric 
methods to meet the requirements of fusion plasma diagnostics 
and astronomy. See SUBMILLIMETER ASTRONOMY. 

The submillimeter-wave region has been extensively used for 
fundamental frequency metrology, relating the frequencies of 
submillimeter-wave gas lasers directly back to the primary fre- 
quency standard. Important consequences have been the defi- 
nition of the speed of light as a fixed quantity and the redefinition 
of the meter as a unit derived from it. See LIGHT; PHYSICAL MEA- 
SUREMENT. 

The submillimeter-wave region impinges on fusion plasma re- 
search in two ways. First, in machines such as tokamaks it pro- 
vides a way of studying the spatial and temporal evolution of 
the electron temperature. The second application is electron cy- 
clotron heating, in which electron heating by microwave radia- 
tion at the electron cyclotron frequency has been demonstrated. 
High-power gyrotron oscillators are generally used as the radi- 
ation sources. See NUCLEAR FUSION; PLASMA (PHYSICS); PLASMA 
DIAGNOSTICS. 

In the study and applications of fast electromagnetic 
pulse propagation, a number of novel techniques for the 
submillimeter-wave region have been developed. The gener- 
ation and detection of such pulses can be done with ultra- 
short optical pulses driving photoconductive switches, or by all- 
electronic systems based on nonlinear transmission lines. The 
submillimeter-wave radiation used in these systems is variously 
known as T waves or terahertz waves. See OPTICAL PULSES. 

[J.R.Bi.] 


Subsonic flight Movement of a vehicle through the at- 
mosphere at a speed appreciably below that of sound waves. 
Subsonic flight extends from zero (hovering) to a speed approx- 
imately 85% of sonic speed corresponding to the ambient tem- 
perature. At higher vehicle velocities the local velocity of air pass- 
ing over the vehicle surface may exceed sonic speed, and true 
subsonic flight no longer exists. 

Vehicle type may range from a small helicopter, which oper- 
ates at all times in the lower range of the velocity scale, to an 
intercontinental ballistic missile, which is operative throughout 
this and other velocity regimes, but is in subsonic flight for only 
a few seconds. The design of each is affected by the same prin- 
ciples of subsonic aerodynamics. Subsonic flow of a fluid such 
as air may be subdivided into a range of velocities in which the 
flow may be considered incompressible without appreciable er- 
ror (below a velocity of approximately 300 mi/h or 135 m/s), 
and a higher range in which the compressible nature of the fluid 
becomes significant. In both cases the viscosity of the fluid is im- 
portant. The theories which apply to compressible, inviscid fluids 
may be used almost without modification in some low-subsonic 
problems, and in other cases the results offered by these theo- 
ries may be modified to account for the effects of viscosity and 


2142 


compressibility. See COMPRESSIBLE FLOW; TRANSONIC FLIGHT; VIS- 
COSITY. 

A typical subsonic wing cross section (airfoil) has a rounded 
front portion (leading edge) and a sharp rear portion (trailing 
edge). Air approaching the leading edge comes to rest at some 
point on the leading edge, with flow above this point proceeding 
around the upper airfoil surface to the trailing edge, and flow be- 
low passing along the lower surface to the same point, where the 
flow again theoretically has zero velocity. The two points of zero 
local velocity are known as stagnation points. If the path from 
front to rear stagnation point is longer along the upper surface 
than along the lower surface, the mean velocity of flow along the 
upper surface must be greater than that along the lower surface. 
Thus, in accordance with the principle of conservation of energy, 
the mean static pressure must be less on the upper surface than 
on the lower surface. This pressure difference, applied to the sur- 
face area with proper regard to force direction, gives a net lifting 
force. Lift is defined as a force perpendicular to the direction 
of fluid flow relative to the body, or more clearly, perpendicu- 
lar to the free-stream velocity vector. See AIRFOIL; BERNOULLI S 
THEOREM. 

The wing, as the lifting device, whether fixed, as in the air- 
plane, or rotating, as in the helicopter, is probably the most im- 
portant aerodynamic part of an aircraft. However, stability and 
control characteristics of the subsonic airplane depend on the 
complete structure. Control is the ability of the airplane to rotate 
about any of the three mutually perpendicular axes meeting at its 
center of gravity. Static stability is the tendency of the airplane to 
return to its original flight attitude when disturbed by a moment 
about any of the axes. [J.E.Ma.] 


Substitution reaction 


Substitution reaction One of a class of chemical re- 
actions in which one atom or group (of atoms) replaces another 
atom or group in the structure of a molecule or ion. Usually, the 
new group takes the same structural position that was occupied 
by the group replaced. 

Substitution reactions involve the attack of a reagent, which 
is the source of the new atom or group, on the substrate, the 
molecule or ion in which the replacement occurs. They involve 
the formation of a new bond and the breaking of an old bond. 
Substitution reactions are classified according to the nature of 
the reagent (electrophilic, nucleophilic, or radical) and according 
to the nature of the site of substitution (saturated carbon atom 
or aromatic carbon atom). See ELECTROPHLIC AND NUCLEOPHILIC 
REAGENTS. 

Systematic names for substitution reactions are composed of 
the parts: name of group introduced + de + name of group 
replaced + ation, with suitable elision or change of vowels for 
euphony. Thus, the replacement of bromine by a methoxy group 
is called methoxydebromination. See ORGANIC REACTION MECHA- 
NISM. [J.FB.] 


Subsynchronous resonance The resonance be- 
tween a series-capacitor-compensated electric system and the 
mechanical spring-mass system of a turbine-generator at sub- 
synchronous frequencies. Beginning about 1950, series capaci- 
tors were installed in long alternating-current transmission lines 
[250 km (150 mi) or more] to cancel part of the inherent in- 
ductive reactance of the line. Until 1971, up to 70% of the 
60-Hz inductive reactance was canceled by series capacitors 
in some long lines with little concern for side effects. (If 70% 
of a line’s inductive reactance is canceled, the line is said to 
have 70% series compensation.) In 1970, and again in 1971, 
a turbine-generator at the Mohave Power Plant in southern 
Nevada experienced shaft damage that required several months 
of repairs on each occasion, following switching events that 
placed the turbine-generator radial on a series-compensated 
transmission line. The shaft damage was due to torsional os- 
cillations between the two ends of the generator-exciter shaft. 
Shortly after the second event, it was determined that the tor- 


sional oscillations were caused by torsional interaction, which 
is a type of subsynchronous resonance. There have been no 
reported occurrences of two other types of subsynchronous res- 
onance, the induction generator effect and torque amplification. 
See ALTERNATING-CURRENT CIRCUIT THEORY; ALTERNATING-CURRENT 
GENERATOR; TRANSMISSION LINES; TURBINE. 

It has been clearly established that subsynchronous resonance 
can be controlled with the use of countermeasures, thus making 
it possible to benefit from the distinct advantages of series ca- 
pacitors. About a dozen countermeasures have been successfully 
applied such that there has been no reported subsynchronous 
resonance event since 1971. Subsynchronous resonance coun- 
termeasures protect turbine-generator shafts from harmful tor- 
sional oscillations by one of two methods. First, the turbine- 
generator can be tripped when a subsynchronous resonance 
condition is detected. This limits the number of torsional os- 
cillations experienced by the turbine-generator shafts. This type 
of countermeasure is relatively inexpensive but is not acceptable 
if the anticipated subsynchronous resonance conditions are ex- 
pected to occur frequently. Generally, such a countermeasure will 
not be applied as the sole subsynchronous resonance protection 
if it is expected to cause a turbine-generator to be tripped more 
than once every 10 years. Three types of tripping countermea- 
sures are applied: torsional motion relay, armature current relay, 
and the generator tripping logic scheme. The second method 
of protection does not involve turbine-generator tripping, but 
eliminates or limits harmful torsional oscillations. See ELECTRIC 
POWER SYSTEMS. [R.G.F] 


Subtraction One of the four fundamental operations of 
arithmetic and algebra. Subtraction is often regarded as an op- 
eration inverse to addition, that is, if a and b are numbers, the 
number a — b is defined as that number which added to b gives a. 
The more modern viewpoint eliminates subtraction completely 
by considering the number a — b as the sum of a and that num- 
ber (denoted by —b) which added to b gives 0. The number 
symbolized by —b is called the inverse of b (with respect to ad- 
dition). Every real number has a unique inverse (the number 0 
is its own inverse). In this sense “subtraction” may be performed 
on objects of many different kinds, and the original numerical 
operation greatly extended. See ADDITION; ALGEBRA; DIVISION; 
MULTIPLICATION; NUMBER THEORY. [L.M.BL] 


Subway engineering The branch of transportation en- 
gineering that deals with feasibility study, planning, design, con- 
struction, and operation of subway (underground railway) sys- 
tems. In addition to providing rapid and comfortable service, 
subways consume less energy per passenger carried in compari- 
son with other modes of transportation such as automobiles and 
buses. They have been adopted in many cities as a primary mode 
of transportation to reduce traffic congestion and air pollution. 

Subways are designed for short trips with frequent stops, com- 
pared to above-ground, intercity railways. Many factors consid- 
ered in the planning process of subway systems are quite similar 
to those for railway systems. Subway system planning starts with 
a corridor study, which includes a forecast of ridership and rev- 
enues, an estimation of construction and operational costs, and 
a projection of the potential benefits from land development. 
See RAILROAD ENGINEERING; TRANSPORTATION ENGINEERING. 

All subway systems have three major types of structures: sta- 
tions, tunnels, and depots. The most important task in planning 
a new subway system or a new subway line is to locate sta- 
tions and depots and to determine the track alignment. Subway 
lines are normally located within the right-of-way of public roads 
and as far away as possible from private properties and sites 
of importance. Because stations and entrances are usually lo- 
cated in densely populated areas, land acquisition is often a ma- 
jor problem. One solution is to integrate entrances into nearby 
developments such as parks, department stores, and public 


buildings, which lessens the visual impact of the entrances and 
reduces their impediment to pedestrian flow. 

Design of the permanent works includes structural and ar- 
chitectural elements and electrical and mechanical facilities. 
There are two types of structures: stations and tunnels. For sta- 
tions, space optimization and passenger flow are important. The 
major elements in a typical station are rails, platform, staircases, 
and escalators. For handicapped passengers, provisions should 
be made for the movement of wheelchairs in elevators and at 
fare gates, and special tiles should be available to guide the blind 
to platforms. 

In both stations and tunnels, ventilation is essential for the 
comfort of the passengers and for removing smoke during a fire. 
Sufficient staircases are required for passengers to escape from 
the station platform to a point of safety in case of a fire. The elec- 
trical and mechanical facilities include the rolling stock, signal- 
ing, communication, power supply, automated fare collection, 
and environmental control (air-conditioning) systems. Corrosion 
has caused problems to structures in some subways; therefore, 
corrosion-resistant coatings may be required. To minimize noise 
and vibration from running trains, floating slabs can be used 
under rails or building foundations in sections of routes cross- 
ing densely populated areas and in commercial districts where 
vibration and secondary airborne noise inside buildings are un- 
acceptable. See VENTILATION. 

Underground stations are normally constructed by using an 
open-cut method. For open cuts in soft ground, the sides of the 
pits are normally retained by wall members and braced using 
struts. The pits are fitted with decks for maintaining traffic at the 
surface. For new lines that pass under existing lines, it is not 
possible to have open cuts. In such cases, stations have to be 
constructed using mining methods (underground excavation). 
See TUNNEL. 

Many modern subway systems are fully automated and re- 
quire only a minimal staff. Train movements are monitored and 
regulated by computers in a control center. Therefore, engineer- 
ing is limited to the function and maintenance of the electrical 
and mechanical facilities. The electrical and mechanical devices 
requiring constant care include the rolling stock, signaling, com- 
munication and broadcasting systems, power supply, elevators 
and escalators, automated fare collection, and environmental 
control systems. Also included are depot facilities, and station 
and tunnel service facilities. See ELECTRIC DISTRIBUTION SYSTEMS; 
RAILROAD CONTROL SYSTEMS. [Z.C.M.; R.N.Hw.] 


Sucrose An oligosaccharide, a-D-glucopyranosyl-f-p- 
fructofuranoside, also known as saccharose, cane sugar, or beet 
sugar. The structure is shown below. Sucrose is very soluble in 


CH20H 
H |o H HOCH, 0. H 
OH H> H HO 
HO 2 CH20H 
H OH OH H 


water and crystallizes from the medium in the anhydrous form. 
The sugar occurs universally throughout the plant kingdom 
in fruits, seeds, flowers, and roots of plants. Honey consists 
principally of sucrose and its hydrolysis products. Sugarcane 
and sugarbeets are the chief sources for the preparation of 
sucrose on a large scale. Another source of commercial interest 
is the sap of maple trees. See OLIGOSACCHARIDE; SUGARBEET; 
SUGARCANE. [W.Z.H.] 


Suctoria A small specialized subclass of the protozoan class 
Ciliatea whose members were long considered entirely sepa- 
rate from the “true” ciliates. The sole order of this subclass is 
Suctorida. These forms show a number of highly specialized 
features. Most conspicuous are their tentacles, often numerous, 
which serve as mouths. These multiple organelles of ingestion 


Sudden infant death syndrome (SIDS) 2143 





Endogenous budding in the suctorian Podophrya, a species 
which measure 10-28 micrometers. 


fasten to the pellicle of prey organisms, generally passing ciliates. 
By forces not entirely understood, the tentacles are used to suck 
out the prey’s protoplasm to provide sustenance for the suc- 
torian. Nearly all species are stalked, and the sedentary, mature 
forms are devoid of any external ciliature. Young larval forms are 
produced by both endogenous and exogenous budding. These 
forms bear locomotor cilia and serve, as in the case of species of 
the Peritrichia, for dissemination (see illustration). See CILIATEA; 
PERITRICHIA. [J.0.C.] 


Sudangrass _ An annual, warm-season grass of tropical ori- 
gin said to have been grown in Egypt since early times, though 
its value was first recognized in Sudan only in 1909. In that same 
year it was introduced into the United States as a replacement 
for johnsongrass, which had become a noxious weed in many 
southern states. Sudangrass (Sorghum bicolor var. sudanense, 
also called S. sudanense and S. vulgare var. sudanense) and 
its hybrids are commonly used as pasture, greenchop, silage, or 
hay. They fill an important need in many regions of the United 
States, because they produce high-quality forage for cattle and 
sheep during the summer, when other pasture is in short sup- 
ply or of low quality. Many of the varieties and hybrids produce 
forage until frost. See GRASS. [M.R.G.] 


Sudden infant death syndrome (SIDS) The 
sudden and unexpected death of an apparently normal infant 
that remains unexplained after an adequate autopsy. Of a group 
of apparently healthy infants dying suddenly and unexpectedly, 
15% will usually manifest pathologic evidence of a disease pro- 
cess which is sufficient to explain the death. The remaining 85% 
are unexplained and are classified as SIDS. In spite of the prob- 
able heterogeneity of diseases in SIDS cases, the consistent and 
distinctive characteristics of these infant deaths support the no- 
tion that many, if not the majority, represent a single disease 
process. 

The incidence of SIDS in the United States is about 2.0 cases 
per 1000 live births, which makes SIDS the leading cause of 
death between the ages of 1 month and 1 year. Most SIDS deaths 
occur at 2-4 months of age, and about 90% occur by 6 months. 
SIDS is more common in males, prematurely born infants, mul- 
tiple births, and the economically disadvantaged. SIDS is also 
increased in infants of teen-age or smoking mothers and in in- 
fants who have a history of a severe apparent life-threatening 
event, usually accompanied by marked cyanosis or pallor and 
limpness, and absence of breathing. SIDS also occurs more fre- 
quently during winter months. The rate among Native Americans 
is greater than among Blacks, which is greater than among Cau- 
casians; Asians have the lowest rate. While there is slight familial 
clustering of SIDS, there is probably not a genetic predisposition 
to SIDS. 
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The cause of SIDS is unknown; leading hypotheses include 
respiratory, cardiac, and metabolic mechanisms. Much attention 
has been focused on the “apnea hypothesis,” implicating a pri- 
mary respiratory arrest due to chronic or transient insufficiency 
or irregularity of breathing. An imbalance between sympathetic 
and parasympathetic influences on cardiac activity, leading to 
potentially fatal cardiac arrhythmias, is a popular cardiac hy- 
pothesis. 

While there is still no proof that SIDS can be prevented, elec- 
tronic cardiorespiratory monitors have been prescribed for many 
infants in high-risk categories for SIDS. Home monitors are rec- 
ommended only for infants at very high risk for SIDS. See Con- 
GENITAL ANOMALIES; HUMAN GENETICS. [A.St.; K.Wi.; J.G.Br.] 


Sugar Usually, sucrose, the common sugar of commerce. 
This sugar is a disaccharide, C}2H220 4, which is split, as shown 
in the reaction below, by hydrolysis into two monosaccharides, 


Cy2H2201, + H20 — C6H1206 + C6H1206 
Sucrose Water Glucose Fructose 


or simple sugars: glucose (dextrose) and fructose (levulose). Su- 
crose rotates the plane of polarized light to the right, as does 
glucose, but fructose is so strongly levorotatory that it overcomes 
the effect of glucose. Thus mixtures of equal amounts of glucose 
and fructose are levorotatory. The hydrolytic reaction is called 
inversion of sugar, and the product is invert sugar or, simply, 
invert. See CARBOHYDRATE; OPTICAL ACTIVITY. 

Sucrose is widely distributed in nature, having been found in 
all green plants that have been carefully examined for its pres- 
ence. The total quantity of all sugars formed each year has been 
estimated at a colossal 4 x 101! tons (3.6 x 10!! metric tons). 
In spite of its availability in all green plants, sucrose is obtained 
commercially in substantial amounts from only two plants: sug- 
arcane, which supplies about 56% of the world total, and the 
sugarbeet, which provides 44%. 

Cane sugar manufacture. The manufacture of cane sugar 
is usually done in two series of operations. First, raw sugar of 
about 98% purity is produced at a location adjacent to the cane 
fields. The raw sugar is then shipped to refineries, where a purity 
that is close to 100% is achieved. 

Raw cane sugar. The production of raw cane sugar begins 
with growing the cane in tropical or subtropical areas. The cane 
is harvested after a season varying from 7 months in subtropical 
areas to 12-22 months in the tropics. The cane stalks are har- 
vested either mechanically or by heavy hand knives. The trend 
is toward mechanization. The stalks are transported to a mill by 
oxcart, rail, or truck. See SUGARCANE. 

At the mill the stalks are crushed and macerated between 
heavy grooved iron rollers while being sprayed countercurrently 
with water to dilute the residual juice. The expressed juice con- 
tains 95% or more of the sucrose present. The fibrous residue, or 
bagasse, is usually burned under the boilers, although increas- 
ing amounts are being made into paper, insulating board, and 
hardboard as well as furfural, which is a chemical intermediate 
for the synthesis of furan and tetrahydrofuran. 

Cane juice. The cane juice is treated with lime to bring its pH 
to about 8.2. This pH prevents the inversion reaction, which is 
favored by heat and acid and would lower the yield of crystalliz- 
able sugar. The juice is then heated to facilitate the precipitation 
of impurities, which are removed by continuous filtration. 

The purified juice is concentrated by multiple-stage vacuum 
evaporation (usually four or five stages) and, when sufficiently 
concentrated, is boiled to grain or seeded with sucrose crystals 
in a single-stage vacuum pan. Usually three successive crops of 
crystals are grown, cooled, and centrifuged. The final mother 
liquor, which is resistant to further crystallization, is called black- 
strap molasses. It is used principally as a feed for cattle. Relatively 
small amounts are still fermented to produce industrial alcohol 
and rum. 


Raw cane sugar refining. The refining of raw sugar begins 
with dissolution of the molasses which remains in a thin film on 
the crystals in spite of the centrifugation. This step, called affina- 
tion, brings the purity from about 98 to about 99%. The crystals 
are dissolved in hot water and percolated through bone char 
columns to remove color by adsorption. The sucrose is finally 
concentrated by vacuum evaporation, crystallized by seeding, 
centrifuged, and dried. 

A major step forward has been the use of bone char in a contin- 
uous countercurrent manner (Grosvenor patent). The bone char 
is washed, dried, and burned to remove impurities; it is reused 
until it wears out mechanically and is discarded as fines. Even 
the fines have value as fertilizer because of their high calcium 
phosphate content. 

Bone char has been replaced in some refineries by granular 
carbon derived from coal or wood waste. It is used in columns 
and regenerated by a process similar to the one used for bone 
char, which unlike granular carbon is primarily calcium phos- 
phate. Granular carbon is frequently used in combination with 
decolorizing resins to reduce the color of process liquors coming 
from the carbon columns. Pulverized activated vegetable car- 
bon is used in a few instances to remove color from raw sugar 
liquors, but its use is declining. Ion-exchange resins are used to 
remove ash from raw sugar liquors, especially when the liquor is 
to be converted directly into refined liquid sugar without going 
through the crystallization step first. See ION EXCHANGE. 

Beet sugar. In the United States, sugarbeets are grown under 
contract by farmers from seed supplied by a beet sugar company. 
Because sugarbeets, like the other temperate-zone crops, thrive 
best under crop rotation, they are not well adapted to one-crop 
agriculture. See SUGARBEET. 

Beet processing. When beets are delivered to a factory, they 
are washed and sliced, and the slices are extracted countercur- 
rently with hot water to remove the sucrose. The resulting so- 
lution is purified by repeatedly precipitating calcium carbonate, 
calcium sulfite, or both, in it. Colloidal impurities are entangled 
in the growing crystals of precipitate and removed by continu- 
ous filtration. The resulting solution is nearly colorless, and the 
sucrose is concentrated by multiple-effect vacuum evaporation. 
The syrup is seeded, cooled, and centrifuged, and the beet sugar 
crystals are washed with water and dried. 

Beet molasses differs from cane molasses in having a much 
lower content of invert sugar. It is, therefore, relatively stable to 
the action of alkali and, in the United States, is usually treated 
with calcium oxide to yield a precipitate of calcium sucrate. This 
is a mixture of loose chemical aggregates of sucrose and calcium 
oxide which are relatively insoluble in water. The precipitate is 
filtered, washed, and added to the incoming crude sugar syrup, 
where it furnishes calcium for the precipitations of calcium car- 
bonate and sulfite referred to above, which remove impurities. 
Carbon dioxide in the form of flue gas is the other reagent, and 
sulfur dioxide from burning sulfur is used to produce sulfite. Ion 
exclusion processes have been developed for recovery of sucrose 
from beet molasses. 

Beet residue use. The beet tops and extracted slices, as well 
as the molasses, are valuable as feeds. More feed for cattle and 
other ruminants can be produced per acre-year from beets than 
from any other crop widely grown in the United States. This is 
independent of the food energy in the crystallized sucrose, which 
exceeds that available from any other temperate-zone plant. It is 
for these reasons that the densely populated countries of Europe 
have expanded their beet sugar production, in spite of the ready 
availability of cane sugar from the tropics. 

The increased use of nitrogenous fertilizer has resulted in aug- 
menting the protein content of the molasses and other beet by- 
products. 

Nutritional value. Sucrose has, in the past, been attacked 
by some nutritionists on the ground that it provides only “empty 
calories,” without protein, minerals, or vitamins. This argument 
lost much of its force when it was shown that all the vitamins 


and minerals recommended by the National Research Council 
can be obtained by consuming any of a great variety of foods 
in amounts that yield a total of only one-half of the caloric re- 
quirements of an average person. The wide use by the public of 
vitamin supplements has caused some nutritionists to express an 
opposite worry over excessive vitamin consumption. 

In addition to the charges that it provides empty calories, sugar 
has been blamed for such health problems as diabetes mellitus, 
coronary heart disease, dental caries, and obesity. These charges 
and the scientific evidence associated with them have been ex- 
tensively reviewed by a special committee of the Federation of 
American Societies for Experimental Biology. A report of the 
committee findings, published by the Food and Drug Adminis- 
tration in 1976, includes the following: Sucrose is a contributor 
to the formation of dental caries when used at levels and in the 
manner prevailing in 1976; otherwise, there is no clear evidence 
in the available information that demonstrates a hazard to the 
public when sucrose is used at those levels. 

Other sources of sugars. In Hawaii, the pineapple industry 
recovers both sucrose and citric acid from the rinds. The residue 
is fed to cattle. The total quantity of sucrose obtained from waste 
fruits is statistically negligible. On the other hand, a substantial 
fraction of the total sucrose consumed is that naturally present 
in a large number of fruits, vegetables, and nuts. See SUCROSE; 
SUGAR CROPS. 

Lactose. Cow’s milk, on a dry basis, is about 38% lactose or 
milk sugar. When milk is converted into cheese, the lactose re- 
mains in the whey, from which it may easily be isolated and 
purified. Lactose is a disaccharide which is split by hydrolysis 
into glucose and galactose. Lactose is about one-tenth as solu- 
ble in water as sucrose and it is one-sixth to one-half as sweet, 
depending on the concentration. Some of the major uses for 
lactose are as a direct-compression vehicle for the manufacture 
of pharmaceutical tablets and as a diluent for pharmaceuticals 
and synthetic sweeteners. See CHEESE; LACTOSE. 

Starch. Starches can be hydrolyzed either by dilute acid or 
by enzymes. The product of acid hydrolysis varies with time 
and conditions but contains glucose, maltose, maltotriose, mal- 
totetrose, and other sugars up to the dextrins. Only glucose, a 
monosaccharide, is readily isolated. Crystallized as the monohy- 
drate, it is used in foods as a sweetener. See CORN; GLUCOSE. 

Syrups high in maltose, a disaccharide, can be obtained by 
the action of amylases on starch. This hydrolysis has been of 
great importance for thousands of years in splitting starches for 
alcoholic fermentation. As yet, there is no large-scale production 
of pure maltose. See MALTOSE. 

A major type of syrup was developed from starch hydrolyzates 
by the mid-1980s. An enzyme that isomerizes dextrose (glucose) 
was commercially developed to manufacture hydrolyzed starch 
syrups (corn syrups) containing varying amounts of fructose (le- 
vulose). The enzyme isomerizes dextrose into fructose theoret- 
ically to a 50-50 mixture of dextrose and fructose. In practice, 
the isomerization is allowed to progress until a syrup containing 
(on a solids basis) approximately 50% dextrose, 42% fructose, 
and 8% higher saccharide is achieved. Syrups containing more 
than 42% fructose (solid basis) can be made by subjecting the 
42% fructose syrup to an ion-exclusion process that separates 
dextrose and fructose. The fructose-rich fraction from the sepa- 
ration is converted into a finished product, and the dextrose-rich 
fraction is returned to the isomerization process. Commercially 
produced syrups contain 42% or 55% fructose. The combined 
processes are also used to manufacture commercial fructose. The 
high-fructose corn syrups are competitive with sucrose and in- 
vert sugar in a variety of food products. The most widely used 
application for 55% high-fructose corn syrup is in soft drinks, 
displacing sucrose and invert sugar. See FRUCTOSE; STARCH. 

Maple sugar. Before America was settled by Europeans, In- 
dians were collecting and concentrating the juice of the hard 
maple (Acer saccharum), making maple syrup. The practice was 
copied by the new settlers, and production of maple syrup has 
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been an industry ever since in the regions where hard maples 
are common, principally the northeastern United States. 

The maple flavor does not exist in the sap but is developed 
by heating it. By additional heating at about 250°F (120°C), a 
flavor four or five times more intense can be developed. Maple 
syrup so produced is of special value for adding flavor to the 
less expensive products of the sucrose industry. Maple sugar is 
sucrose of about 95-98% purity; the characteristic flavor makes 
up only asmall percent. Fairly satisfactory imitation maple flavors 
are available. 

Honey. Honey is a form of relatively pure invert sugar dis- 
solved in water to form a concentrated solution. However, honey 
also contains flavors derived from the nectar of the flowers from 
which it was obtained by the bee. Nutritionally, it is nearly equiv- 
alent in invert sugar but contains an excess of fructose over glu- 
cose. The sucrose found in the flowers is inverted by the enzyme 
honey invertase. Tupelo honey is remarkable in containing about 
twice as much fructose as glucose, and hence has little tendency 
to deposit glucose crystals. 

The ready availability of the food energy in honey was known 
to athletes in ancient Greece. Only in recent times has it been 
discovered that, paradoxically, the energy of sucrose is still more 
quickly available. The flavors of various honeys run a wide 
gamut. That from the Mount Hymettus region, which is flavored 
by wild thyme, has been known and treasured since the poems 
of Homer. 

Molasses. Virtually all molasses is distributed in the form of a 
concentrated viscous solution, but it can be reduced to a pow- 
der by means of spray-drying. It can then be handled without an 
investment by the consumer in tanks, pipes, and pumps. Unfor- 
tunately, later contact with moisture converts the dried molasses 
to a gummy mass. The availability of vaporproof bags (for ex- 
ample, those lined with polyethylene) has provided one solution 
to this problem. There are also various additives which, when 
mixed with molasses, reduce its tendency to pick up moisture. So 
far, dried molasses has made little headway against the practice 
of handling concentrated solutions. 

Syrups. Syrups are relatively concentrated, somewhat vis- 
cous, solutions of various sugars, frequently in admixture to 
hinder crystallization. Large amounts of sucrose have been dis- 
tributed in the form of high-purity syrups. These syrups consist of 
sucrose, invert sugar, or both. In some instances syrups include 
mixtures with dextrose and various corn syrups. The so-called 
liquid sugars are important for several reasons: the requirement 
for the use of syrups in a number of food processes (conver- 
sion of granular sucrose to a syrup was eliminated at the point 
of use); economy of handling, being moved with pumps, pipes, 
tank trucks, and tank cars; the high degree of sanitation inher- 
ent in the storage and distribution of sugar within the plant in a 
closed system, including the elimination of dust from the dump- 
ing of bags; and the availability of a combination of sugars with 
differing characteristics in a single product. 

The quantity of sucrose distributed in syrup form has been 
reduced drastically because of the inroads made by high-fructose 
corn syrups, which have a composition somewhat comparable to 
that of invert sugar. These products are available only as syrups. 
They do not have taste characteristics identical to sucrose and 
invert sugar syrups, but lower prices catalyze their substitution 
for sucrose. 

It is the aim of the sugar refiner and the beet processor to 
eliminate color and all flavors other than sweetness. The man- 
ufacture of table syrups, which are widely used on waffles and 
pancakes, aims at a broader spectrum of flavor. Corn syrup, 
which is somewhat lacking in sweetness, ordinarily has about 
15% sucrose added. The high viscosity of the corn syrup, result- 
ing from the content of dextrins, tends to hinder crystallization 
and is an advantage in the manufacture of certain candies. 

Some sugar refiners manufacture so-called refiners’ syrups for 
use in the manufacture of foods and table syrups. In addition to 
these syrups, a group of products generally referred to as cane 
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syrup and edible molasses are manufactured from cane juice. 
Cane syrup is concentrated whole cane juice, while edible mo- 
lasses is concentrated cane juice from which some sugar has been 
removed by crystallization. These syrups and molasses contain 
both sucrose and invert sugar, and have a relatively dark color 
and a distinctive flavor. They also contain many of the nonsug- 
ars found in sugarcane juice. They are used principally by food 
processors, but substantial amounts are packaged for home use. 
Sorghum syrup is made by extracting juice from the stalk of 
sweet sorghum and evaporating the juice to syrup consistency. 
It is a dark, pungently flavored product that is not widely dis- 
tributed. See FOOD ENGINEERING; FOOD MANUFACTURING; SUGAR 
CROPS. [H.B.H; Ch.B.B.] 


Sugarbeet 


Sugarbeet The plant Beta vulgaris, developed in modern 
times to fill the need for a sugar crop that could be grown in tem- 
perate climates. The sugarbeet was the source of only 5% of the 
world’s commercial sugar in 1840, but by 1890 it supplied al- 
most 50%, and since about 1920 this has dropped to about 40% 
with 60% derived from sugarcane. European countries produce 
most of their sugar from sugarbeet, with some countries having 
surplus sugar for export. See SUGARCANE. 

The sugarbeet, fodder beet, garden beet, and leaf beet (Swiss 
chard) are cultivars of B. vulgaris and are genetically related. 
In addition to B. vulgaris, which includes all cultivated forms 
of beet, there are 12 species of wild beet in the Mediterranean 
and Middle Eastern regions. These wild beets are of interest in 
sugarbeet improvement as sources of disease resistance. Most 
varieties of sugarbeet in the United States are hybrid. —[D Ste.] 


Sugarcane Saccharum officinarum, a member of the grass 
family. This crop originated in New Guinea about 15,000- 
8000 B.c., and was later moved by primitive peoples westward 
into Southeast Asia and India and eastward into Polynesia. Most 
current commercial varieties are interspecific hybrids involving 
primarily two or more of the following species: S. officinarum, 
S. robustum, and S. spontaneum. 

Sugar is generally removed from sugarcane in large factories 
by either milling or diffusion. Milling crushes the stalks as they 
pass between a series of large metal rollers separating the fiber 
(bagasse) from the juice laden with sugar. The diffusion process 
separates sugar from finely cut stalks by dissolving it in hot water 
or hot juice. Processing of the juice is completed by clarification to 
remove nonsugar components, by evaporation to remove water, 
and by removal of molasses in high-speed centrifuges to produce 
centrifugal sugar. Bagasse and molasses are the principal by- 
products from processing sugarcane. [R.E.Co] 


Sulfide phase equilibria The chemistry of the sulfides 
is rather simple inasmuch as most sulfide minerals involve only 
two major elements, although some involve three, and a few 
four. Understanding of the phase relations among these minerals 
is important both to the geologist, whose task it is to locate and 
exploit ore deposits, and to the metallurgist, whose task it is to 
extract the metals from the ores for industrial use. 

Sulfide ore deposits are the most important sources of numer- 
ous metals such as lead, zinc, copper, nickel, cobalt, and molyb- 
denum. In addition, sulfide ores provide substantial amounts of 
noble metals such as platinum, gold, and silver, and of other 
industrially important elements such as cadmium, rhenium, and 
selenium. Although iron sulfides usually are the most common 
minerals in such deposits, most commercial iron is mined from 
iron oxide ores and most sulfur from elemental sulfur deposits. 

Some of the more common sulfide minerals are listed in the 
table. The first eight sulfide minerals listed may be plotted in 
the ternary system copper-iron-sulfur. Similarly, the first two and 
the last two minerals may be plotted in the ternary system iron- 
nickel-sulfur. Thus, by studying in detail the phase equilibria in 
two ternary systems a great deal of information can be obtained 
about many of the common sulfides occurring in ore deposits. 


Common sulfide minerals 


Mineral name Chemical formula 


Pyrite FeSo 
Pyrrhotites (three or more varieties) Fe;_,S 
Covellite CuS 
Digenite CuoSs 
Chalcocite CugS 
Bornite CusFeSq 
Chalcopyrite CuFeS2 
Cubanite CuFe2S3 
Galena PbS 
Sphalerite and wurtzite ZnS 
Metacinnabar and cinnabar HgS 
Argentite and acanthite AgS 
Molybdenite MoS2 
Millerite NiS 
Pentlandite (Fe,Ni)oSg 


However, before the ternary systems can be explored in a system- 
atic way, the binary systems bounding the ternaries have to be 
studied in detail. Similarly, it is necessary that the four bounding 
ternary systems be fully understood before a quaternary system 
can be systematically investigated. Thus, it is seen that before a 
systematic study of, for example, the immensely important qua- 
ternary system copper-iron-nickel-sulfur can proceed, much pre- 
liminary information is required. The prerequisite data include 
complete knowledge of the four ternary systems Cu-Fe-S, Cu- 
Fe-Ni, Fe-Ni-S, and Cu-Ni-S. In turn, the phase relations in these 
ternary systems cannot be systematically studied before the six 
binary systems Fe-Cu, Fe-Ni, Cu-Ni, Fe-S, Cu-S, and Ni-S have 
been thoroughly explored. See PHASE EQUILIBRIUM. 

The enormous differences in the vapor pressures over the dif- 
ferent phases occurring in sulfide systems add complications 
to the diagrammatic representation. For instance, in the Fe-S 
system the vapor pressure over pure iron is about 10-2° atm 
(107% pascal) at 450°C (840°F), whereas that over pure sul- 
fur is a little more than 1 atm (10° pascals) at the same 
temperature. A complete diagrammatic representation of the 
relations in such a system, therefore, requires coordinates for 
composition and temperature, as well as for pressure. In a 
two-component system, such as the Fe-S system, such a 
representation is feasible because only three coordinates are 
necessary. However, in ternary (where such diagrams involve 
four-dimensional space) and in multicomponent systems, this 
type of diagrammatic representation is not possible. For this 
reason it is customary to use composition and temperature coor- 
dinates only for the diagrammatic representation of sulfide sys- 
tems. The relations as shown in such diagrams in reality repre- 
sent a projection from composition-temperature-pressure space 
onto a two-dimensional composition-temperature plane or onto 
a three-dimensional prism, depending upon whether the system 
contains two or three components. 

Pyrite or pyrrhotite, or both, occur almost ubiquitously, not 
only in ore deposits but in nearly all kinds of rocks. Of the binary 
systems mentioned above, therefore, the iron-sulfur system is 
of the most importance to the economic geologist. See PYRITE. 

[G.Kul.] 


Sulfonamide One of the group of organosulfur com- 
pounds, RSO,NHp». Many sulfonamides, which are the amides 
of sulfonic acids, have the marked ability to halt the growth of 
bacteria. The therapeutic drugs of this group are known as sulfa 
drugs. See ORGANOSULFUR COMPOUND. 

The antibacterial spectrum of sulfonamides comprises a wide 
variety of gram-positive and gram-negative bacteria, including 
staphylococci, streptococci, meningococci, and gonococci, as 
well as the gangrene, tetanus, coli, dysentery, and cholera bacilli. 
They have only slight activity against Mycobacterium tuberculo- 
sis, while certain closely related sulfones are quite active against 


M. leprae. The use of sulfonmethoxine has proved to be effective 
in the treatment of chloroquine-resistant malaria. The relative 
potency of sulfonamides against the different microorganisms 
varies, and their action is bacteriostatic rather than bactericidal. 

The antibacterial effect, both in patients and in test tube cul- 
tures, is antagonized by p-aminobenzoic acid (PABA) and PABA- 
containing natural or synthetic products, such as folic acid and 
procaine. Accordingly, the mode of action of sulfonamides is 
considered to be an antimetabolite activity, dependent upon the 
inhibition of enzyme systems involving the essential PABA. See 
BACILLARY DYSENTERY; CHOLERA; LEPROSY; MALARIA; MENINGO- 
coccus; STAPHYLOCOCCUS; STREPTOCOCCUS; TETANUS; TUBERCU- 
LOSIS. 

Bacterial resistance has developed to all known sulfonamides, 
and many sulfonamide-resistant strains are encountered among 
the gram-positive and gram-negative bacteria. The emergence 
of resistance to sulfonamides, however, seems less rapid and less 
widespread than resistance to most antibiotics. 

Sulfonamides are used today mostly as auxiliary drugs or 
in combination with antibiotics. In certain infectious diseases, 
however (for instance, in meningococcal infections and most in- 
fections of the urinary tract), sulfonamides deserve preference 
over antibiotics. See ANTIMICROBIAL AGENTS; DRUG RESISTANCE. 

[R.L.M.; E.Gru.] 


Sulfonation and sulfation Sulfonation is a chemical 
reaction in which a sulfonic acid group, —SO3H, is introduced 
into the structure of a molecule or ion in place of a hydrogen 
atom. Sulfation involves the attachment of the —OSO2OH group 
to carbon, yielding an acid sulfate, ROSO2OH, or of the —SOQ,— 
group between two carbons, forming the sulfate, ROSO2OR. 
Sulfonation of aromatic compounds is the most important 
type of sulfonation. This is accomplished by treating the aromatic 
compound with sulfuric acid, as in the reaction below. The prod- 


uct of sulfonation is a sulfonic acid. 

SO3H 
320°F or 160°C 

E ai E m 

B-Napthalenesulfonic acid 


Napthalene 





Sulfonation may also be defined as any chemical process by 
which the sulfonic acid group, —SO,OH, or the corresponding 
salt or sulfonyl halide group, for example, —SO¢Cl, is introduced 
into an organic compound. These groups may be bonded to 
either a carbon or a nitrogen atom. The latter compounds are 
designated N-sulfonates or sulfamates. 

Most sulfonates are employed as such in acid or salt form 
for applications where the strongly polar hydrophilic —SO,0H 
group confers needed properties on a comparatively hydropho- 
bic nonpolar organic molecule. Some sulfonates, such as 
methanesulfonic and toluenesulfonic acids, are used as cata- 
lysts. The major quantity of sulfonates and sulfates is both mar- 
keted and used in salt form. This category includes detergents; 
emulsifying, demulsifying, wetting, and solubilizing agents; lu- 
bricant additives; and rust inhibitors. See ORGANOSULFUR COM- 
POUND; SUBSTITUTION REACTION. [PH.G/R.S.K.] 


Sulfonic acid A derivative of sulfuric acid (HOSO2OH) 
in which an OH has been replaced by a carbon group, as 
shown in the structure below. Sulfonic acids are strongly acidic, 


water-soluble, nonvolatile, and hygroscopic; they do not act 
as oxidizing agents and are typically highly stable compounds. 
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Sulfonic acids rarely occur naturally. An exception, taurine, 
NH2CH2CH2SO3H, occurs in bile. 

The aliphatic sulfonic acids are generally made by oxidation 
of thiols. Several have unique properties. For example, trifluo- 
romethanesulfonic acid, CF3SO3H, is such a strong acid that it 
will protonate sulfuric acid. A compound derived from natural 
camphor, 10-camphorsulfonic acid, is used extensively in the 
optical resolution of amines. 

Aromatic sulfonic acids are much more important than those 
of the aliphatic series. Aromatic sulfonic acids are produced by 
sulfonation of aromatic compounds with sulfuric acid or fum- 
ing sulfuric acid. Sulfonation of aromatic hydrocarbons is a re- 
versible process; treatment of an aromatic sulfonic acid with su- 
perheated steam removes the —SO3H group. This process can 
be used in purifying aromatic hydrocarbons. Aromatic sulfonic 
acids and their derivatives, especially metal salts, are important 
industrial chemicals. See SULFURIC ACID. 

The most extensive use of the sulfonation reaction is in the 
production of detergents. The most widely used synthetic de- 
tergents are sodium salts of straight-chain alkylbenzenesulfonic 
acids. See DETERGENT. 

Sulfonated polymers, particularly sulfonated polystyrenes, act 
as ion-exchange resins which have important applications in wa- 
ter softening, ion-exchange chromatography, and metal separa- 
tion technology. Both sulfonated polymers and simple aromatic 
sulfonic acids, particularly p-toluenesulfonic acid, are frequently 
used as acid catalysts in organic reactions such as esterification 
and hydrolysis. See HOMOGENEOUS CATALYSIS; ION EXCHANGE. 

The sulfonic group, in either acid or salt form, is capable of 
making many substances water soluble, increasing their use- 
fulness. This application is particularly significant in the dying 
industry, in which a majority of the dyes are complex sodium 
sulfonates. Many acid-base indicators are soluble due to the pres- 
ence of a sodium sulfonate moiety. Some pigments used in the 
paint and ink industry are insoluble metal salts or complexes of 
sulfonic acid derivatives. Most of the brighteners used in deter- 
gents compounded for laundering are sulfonic acid derivatives of 
heterocyclic compounds. See ACID-BASE INDICATOR; DETERGENT; 
DyE; INK; ORGANOSULFUR COMPOUND; PAINT. [C.R.J.] 


Sulfur A chemical element, S, atomic number 16, and 
atomic weight 32.064. The atomic weight reflects the fact that 
sulfur is composed of the isotopes 3S (95.1%), 33S (0.74%), %S 
(4.2%), and %®S (0.016%). The ratios of the various isotopes vary 
slightly but measurably according to the history of the sample. 
By virtue of its position in the periodic table, sulfur is classified 
as a main-group element. See PERIODIC TABLE. 

The chemistry of sulfur is more complex than that of any other 
elemental substance, because sulfur itself exists in the largest 
variety of structural forms. At room temperature, all the stable 
forms of sulfur are molecular; that is, the individual atoms ag- 
gregate into discrete molecules, which in turn pack together to 
form the solid material. In contrast, other elements near sulfur 
in the periodic table normally exist as polymers (silicon, phos- 
phorus, arsenic, selenium, tellurium) or as diatomic molecules 
(oxygen, nitrogen, chlorine). Selenium and phosphorus can ex- 
ist as molecular solids, but the stable forms of these elements are 
polymeric. 

At room temperature the most stable form of sulfur is the cyclic 
molecule Sg. The molecule adopts a crownlike structure, con- 
sisting of two interconnected layers of four sulfur atoms each. 
The S—S bond distances are 0.206 nanometer and the S—S—S 
bond angles are 108°. Three allotropes are known for cyclo-Sg. 
The most common form is orthorhombic a-sulfur, which has a 
density of 2.069 g/cm® (1.200 oz/in.3) and a hardness of 2.5 
on the Mohs scale. It is an excellent electrical insulator, with a 
room temperature conductivity of 10!° ohm7! cm7!. Sublimed 
sulfur and “flowers” of sulfur are generally composed of a-Sg. 
Sulfur is quite soluble in carbon disulfide (CS; 35.5/100 g or 
1.23 02/3.52 oz at 25°C or 77°F), poorly soluble in alcohols, and 
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practically insoluble in water. At 95.3°C (203°F), sulfur changes 
into the monoclinic £ allotrope. This form of sulfur also consists 
of cyclic Sg molecules, but it has a slightly lower density at 1.94— 
2.01 g/cm? (1.12-1.16 oz/in.3). A third allotrope containing Sg 
is triclinic y-sulfur. The £ and y allotropes of sulfur slowly revert 
to the a form at room temperature. Crystals of sulfur are yellow 
and have an absorption maximum in the ultraviolet at 285 nm, 
which shifts to higher energy as the temperature decreases. At 
low temperatures, Sg is colorless. Even at room temperature, 
however, finely powdered sulfur can appear to be nearly white. 

The best-studied system is a-Sg, which converts to the 6 form 
at 90°C (194°F), which then melts at 120°C (248°F) to give a 
golden yellow liquid. If this melt is quickly recooled, it refreezes 
at 120°C (248°F), thus indicating that it consists primarily of Sg 
molecules. If the melt is maintained longer at 120°C (248°F), 
then the freezing point is lowered about 5°C (9°F), indicating 
the formation of about 5% of other rings and some polymer. 
At 159.4°C (318.9°F), the melt suddenly assumes a red-brown 
color. Over the range 159.4-195°C (318.9-383°F), the viscos- 
ity of the melt increases 10,000-fold before gradually decreasing 
again. This behavior is very unusual, since the viscosity of most 
liquids decreases with increasing temperature. The strong tem- 
perature dependence of the viscosity is due to the polymerization 
and eventual depolymerization of sulfur. Polymeric sulfur retains 
its elastomeric character even after being cooled to room tem- 
perature. There are several polymeric forms of sulfur, but all of 
them revert to œ-Sg after a few hours. 

Sublimination of Sg occurs when it is maintained in a vacuum 
at a temperature below its melting point. It vaporizes at 444.61°C 
(832.30°F). Below 600°C (1110°F), the predominant species in 
the gas are Sg followed by S; and Sẹ. Above 720°C (1328°F), 
violet S» is the major species. 

Principal inorganic compounds. Hydrogen sulfide (H2S) 
is the most important compound that contains only sulfur and 
hydrogen. It is a gas at room temperature with a boiling point of 
—61.8°C (—79.2°F) and a freezing point of —82.9°C (—117°F). 
The low boiling point of hydrogen sulfide is attributed to the 
weakness of intermolecular S---H hydrogen bonding; the O---H 
hydrogen bond is much stronger, as evidenced by the high boil- 
ing point of water. Gaseous hydrogen sulfide is 1.19 times more 
dense than air, and air-H2S mixtures are explosive. Hydrogen 
sulfide has a strong odor similar to that of rotten eggs; its odor 
is detectable at concentrations below 1 microgram/m*. At high 
concentrations, HS has a paralyzing effect on the olfactory sys- 
tem, which is very hazardous because HeS is even more toxic 
than carbon monoxide (CO). 

The most common compound that contains only carbon and 
sulfur is carbon disulfide (CS2). Carbon disulfide molecules are 
linear, consisting of two sulfur atoms connected to a central car- 
bon atom. Carbon disulfide is a toxic, highly flammable, and 
volatile liquid that melts at —111°C (—168°F) and boils at 46°C 
(115°F). Commercial carbon disulfide has a strong unpleas- 
ant odor due to impurities. It is manufactured from methane 
and elemental sulfur and is used for the production of carbon 
tetrachloride, rayon, and cellophane. Structurally related to car- 
bon disulfide is carbonyl sulfide (SCO), which forms from car- 
bon monoxide and elemental sulfur. The chlorination of CS», 
gives ClgCSCl, which can be reduced by HS to thiophosphene, 
CSCly. Thiophosgene (CSClz) [boiling point 73°C or 163°F] is 
a planar molecule with the carbon at the center of a triangle 
defined by the sulfur and two chlorine atoms. Thiocyanate, the 
linear anion NCS , is prepared by the reaction of cyanide (CN) 
with elemental sulfur. 

Several sulfur oxides exist, but the dioxide and trioxide are 
of preeminent importance. Sulfur dioxide (SO,) is a colorless 
gas that boils at —10.02°C (113.97°F) and freezes at —75.46°C 
(—103.8°F). The density of liquid sulfur dioxide at —10°C (14°F) 
is 1.46 g/cm? (0.84 oz/in.%). Liquid sulfur dioxide is an excellent 
solvent. The sulfur dioxide molecule is bent, with an O—S—O 
angle of 119°. 


Sulfur trioxide (SO3) is a planar molecule that is a liquid at 
room temperature that exists in equilibrium with a cyclic trimeric 
structure known as B-SO3. When B-SOs, actually S3Os, is treated 
with traces of water, it converts to either of two polymeric forms 
referred to as y- and a-sulfur trioxide. These are fibrous ma- 
terials, proposed to have the formula (SO3),H», where x is in 
the thousands. Sulfur trioxide is prepared by the oxidation of 
sulfur dioxide, although at very high temperatures this reaction 
reverses. Exposure of sulfur trioxide to water yields sulfuric acid 
(H2SO3); exposure of SO; to sulfuric acid yields disulfuric acid 
(H2S207). See SULFURIC ACID. 

Chlorine and sulfur react to give a family of compounds with 
the general formula S,Cle, several members of which have been 
obtained in pure form. The structures of these compounds are 
based on an atom or chain of sulfur atoms terminated with Cl. 
Sulfur monochloride (SzCl,), also known as sulfur monochloride, 
is the most widely available of the series. It is a yellow oil that 
boils at 138°C (280°F), and reacts with chlorine in the presence 
of iron(II) chloride (FeCl3) catalyst to give sulfur dichloride (SC2), 
which is a red volatile liquid with a boiling point of 59°C (138°F). 
Treatment of sulfur dichloride with sodium fluoride (NaF) gives 
SF4. 

Thionyl chloride (OSCl2) is a colorless reactive compound 
with a boiling point of 76°C (169°F); it is used to convert hy- 
droxy compounds to chlorides. Important applications include 
the preparation of anhydrous metal halides and alkyl halides. 
Sulfuryl chloride (O2SCly; boiling point 69°C or 156°F) is used 
as a source of chlorine. 

Organosulfur compounds. This family of compounds con- 
tains carbon, hydrogen, and sulfur, and it is a particularly vast 
area of sulfur chemistry. Thiols, also known as mercaptans, fea- 
ture the linkage C—S—H. Mercaptans are foul-smelling com- 
pounds. They are the sulfur analogs of alcohols, but they are 
more volatile. They can be prepared by the action of hydrogen 
sulfide (H2S) on olefins. Deprotonation of thiols gives thiolate 
anions, which form stable compounds with many heavy metals. 
Thiols and especially thiolates can be oxidized to form disulfides 
(persulfides), which have the connectivity of C—S—S—C. The 
organic persulfides are also related to organic polysulfides, which 
have chains of sulfur atoms terminated with carbon. The intro- 
duction of such mono-, di-, and polysulfide linkages is the basis 
of the vulcanization process, which imparts desirable mechani- 
cal properties to natural or synthetic polyolefin rubbers. This is 
accomplished by heating the polymer with sulfur in the presence 
of a zinc catalyst. See RUBBER. 

Thioethers, also known as organic sulfides, feature the connec- 
tivity C—S—C and are often prepared from the reaction of thi- 
olates and alkyl halides. Like mercaptans, thioethers often have 
strong unpleasant odors, but they are also responsible for the 
pleasant odors of many foods and perfumes. They are inten- 
tionally introduced at trace levels in order to impart an odor to 
gaseous hydrocarbon fuels. The reaction of alkyl dihalides and 
sodium polysulfides affords organic polysulfide polymers known 
as thiokols. 

There are many organic sulfur oxides; prominent are sulfonic 
acids (RSO3H), which are the organic derivatives of sulfuric acid. 
These compounds are prepared by the oxidation of thiols as 
well as by treatment of benzene derivatives with sulfuric acid, 
for example, benzene sulfonic acid. Most detergents are salts of 
sulfonic acids. See DETERGENT. 

Biochemistry. Sulfur is required for life. Typical organisms 
contain 2% sulfur dry weight. Three amino acids contain sulfur, 
as do many prosthetic groups in enzymes. Some noteworthy sul- 
fur compounds include the disulfide lipoic acid, the thioethers 
biotin and thiamine (vitamin B,), and the thiol coenzyme A. 
Sulfide ions, S?~, are found incorporated in metalloproteins and 
metalloenzymes such as the ferredoxins, nitrogenases, and hy- 
drogenases. See AMINO ACIDS; ENZYME; PROTEIN. 

Many bacterial species obtain energy by the oxidations of sul- 
fides. Bacteria of the genus Thiobacillus couple the conversion of 


carbon dioxide (CO3) to carbohydrates to the aerobic oxidation 
of mineral sulfides to sulfuric acid. This activity can be turned to 
good use for leaching low-grade mineral ores. Often, however, 
the sulfuric acid runoff (such as in mines or sewers) has negative 
environmental consequences. The purple and green bacteria as 
well as the blue-green algae are remarkable because they are 
photosynthetic but anaerobic; they oxidize sulfide, not water (as 
do most photosynthetic organisms). Depending on the species, 
the sulfur produced in this energy-producing pathway can accu- 
mulate inside or outside the cell wall. See BACTERIAL PHYSIOLOGY 
AND METABOLISM; PHOTOSYNTHESIS. 

Minerals. Sulfide minerals are among the most important 
ores for several metals. These compounds are two- or three- 
dimensional polymers containing interconnected metal cations 
and sulfide S*~ or persulfido S_2~ anions. In general, metal sul- 
fides are darkly colored, often black, and they are not soluble 
in water. They can sometimes be decomposed by using strong 
acids, with liberation of hydrogen sulfide. Certain sulfides will 
also dissolve in the presence of excess sulfide or polysulfide ions. 

Pyrites (FeS), also known as iron pyrites or fool’s gold, are 
the most common sulfide minerals and can be obtained as very 
large crystals that have a golden luster. Sphalerite (zinc blende; 
ZnS) and galena (PbS) are major sources of zinc and lead. Or- 
ange cinnabar (HgS) and yellow greenockite (CdS) are the ma- 
jor ores for mercury and cadmium, respectively. Molybdenite 
(MoS,) is the major ore of molybdenum. 

The sulfur content of fossil fuels results from the sulfur in the 
ancient organisms as well as from subsequent incorporation of 
mineral sulfur into the hydrocarbon matrix. Gaseous fossil fu- 
els are often contaminated with hydrogen sulfide, which is an 
increasingly important source of sulfur. Organic derivatives con- 
taining the C—S—C linkage are primarily responsible for the sul- 
fur content of petroleum and coal. The so-called organic sulfur 
in petroleum can be removed by hydrodesulfurization catalysis, 
involving reaction with hydrogen over a molybdenum catalyst, 
to give hydrocarbons and hydrogen sulfide. [TB.R.] 


Sulfuric acid A strong mineral acid with the chemical for- 
mula H2SQz,. It is a colorless, oily liquid, sometimes called oil 
of vitriol or vitriolic acid. The pure acid has a density of 1.834 
at 25°C (77°F) and freezes at 10.5°C (50.90°F). It is an impor- 
tant industrial commodity, used extensively in petroleum refining 
and in the manufacture of fertilizers, paints, pigments, dyes, and 
explosives. 

Sulfuric acid is produced on a large scale by two commercial 
processes, the contact process and the lead-chamber process. 
In the contact process, sulfur dioxide, SO», is converted to sul- 
fur trioxide, SO3, by reaction with oxygen in the presence of a 
catalyst. Sulfuric acid is produced by the reaction of the sulfur 
trioxide with water. The lead-chamber process depends upon 
the oxidation of sulfur dioxide by nitric acid in the presence of 
water, the reaction being carried out in large lead rooms. 

Sulfuric acid reacts vigorously with water to form several hy- 
drates. The concentrated acid, therefore, acts as an efficient dry- 
ing agent, taking up moisture from the air and even abstracting 
the elements of water from such compounds as sugar and starch. 
The concentrated acid also acts as a strong oxidizing agent. It re- 
acts with most metals upon heating to produce sulfur dioxide. 
See SULFUR. [EJ.J.] 


Sum rules Formulas in quantum mechanics for transi- 
tions between energy levels, in which the sum of the transition 
strengths is expressed in a simple form. Sum rules are used to de- 
scribe the properties of many physical systems, including solids, 
atoms, atomic nuclei, and nuclear constituents such as protons 
and neutrons. The sum rules are derived from quite general 
principles, and are useful in situations where the behavior of 
individual energy levels is too complex to describe by a pre- 
cise quantum-mechanical theory. See ENERGY LEVEL (QUANTUM 
MECHANICS). 
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In general, sum rules are derived by using Heisenberg’s 
quantum-mechanical algebra to construct operator equalities, 
which are then applied to particles or the energy levels of a sys- 
tem. See QUANTUM MECHANICS. [G.FBe.] 


Sun The star around which the Earth revolves, and the 
planet’s source of light and heat, hence life. The Sun is a globe 
of gas, 1.4 x 10° km (8.65 x 10° mi) in diameter with a mass 
333,000 times the Earth, held together by its own gravity. The 
surface temperature of the Sun is about 6000 K (10,000°F); 
since solids and liquids do not exist at these temperatures, the 
Sun is entirely gaseous. Almost all the gas is in atomic form, al- 
though a few molecules exist in the coolest surface regions, such 
as sunspots. 

The Sun is a typical member of the spectral class dG2, stars of 
surface temperature 6000 K. The d stands for a dwarf, a normal 
star of that class. See SPECTRAL TYPE. 

Solar structure. The interior of the Sun can be studied only 
by inference from the observed properties of the entire star. The 
great mass of the Sun presses down on the center, requiring a 
gas with a central density of near 90 g/cm? and 2 x 10’ K (3.6 x 
10°°F) temperature to support it. At these huge temperatures 
and densities, nuclear reactions take place. The radiation pro- 
duced flows outward till it is radiated into space by the surface 
(photosphere) at 6000 K (10,000°F). 

Energy production. The energy of the Sun is produced by the 
conversion of hydrogen into helium. For each hydrogen atom 
converted, one neutrino is produced. These neutrinos are de- 
tected, but less than the expected number. See NEUTRINO; SOLAR 
NEUTRINOS. 

The material at the center of the Sun is so dense that a few mil- 
limeters are opaque, so the photons created by nuclear reactions 
are continually absorbed and reemitted and thus make their way 
to the surface by a random walk. The atoms in the center of the 
Sun are entirely stripped of their electrons by the high temper- 
atures, and most of the absorption is by continuum processes, 
such as the scattering of light by electrons. Because there are so 
many absorption and emission processes along the way, it can 
take as long as a million years to complete the random walk to 
the surface. 

Convection. In the outer regions of the solar interior, the tem- 
perature is low enough for ions and even neutral atoms to form 
and, as a result, atomic absorption becomes very important. The 
high opacity makes it very difficult for the radiation to continue 
outward, so steep temperature gradients are established that re- 
sult in convective currents. Most of the outer envelope of the 
Sun is in such convective equilibrium. These large-scale mass 
motions are responsible for the complex phenomena observed 
at the surface. See CONVECTION (HEAT). 

Radiation. Electromagnetic energy is produced by the Sun 
in essentially all wavelengths. However, more than 95% of the 
energy is concentrated in the relatively narrow band between 
290 and 2500 nm and is accessible to routine observation from 
ground stations on Earth. The maximum radiation is in the green 
region, and the eyes of human beings have naturally evolved 
to be sensitive to this range of the spectrum. The total radia- 
tion is called the solar constant. It is not exactly constant but 
varies slightly (40.1%) with the solar cycle. The ultraviolet flux, 
however, varies by substantial factors depending on the exact 
wavelength, and this affects the Earth’s upper atmosphere. See 
ELECTROMAGNETIC RADIATION; SOLAR CONSTANT. 

Atmosphere. Although the Sun is gaseous, it can be seen only 
to the point at which the density is so high that the material 
is opaque. This layer, the visible surface of the Sun, is termed 
the photosphere. Light from father down reaches the Earth by 
repeated absorption and emission by the atoms, but the deepest 
layers cannot be seen directly. The surface is actually not sharp, 
but the Sun is so far away that the smallest distance that can be 
resolved with the best telescope is about 300 km (200 mi). Since 
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the density e-folding height (scale height) is less than 200 km 
(120 mi), the edge appears sharp. See PHOTOSPHERE. 

Above the photosphere the atmosphere is transparent, and 
its density falls off much more slowly because magnetic fields 
support the ionized particles. The atmosphere can be seen by 
using a narrow-band filter or a spectrograph to pick out the iso- 
lated wavelengths absorbed by the atmospheric gases. In the 
upper photosphere it is cooler, and the lines are dark. If the light 
is imaged in the strongest lines, such as those of hydrogen, a 
region higher still is seen, called the chromosphere. The light 
from this region is dominated by the red hydrogen alpha (level 
2 — 3 transition) line, which gives it a rosy color seen at a solar 
eclipse. The chromosphere is a rapidly fluctuating region of jets 
and waves coming up from the surface. When all the convected 
energy coming up from below reaches the surface, it is concen- 
trated in the thin material and produces considerable activity. 
Where the magnetic field is stronger, these waves are absorbed, 
and raise the temperature to 7000-8000 K (12,000-14,000°F). 
The scale height of the chromosphere is 1000 km (600 mi) or 
more, so there no longer is a sharp edge. See CHROMOSPHERE; 
ECLIPSE. 

When the Moon obscures the Sun at a total solar eclipse, 
the vast extended atmosphere of the Sun called the corona can 
be seen. The corona is a million times fainter than the photo- 
sphere, so it is visible only when seen against the dark sky of an 
eclipse or with very special instruments. Its density is low, but 
its temperature is high (more than 10° K or 1.8 x 10°°F). The 
hot gas evaporating out from the corona flows steadily to the 
Earth and farther in the solar wind. See SOLAR CORONA; SOLAR 
WIND. 

Coronal holes. Early coronal observations showed that the 
corona was occasionally absent over certain regions. In particu- 
lar, at sunspot minimum it was quite weak over the poles. X-ray 
pictures revealed great bands of the solar surface essentially de- 
void of corona for many months. These proved to be regions 
where the local magnetic fields were connected to quite distant 
places, so the fields actually reached out to heights from which 
the solar wind could sweep the gas outward. Analysis of solar 
wind data showed that equatorial coronal holes were associated 
with high-velocity streams in the solar wind, and recurrent geo- 
magnetic storms were associated with the return of these holes. 
Thus the relative intensity of the corona over sunspot regions is 
partly due to their strong, closed magnetic fields which trap the 
coronal gas. 

Solar activity. There are a number of transient phenomena 
known collectively as solar activity. These are all connected with 
sunspots. 

Sunspots. Sunspots were discovered around 1610. Heinrich 
Schwabe announced in 1843 that their number rose and fell with 
a 10-year period. Subsequent study of the old records revealed 
an 11-year period since the original discovery. 

The number of sunspots peaks soon after the beginning of 
each cycle and decays to a minimum in 11 years. The first spots 
of a number cycle always occur at higher latitudes, between 20° 
and 35°, and the latitude of occurrence decreases as the cycle 
unfolds (Spérer’slaw). Almost no spots are observed outside the 
latitude range of 5-35°. The great majority are small and last 
a few days, but some last for two rotations. In 1908, George 
Ellery Hale discovered that sunspots had strong magnetic fields. 
Each spot group contains positive and negative magnetic po- 
larity (monopoles are forbidden by Maxwell’s laws). Hale found 
that the polarities were mirrored, with the same polarity generally 
leading in one hemisphere and following in the other. He found 
that with each new number cycle the lead polarity switches, so 
that the complete magnetic cycle lasts 22 years. But each new 
number cycle starts a few years before the end of the previous 
one, so the average duration of a half-cycle is nearly 14 years. 
See MAGNETISM. 

The darkness of sunspots (Fig. 1) is probably due to the intense 
magnetic fields (3000 gauss or 0.3 tesla), which cool the surface 





Fig. 1. Large symmetric sunspot photographed in Ha light. 
Clock indicates time of photograph. (Big Bear Solar Obser- 
vatory) 


by suppressing the normal convective energy flow from below. 
It takes several days for the darkening to occur. 

Although the sunspot is cool, its neighborhood is the scene of 
the hottest and most intense activity, generally referred to as an 
active region. Magnetic energy is continually released there. The 
corona above an active region is hot and dense, roughly three 
times hotter and denser than in quiet regions. 

Prominences. The term “prominence” is used for any cloud 
of cool gas in the corona, where it appears bright against the 
sky. Because these clouds absorb the chromospheric light and 
scatter it, they appear dark against the solar disk in Ha and other 
strong lines. In continuous light they are transparent. At the limb 
we see the chromospheric light they scatter against the dark sky. 
Since they are much denser than the corona, something must 
hold them up against gravity. Prominences are found only in re- 
gions of horizontal magnetic fields that support them. Thus fila- 
ments on the disk, which may last for weeks, are good markers of 
the magnetic boundaries. When the magnetic structure changes, 
prominences become unstable and erupt, always upward. They 
also may be ejected by solar flares or appear as graceful loops 
raining from the corona after flares. Erupting prominences are 
probably the source of coronal mass ejections, in which a bubble 
of coronal material erupts outward at several hundred kilometers 
per second and flows out into interplanetary space. 

Plages. Just as prominences occur when the magnetic field 
changes from one sign to the other, plages occur whenever the 
magnetic field is vertical and relatively strong but not strong 
enough to form a sunspot. They are bright regions in any strong 
spectrum line, because the chromosphere is heated there. In a 
typical active region, the preceding magnetic field is clumped in 
a sunspot and the following field spread out in a plage. In Ha 
light, the plage is seen to be connected to the sunspot by dark 
fibrils outlining the lines of force. 

Flares. The most spectacular activity associated with sunspots 
is the solar flare (Fig. 2). A flare is defined as an abrupt increase 
in the Ha emission from the sunspot region. The brightness of 
the flare may be up to eight times that of the chromosphere; the 
rise time is seldom longer than a few minutes. The Ha brighten- 
ing results from heating of the chromosphere at the foot points 
of the magnetic field by a tremendous energy release in the at- 
mosphere. While flares are usually visible only in chromospheric 
lines, the foot points of big flares can be seen in white light. From 
the foot points, a cloud of hot material, up to 3 x 107 K (5.4 x 
10’°F) arises and concentrates at the arch tops. This cloud con- 
denses out in an array of loop prominences. An active sunspot 





Fig. 2. The great “sea horse” flare of August 7, 1972, late 
in the flare, photographed in the blue wing of the Ha line. 
The neutral line between two bright strands is crossed by 
an arcade of bright loop prominences raining down from the 
corona. (Big Bear Solar Observatory) 


group produces a hierarchy of flares, a few big and many small 
ones. 

Flares are often associated with the eruption of filaments. A few 
minutes after the eruption begins, there is an abrupt acceleration 
and a storm of energetic particles is produced, heating the corona 
to flare brightness. 

The flare produces a huge stream of solar energetic particles 
(SEP) as well as shock waves. A huge magnetohydrodynamic 
shock wave flies out at about 1000 km/s (600 mi/s) and con- 
tinues into interplanetary space, often reaching the Earth. The 
wave produces a huge radio burst in the meter-wavelength range 
as it excites the coronal layers. The energetic nuclei produce 
gamma-ray lines from nuclear reactions as they penetrate to 
the photosphere. If they are sufficiently numerous, they heat the 
photosphere faster than it can reemit energy and a white light 
flare is observed, usually in the form of bright transient flashes 
at the foot points of the flare loops. The particles reach the Earth 
in a great particle storm. [H.Zi.] 


Sun dog A bright spot of light that sometimes appears on 
either side of the Sun, the same distance above the horizon 
as the Sun, and separated from it by an angle of about 22° 
(see illustration). For higher Sun elevations, the angle increases 
slightly. These spots are known by many common names: sun 





Sun dogs on either side of the Sun. Also visible are the 22° 
halo and the parhelic circle. (Courtesy of Robert Greenler) 
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dogs, mock suns, false suns, or the 22° parhelia. They usually 
show a red edge on the side closest to the Sun. On some oc- 
casions the entire spectrum of colors can be spread out in the 
sun-dog spot but, commonly, the red edge is followed by an 
orange or yellow band that merges into a diffuse white region. 
The effects result from the refraction by sunlight through small, 
flat, hexagonal-shaped ice crystals falling through the air such 
that their flat faces are oriented nearly horizontally. See HALO; 
METEOROLOGICAL OPTICS. [R.Gr.] 


Sunflower 


Sundew Any plant of the genus Drosera (90 species) of the 
family Droseraceae. Sundews are small, herbaceous, insectivo- 
rous plants that grow on all the continents, especially Australia. 
Numerous glandular hairs (tentacles) on the leaf secrete a vis- 
cous fluid which traps a visiting insect. The tentacles then bend 
inward about the victim, bringing it into contact with the sur- 
face of the leaf where it is digested. The droplets secreted by 
the glands on the leaves glitter like dewdrops in the morning 
sunlight; hence, the name, sundew. See INSECTIVOROUS PLANTS; 
NEPENTHALES; SECRETORY STRUCTURES (PLANT). [PD.St./E.1.C.] 


Sundial An instrument for telling time by the Sun. It is com- 
posed of a style that casts a shadow and a dial plate, which is 
the surface upon which hour lines are marked and upon which 
the shadow falls. The style lies parallel to Earth’s axis. The con- 
struction of the hour lines is based on the assumption that the 
apparent motion of the Sun is always on the celestial equator. 
The most widely used form is the horizontal dial that indicates 
local apparent time (Sun time). Other forms of the sundial indi- 
cate local mean time, and standard time. See TME. [R.N.M.] 


Sunfish A name given to certain members of the freshwater 
family Centrarchidae as well as to members of the marine family 
Molidae. There are two species of oceanic sunfishes in the genus 
Mola; both are large, stout, deep-bodied fishes with rough skin 
and are found in warm seas, where they are frequently seen near 
the surface. These fishes appear to be tailless. 

The fresh-water sunfishes are grouped together with the crap- 
pies. The warmouth sunfish (Chaenobryttus gulosus) is the only 
species found in the United States, where it occurs in the Missis- 
sippi River drainage and in streams along the Atlantic coast. It 
differs from other sunfishes in having teeth on the tongue. 

The crappies are game fish of considerable importance. Crap- 
pies are a favorite species for pond culture. They can be readily 
transplanted and, under favorable conditions, multiply prodi- 
giously. See PERCIFORMES. [C.B.C.] 


Sunflower Helianthus annuus, the most widely distributed 
of the 50 native North American species of this genus of the 
family Compositae. It is an extremely variable species, with two 
main divisions. The first involves wild weedy plants found along 
roadways and other recently disturbed areas; the second, do- 
mesticated plants grown in fields and gardens. See ASTERALES. 
Within the domestic type there are two categories of plants: 
the ornamental, which has a few branches with larger heads 
than the wild, and the crop type, which has only a single stem 
and the largest head of all sunflowers (see illustration). Crop 
types are either oil or nonoil. Plant breeders have modified the 
plant for adaptation to modern, large-scale farming and have 
increased the oil content of the seeds. The present worldwide 
interest in growing sunflowers as a crop is due to the increased 
yield of the new commercially available oilseed hybrids. 
Sunflowers are grown on all the continents and in many coun- 
tries throughout the world. Russia is the major producer, fol- 
lowed by Argentina, the United States, and Canada. Sunflower 
oil (sunoil) is the second most important vegetable crop oil. It is 
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Sunlamp 





Maturing sunflower. (U.S. Department of Agriculture) 


a high-quality oil and is high in linoleic fatty acid. The oil is used 
in cooking, salad dressing, mayonnaise, margarine, and soap. 
[B.H.B.] 


Sunlamp A special form of mercury arc discharge lamp de- 
signed to produce ultraviolet radiation. These lamps also pro- 
duce some radiant energy in the visible region of the spectrum, 
thus having a light output as well as an ultraviolet output. The 
lamps are principally used for producing a skin tan on the human 
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Structure of sunlamps. (a) FS-40. (b) RS. 


body. The less common uses include therapeutically producing 
vitamin D in the body for the treatment of rickets and causing 
fluorescence or photochemical reactions. 

Sunlamps have ultraviolet radiation at wavelengths above 
280 nanometers. The lower limit is set by the fact that the quartz 
and high-silica glass used for lamp envelopes do not transmit be- 
low 280 nm. The two lamps designed for tanning are the FS-40 
and the RS. The FS-40 tubular fluorescent lamp (illus. a) operates 
with a low-pressure mercury arc that causes a special chemical 
phosphor coating inside the tube to radiate ultraviolet energy. 
The RS sunlamp (illus. b) is a reflector unit containing a high- 
pressure mercury arc tube for generating ultraviolet energy plus 
a tungsten filament (similar to incandescent lamp filaments) in 
series with the arc tube to serve as a ballast. See MERCURY-VAPOR 
LAMP; ULTRAVIOLET LAMP. [G.R.P] 


Superacid An acid which has an extremely great proton- 
donating ability. It has proved convenient to define a superacid 
somewhat arbitrarily as an acid, or more generally, an acidic 
medium, which has a proton-donating ability equal to or greater 
than that of anhydrous (100%) sulfuric acid. 

Superacids belong to the general class of proton or Brönsted 
acids. A proton acid is defined as any species which can act as a 
source of protons and which will therefore protonate a suitable 
base, as in reaction (1). 


HA +B = BH* + A7 (1) 


The strengths of acids are often compared by measuring the 
extent of their ionization in water, that is, the extent to which 
they can protonate the base water, as in reaction (2). 


HA + H20 = H30* +AT (2) 


However, all strong acids are fully ionized in dilute aqueous 
solution, and they therefore appear to have the same strength. 
Their strengths are said to be reduced or leveled to that of the 
hydronium ion (H30*), which is the most highly acidic species 
that can exist in water. In any case, many of the superacids react 
with and are destroyed by water. For these reasons, the strengths 
of superacids cannot be measured by the conventional means of 
utilizing their aqueous solutions. The acidities of superacids can, 
however, be conveniently measured in terms of the Hammett 
acidity function. See ACID AND BASE. 

The Hammett acidity function is a method of measuring acid- 
ity based on the determination of the ionization ratios of suit- 
able weak bases (indicators), usually by means of the change 
in absorption spectrum that occurs on protonation of the base, 
although the nuclear magnetic resonance (NMR) spectrum has 
also been used. The Hammett acidity function (Ho) is defined 
by Eq. (3), where Kgy, is the dissociation constant of the acid 


[BH"] 
[B] 





Ho = pKgu+ — log (3) 


form of the indicator and [BH*]/[B] is the ionization ratio of the 
indicator. Hammett acidity function (Ho) values for a number of 
superacids are given in the table. In each case the value refers to 
the 100% (anhydrous) acid. Each of the superacids in the table 


Hammett acidity function values for several superacids 


Superacid Formula —Ho 
Sulfuric acid H2SO4 11.9 
Chlorosulfuric acid HSO3Cl 13.8 
Trifluoromethane sulfonic acid HSO3CF3 14.0 
Disulfuric acid HeS207 14.4 
Fluorosulfuric acid HSO3F AS 
Hydrogen fluoride HE 1S 


is a liquid at room temperature, and each forms the basis of a 
solvent system. See IONIC EQUILIBRIUM; SOLUTION. [R.J.Gi.] 


Supercharger An air pump or blower in the intake system 
of an internal combustion engine. Its purpose is to increase the 
air-charge weight and therefore the power output from an engine 
of a given size. In an aircraft engine, the supercharger counteracts 
the power loss that results from the decrease of atmospheric 
pressure with increase of altitude. Various types of pumps and 
compressors may be used as superchargers, which are either 
mechanically driven by the engine crankshaft or powered by the 
engine exhaust gas. See COMPRESSOR; PUMP; TURBOCHARGER. 

Some production automobile engines use a spiral-type super- 
charger, while others use a pressure-wave supercharger. How- 
ever, automotive, marine, and stationary engines generally use 
a positive-displacement Roots blower driven from the engine 
crankshaft. See INTERNAL COMBUSTION ENGINE. 

In a supercharged diesel engine, the increased air charge al- 
lows the engine to burn more fuel and produce greater power 
without creating excessive pressure inside the cylinder. Super- 
charging the diesel engine makes ignition of the fuel easier with- 
out requiring a fuel of better quality. See DIESEL ENGINE. 

To enable a reciprocating aircraft engine to develop its rated 
sea-level power at altitude, a supercharger must be used to in- 
crease the pressure and weight of the intake air charge. Centrifu- 
gal compressors are generally used because of their relatively 
small size for a given capacity, and are driven either by a gear 
drive from the crankshaft or by the engine exhaust gas. With 
gear drive, a lower ratio is commonly used at low and medium 
altitudes, with a change to a higher ratio at high altitudes. See 
AIRCRAFT ENGINE. [J.A.B.; D.L.An.] 


Supercomputer A computer which, among existing 
general-purpose computers at any given time, is superlative, of- 
ten in several senses: highest computation rate, largest mem- 
ory, or highest cost. Predominantly, the term refers to the fastest 
“number crunchers,” that is, machines designed to perform nu- 
merical calculations at the highest speed that the latest electronic 
device technology and the state of the art of computer architec- 
ture allow. 

The demand for the ability to execute arithmetic operations 
at the highest possible rate originated in computer applications 
areas collectively referred to as scientific computing. Large-scale 
numerical simulations of physical processes are often needed in 
fields such as physics, structural mechanics, meteorology, and 
aerodynamics. A common technique is to compute an approxi- 
mate numerical solution to a set of partial differential equations 
which mathematically describe the physical process of interest 
but are too complex to be solved by formal mathematical meth- 
ods. This solution is obtained by first superimposing a grid ona 
region of space, with a set of numerical values attached to each 
grid point. Large-scale scientific computations of this type often 
require hundreds of thousands of grid points with 10 or more 
values attached to each point, with 10 to 500 arithmetic opera- 
tions necessary to compute each updated value, and hundreds 
of thousands of time steps over which the computation must be 
repeated before a steady-state solution is reached. See COMPU- 
TATIONAL FLUID DYNAMICS; NUMERICAL ANALYSIS; SIMULATION. 

Two lines of technological advancement have significantly 
contributed to what roughly amounts to a doubling of the fastest 
computers’ speeds every year since the early 1950s—the steady 
improvement in electronic device technology and the accumula- 
tion of improvements in the architectural designs of digital com- 
puters. 

Computers incorporate very large-scale integrated (VLSI) cir- 
cuits with tens of millions of transistors per chip for both logic and 
memory components. A variety of types of integrated circuitry is 
used in contemporary supercomputers. Several use high-speed 
complementary metallic oxide semiconductor (CMOS) technol- 
ogy. Throughout most of the history of digital computing, su- 
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percomputers generally used the highest-performance switching 
circuitry available at the time—which was usually the most ex- 
otic and expensive. However, many supercomputers now use 
the conventional, inexpensive device technology of commodity 
microprocessors and rely on massive parallelism for their speed. 
See COMPUTER STORAGE TECHNOLOGY; CONCURRENT PROCESSING; 
INTEGRATED CIRCUITS; LOGIC CIRCUITS ; SEMICONDUCTOR MEMORIES. 

Increases in computing speed which are purely due to the ar- 
chitectural structure of a computer can largely be attributed to 
the introduction of some form of parallelism into the machine’s 
design: two or more operations which were performed one after 
the other in previous computers can now be performed simulta- 
neously. See COMPUTER SYSTEMS ARCHITECTURE. 

Pipelining is a technique which allows several operations to be 
in progress in the central processing unit at once. The first form 
of pipelining used was instruction pipelining. Since each instruc- 
tion must have the same basic sequence of steps performed, 
namely instruction fetch, instruction decode, operand fetch, and 
execution, it is feasible to construct an instruction pipeline, where 
each of these steps happens at a separate stage of the pipeline. 
The efficiency of the instruction pipeline depends on the likeli- 
hood that the program being executed allows a steady stream of 
instructions to be fetched from contiguous locations in memory. 

The central processing unit nearly always has a much faster 
cycle time than the memory. This implies that the central process- 
ing unit is capable of processing data items faster than a memory 
unit can provide them. Interleaved memory is an organization 
of memory units which at least partially relieves this problem. 

Parallelism within arithmetic and logical circuitry has been 
introduced in several ways. Adders, multipliers, and dividers 
now operate in bit-parallel mode, while the earliest machines 
performed bit-serial arithmetic. Independently operating paral- 
lel functional units within the central processing unit can each 
perform an arithmetic operation such as add, multiply, or shift. 
Array processing is a form of parallelism in which the instruction 
execution portion of a central processing unit is replicated several 
times and connected to its own memory device as well as to a 
common instruction interpretation and control unit. In this way, 
a single instruction can be executed at the same time on each 
of several execution units, each on a different set of operands. 
This kind of architecture is often referred to as single-instruction 
stream, multiple-data stream (SIMD). 

Vector processing is the term applied to a form of pipelined 
arithmetic units which are specialized for performing arithmetic 
operations on vectors, which are uniform, linear arrays of data 
values. It can be thought of as a type of SIMD processing, since a 
single instruction invokes the execution of the same operation on 
every element of the array. See COMPUTER PROGRAMMING; PRO- 
GRAMMING LANGUAGES. 

A central processing unit can contain multiple sets of the in- 
struction execution hardware for either scalar or vector instruc- 
tions. The task of scheduling instructions which can correctly 
execute in parallel with one another is generally the responsibil- 
ity of the compiler or special scheduling hardware in the central 
processing unit. Instruction-level parallelism is almost never vis- 
ible to the application programmer. 

Multiprocessing is a form of parallelism that has complete cen- 
tral processing units operating in parallel, each fetching and ex- 
ecuting instructions independently from the others. This type 
of computer organization is called multiple-instruction stream, 
multiple-data stream (MIMD). See MULTIPROCESSING. [D.W.M.] 


Superconducting devices 


Superconducting devices Devices that perform 
functions in the superconducting state that would be difficult 
or impossible to perform at room temperature, or that con- 
tain components which perform such functions. The supercon- 
ducting state involves a loss of electrical resistance and oc- 
curs in many metals and alloys at temperatures near absolute 
zero. An enormous impetus was provided by the discovery in 
1986 of a new class of ceramic, high-transition-temperature (T,) 
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superconductors, which has resulted in a new superconducting 
technology at liquid nitrogen temperature. Superconducting de- 
vices may be conveniently divided into two categories: small- 
scale thin-film devices, and large-scale devices which employ 
zero-resistance superconducting windings made of type II super- 
conducting materials. See SUPERCONDUCTIVITY. 

Small-scale devices. A variety of thin-film devices offer 
higher performance than their nonsuperconducting counter- 
parts. The prediction and discovery in the early 1960s of the 
Josephson effects introduced novel opportunities for ultrasensi- 
tive detectors, high-speed switching elements, and new physical 
standards. Niobium-based devices, patterned on silicon wafers 
using photolithographic techniques taken over from the semi- 
conductor industry, have reached a high level of development, 
and a variety of such devices are commercially available. These 
devices operate at or below 4.2 K (—452°F), the temperature 
of liquid helium boiling under atmospheric pressure. See LIQUID 
HELIUM. 

The discovery of the high-transition-temperature supercon- 
ductors has enabled the operation of devices in liquid nitrogen 
at 77 K (—321°F). Not only is liquid nitrogen much cheaper and 
more readily available than liquid helium, but it also boils away 
much more slowly, enabling the use of simpler and more com- 
pact dewars or simpler, relatively inexpensive refrigerators. Of 
the new ceramic superconductors, only YBagCu307_, (YBCO) 
has been developed in thin-film form to the point of practical 
applications, and several devices are available. Intensive mate- 
rials research has resulted in techniques, notably laser-ablation 
and radio-frequency sputtering, for the epitaxial growth of high- 
quality films with their crystalline planes parallel to the surface of 
the substrate. Most of the successful Josephson-junction devices 
have been formed at the interface between two grains of YBCO. 
These so-called grain-boundary junctions are made by deposit- 
ing the film either on a bicrystal in which the two halves of the 
substrate have a carefully engineered in-plane misalignment of 
the crystal axes, or across a step-edge patterned in the substrate. 
See CRYOGENICS; GRAIN BOUNDARIES. 

Two types of superconducting quantum interference device 
(SQUID) detect changes in magnetic flux: the dc SQUID and 
the rf SQUID. The dc SQUID, which operates with a dc bias 
current, consists of two Josephson junctions incorporated into 
a superconducting loop. The maximum dec supercurrent, known 
as the critical current, and the current-voltage (I-V) characteristic 
of the SQUID oscillate when the magnetic field applied to the 
device is changed. The oscillations are periodic in the magnetic 
flux ® threading the loop with a period of one flux quantum, 
Po = h/2e © 2.07 x 107° weber, where h is Planck’s constant 
and e is the magnitude of the charge of the electron. Thus, when 
the SQUID is biased with a constant current, the voltage is pe- 
riodic in the flux. The SQUID is almost invariably operated in 
a flux-locked loop. A change in the applied flux gives rise to a 
corresponding current in the coil that produces an equal and op- 
posite flux in the SQUID. The SQUID is thus the null detector in 
a feedback circuit, and the output voltage is linearly proportional 
to the applied flux. See JOSEPHSON EFFECT; SQUID. 

The rf SQUID consists of a single Josephson junction incor- 
porated into a superconducting loop and operates with an rf 
bias. The SQUID is coupled to the inductor of an LC-resonant 
circuit excited at its resonant frequency, typically 3B0 MHz. The 
characteristics of rf voltage across the tank circuit versus the rf 
current depends on applied flux. With proper adjustment of the 
rf current, the amplitude of the rf voltage across the tank cir- 
cuit oscillates as a function of applied flux. The rf SQUID is also 
usually operated in a feedback mode. 

SQUIDs are mostly used in conjunction with an input circuit. 
For example, magnetometers are made by connecting a super- 
conducting pickup loop to the input coil to form a flux trans- 
former. A magnetic field applied to the pickup loop induces a 
persistent current in the transformer and hence a magnetic flux 
in the SQUID. These magnetometers have found application in 
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geophysics, for example, in magnetotellurics. See GEOPHYSICAL 
EXPLORATION; MAGNETOMETER. 

Low-transition-temperature SQUIDs are widely used to mea- 
sure the magnetic susceptibility of tiny samples over a wide 
temperature range. Another application is a highly sensitive volt- 
meter, used in measurements of the Hall effect and of ther- 
moelectricity. Low-transition-temperature SQUIDs are used as 
ultrasensitive detectors of nuclear magnetic and nuclear 
quadrupole resonance, and as transducers for gravitational- 
wave antennas. So-called scanning SQUIDs are used to ob- 
tain magnetic images of objects ranging from single-flux quanta 
trapped in superconductors to subsurface damage in two metallic 
sheets riveted together. See HALL EFFECT; MAGNETIC SUSCEPTIBIL- 
ITY; NUCLEAR MAGNETIC RESONANCE (NMR); NUCLEAR QUADRUPOLE 
RESONANCE; THERMOELECTRICITY; VOLTMETER. 

Perhaps the single largest area of application is biomagnetism, 
notably to image magnetic sources in the human brain or heart. 
In these studies an array of magnetometers or gradiometers is 
placed close to the subject, both generally being in a magneti- 
cally shielded room. The fluctuating magnetic signals recorded 
by the various channels are analyzed to locate their source. These 
techniques have been used, for example, to pinpoint the ori- 
gin of focal epilepsy and to determine the function of the brain 
surrounding a tumor prior to its surgical removal. See BIOMAG- 
NETISM. 

The most sensitive detector available for millimeter and 
submillimeter electromagnetic radiation is the superconductor- 
insulator-superconductor (SIS) quasiparticle mixer. In this tun- 
nel junction, usually niobium-aluminum oxide-niobium, the 
Josephson supercurrent is quenched and only single electron 
tunneling occurs. The current-voltage characteristic exhibits a 
very sharp onset of current at a voltage 2A/e, where A is the 
superconducting energy gap. The mixer is biased near this on- 
set where the characteristics are highly nonlinear and used to 
mix the signal frequency with the frequency of a local oscilla- 
tor to produce an intermediate frequency that is coupled out 
into a low-noise amplifier. These mixers are useful at frequencies 
up to about 750 GHz (wavelengths down to 400 micrometers). 
Such receivers are of great importance in radio astronomy, no- 
tably for airborne, balloon-based, or high-altitude, ground-based 
telescopes operating above most of the atmospheric water vapor. 
See RADIO ASTRONOMY. 

The advent of high-transition-temperature superconductors 
stimulated major efforts to develop passive radio-frequency and 
microwave components that take advantage of the low elec- 
trical losses offered by these materials compared with normal 
conductors in liquid nitrogen. The implementation of thin-film 
YBCO receiver coils has improved the signal-to-noise ratio of 
nuclear magnetic resonance (NMR) spectrometers by a factor of 
3 compared to that achievable with conventional coils. This im- 
provement enables the data acquisition time to be reduced by 
an order of magnitude. These coils also have potential applica- 
tions in low-frequency magnetic resonance imaging (MRI). High- 
transition-temperature bandpass filters have application in cellu- 
lar communications. See ELECTRIC FILTER; MOBILE RADIO. —_[J.Cl.] 

Large-scale devices. Large-scale applications of supercon- 
ductivity comprise medical, energy, transportation, high-energy 
physics, and other miscellaneous applications such as high- 
gradient magnetic separation. When strong magnetic fields are 
needed, superconducting magnets offer several advantages over 
conventional copper or aluminum electromagnets. Most impor- 
tant is lower electric power costs because once the system is 
energized only the refrigeration requires power input, generally 
only 5-10% that of an equivalent-field resistive magnet. Rela- 
tively high magnetic fields achievable in unusual configurations 
and in smaller total volumes reduce the costs of expensive force- 
containment structures. See MAGNET. 

Niobium-titanium (NbTi) has been used most widely for 
large-scale applications, followed by the A15 compounds, 
which include niobium-tin (Nb3Sn), niobium-aluminum (Nb-Al), 


niobium-germanium (Nb-Ge), and vanadium-gallium (Va3Ga). 
Niobium-germanium held the record for the highest critical field 
(23 K; —418.5°F) until the announcement of high-temperature 
ceramic superconductors. See A15 PHASES. 

Significant advances have been made in high-temperature su- 
perconducting wire development. Small coils have been wound 
that operate at 20 K (—410°F). Current leads are in limited com- 
mercial use. Considerable development remains necessary to 
use these materials in very large applications. 

MRI dominates superconducting magnet systems applications. 
Most of the MRI systems are in use in hospitals and clinics, and 
incorporate superconducting magnets. See MEDICAL IMAGING. 

Some of the largest-scale superconducting magnet systems 
are those considered for energy-related applications. These in- 
clude magnetic confinement fusion, superconducting magnetic 
energy storage, magnetohydrodynamic electrical power gener- 
ation, and superconducting generators. See MAGNETOHYDRODY- 
NAMIC POWER GENERATOR; NUCLEAR FUSION. 

In superconducting magnetic energy storage superconduct- 
ing magnets are charged during off-peak hours when elec- 
tricity demand is low, and then discharged to add electric- 
ity to the grid at times of peak demand. The largest sys- 
tems would require large land areas, for example, an 1100- 
m-diameter (3600-ft) site for a 5000-MWh system. However, 
intermediate-size systems are viable. A 6-T peak-field solenoidal 
magnet system designed for the Alaskan power network stores 
1800 megajoules (0.5 MWh). High-purity-aluminum-stabilized 
niobium-titanium alloy conductor carrying 16 kiloamperes cur- 
rent is used for the magnet winding. 

Superconducting magnets have potential applications for 
transportation, such as magnetically levitated vehicles. In ad- 
dition, superconducting magnets are used in particle accelera- 
tors and particle detectors. See MAGNETIC LEVITATION; PARTICLE 
ACCELERATOR; PARTICLE DETECTOR. [A.M.Da.] 


Superconductivity A phenomenon occurring in many 
electrical conductors, in which the electrons responsible for con- 
duction undergo a collective transition into an ordered state with 
many unique and remarkable properties. These include the van- 
ishing of resistance to the flow of electric current, the appearance 
of a large diamagnetism and other unusual magnetic effects, 
substantial alteration of many thermal properties, and the oc- 
currence of quantum effects otherwise observable only at the 
atomic and subatomic level. 

Superconductivity was discovered by H. Kamerlingh Onnes in 
Leiden in 1911 while studying the temperature dependence of 
the electrical resistance of mercury within a few degrees of abso- 
lute zero. He observed that the resistance dropped sharply to an 
unmeasurably small value at a temperature of 4.2 K (—452°F). 
The temperature at which the transition occurs is called the tran- 
sition or critical temperature, T.. The vanishingly small resistance 
(very high conductivity) below T, suggested the name given the 
phenomenon. 

In 1933 W. Meissner and R. Ochsenfeld discovered that a 
metal cooled into the superconducting state in a moderate mag- 
netic field expels the field from its interior. This discovery demon- 
strated that superconductivity involves more than simply very 
high or infinite electrical conductivity, remarkable as that alone 
is. See MEISSNER EFFECT. 

In 1957, J. Bardeen, L. N. Cooper, and J. R. Schrieffer re- 
ported the first successful microscopic theory of superconductiv- 
ity. The Bardeen-Cooper-Schrieffer (BCS) theory describes how 
the electrons in a conductor form the ordered superconducting 
state. The BCS theory still stands as the basic explanation of 
superconductivity, even though extensive theoretical work has 
embellished it. 

There are a number of practical applications of superconduc- 
tivity. Powerful superconducting electromagnets guide elemen- 
tary particles in particle accelerators, and they also provide the 
magnetic field needed for magnetic resonance imaging. Ultra- 
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sensitive superconducting circuits are used in medical studies of 
the human heart and brain and for a wide variety of physical 
science experiments. A completely superconducting prototype 
computer has even been built. See MEDICAL IMAGING; PARTICLE 
ACCELERATOR; SUPERCONDUCTING DEVICES. 

Transition temperatures. It was realized from the start that 
practical applications of superconductivity could become much 
more widespread if a high-temperature superconductor, that is, 
one with a high T,, could be found. For instance, the only prac- 
tical way to cool superconductors with transition temperatures 
below 20 K (—424°F) is to use liquid helium, which boils at a 
temperature of 4.2 K (—452°F) and which is rather expensive. 
On the other hand, a superconductor with a transition temper- 
ature of 100 K (—280°F) could be cooled with liquid nitrogen, 
which boils at 77 K (—321°F) and which is roughly 500 times less 
expensive than liquid helium. Another advantage of a high-T, 
material is that, since many of the other superconducting proper- 
ties are proportional to T,, such a material would have enhanced 
properties. In 1986 the discovery of transition temperatures pos- 
sibly as high as 30 K (—406°F) was reported in a compound 
containing barium, lanthanum, copper, and oxygen. In 1987 a 
compound of yttrium, barium, copper, and oxygen was shown to 
be superconducting above 90 K (—298°F). In 1988 researchers 
showed that a bismuth, strontium, calcium, copper, and oxygen 
compound was superconducting below 110 K (—262°F), and 
transition temperatures as high as 135 K (—216°F) were found 
in a mercury, thallium, barium, calcium, copper, and oxygen 
compound. 

Occurrence. Some 29 metallic elements are known to be 
superconductors in their normal form, and another 17 become 
superconducting under pressure or when prepared in the form of 
thin films. The number of known superconducting compounds 
and alloys runs into the thousands. Superconductivity is thus a 
rather common characteristic of metallic conductors. The phe- 
nomenon also spans an extremely large temperature range. 
Rhodium is the element with the lowest transition temperature 
(370 uK), while Hgo2TlpsCazBazgCu30 is the compound with 
the highest (135 K or —216°F). 

Despite the existence of a successful microscopic theory of 
superconductivity, there are no completely reliable rules for pre- 
dicting whether a metal will be a superconductor. Certain trends 
and correlations are apparent among the known superconduc- 
tors, however—some with obvious bases in the theory—and 
these provide empirical guidelines in the search for new super- 
conductors. Superconductors with relatively high transition tem- 
peratures tend to be rather poor conductors in the normal state. 

The ordered superconducting state appears to be incompat- 
ible with any long-range-ordered magnetic state: Usually the 
ferromagnetic or antiferromagnetic metals are not superconduct- 
ing. The presence of nonmagnetic impurities in a superconduc- 
tor usually has very little effect on the superconductivity, but the 
presence of impurity atoms which have localized magnetic mo- 
ments can markedly depress the transition temperature even in 
concentrations as low as a few parts per million. See ANTIFERRO- 
MAGNETISM; FERROMAGNETISM. 

Some semiconductors with very high densities of charge 
carriers are superconducting, and others such as silicon and 
germanium have high-pressure metallic phases which are su- 
perconducting. Many elements which are not themselves super- 
conducting form compounds which are. 

Certain organic conductors are superconducting. For instance, 
brominated polymeric chains of sulfur and nitrogen, known as 
(SNBro 4),, are superconducting below 0.36 K. Other more com- 
plicated organic materials have T, values near 10 K (—442°F). 
See ORGANIC CONDUCTOR. 

Although nearly all the classes of crystal structure are repre- 
sented among superconductors, certain structures appear to be 
especially conducive to high-temperature superconductivity. The 
so-called A15 structure, shared by a series of intermetallic com- 
pounds based on niobium, produced several superconductors 
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with T, values above 15 K (—433°F) as well as the record holder, 
NbGe, at 23 K (—418°F). Indeed, the robust applications of su- 
perconductivity that depend on the ability to carry high current 
in the presence of high magnetic fields still exclusively use two 
members of this class: NbTi with T, = 8 K (—445°F), and Nb3Sn 
with T, = 18.1 K (—427°F). See A15 PHASES. 

After 1986 the focus of superconductivity research abruptly 
shifted to the copper-oxide-based planar structures, due to their 
significantly higher transition temperatures. Basically there are 
three classes of these superconductors, all of which share the 
common feature that they contain one or more conducting 
planes of copper and oxygen atoms. The first class is designated 
by the chemical formula Lag_,A,CuOu,, where the A atom can be 
barium, strontium, or calcium. Superconductivity was originally 
discovered in the barium-doped system, and systematic study of 
the substitutions of strontium, calcium, and so forth have pro- 
duced transition temperatures as high as 40 K (—388°F). 

The second class of copper-oxide superconductor is desig- 
nated by the chemical formula Y;BagCu307_;, with 5 < 1.0. 
Here, single sheets of copper and oxygen atoms straddle the 
rare-earth yttrium ion and chains of copper and oxygen atoms 
thread among the barium ions. The transition temperature, 92 K 
(—294°F), is quite insensitive to replacement of yttrium by many 
other rare-earth ions. See RARE-EARTH ELEMENTS. 

The third class is the most complicated. These com- 
pounds contain either single thallium-oxygen layers, repre- 
sented by the chemical formula Tl;Ca,_;BazCu,,O2,,3, where 
n refers to the number of copper-oxygen planes, or double 
thallium-oxygen layers, represented by the chemical formula 
Tlp,Ca,,_;BagCu,,O2,44. The number of copper-oxygen planes 
may be varied, and as many as three planes have been included 
in the structure. Thallium may be replaced by bismuth, thus gen- 
erating a second family of superconductors. In all of these com- 
pounds, the transition temperature appears to increase with the 
number of planes, but T, decreases for larger values of n. 

The spherical molecule comprising 60 carbon atoms (Ceo), 
known as a buckyball, can be alloyed with various alkaline atoms 
which contribute electrons for conduction. By varying the num- 
ber of conductors in Ceo, it is possible to boost T, to a maximum 
value of 52 K (—366°F). See FULLERENE. 

Superconductivity was discovered in magnesium diboride 
(MgB») in January 2001 in Japan. This material may be a good 
alternative for some of the applications envisioned for high-T, 
superconductivity, since this compound has T, of 39 K (—389°F), 
is relatively easy to make, and consists of only two elements. 

Magnetic properties. The existence of the Meissner- 
Ochsenfeld effect, the exclusion of a magnetic field from the 
interior of a superconductor, is direct evidence that the super- 
conducting state is not simply one of infinite electrical conduc- 
tivity. Instead, it is a true thermodynamic equilibrium state, a new 
phase which has lower free energy than the normal state at tem- 
peratures below the transition temperature and which somehow 
requires the absence of magnetic flux. 

The exclusion of magnetic flux by a superconductor costs some 
magnetic energy. So long as this cost is less than the condensa- 
tion energy gained by going from the normal to the supercon- 
ducting phase, the superconductor will remain completely su- 
perconducting in an applied magnetic field. If the applied field 
becomes too large, the cost in magnetic energy will outweigh 
the gain in condensation energy, and the superconductor will 
become partially or totally normal. The manner in which this 
occurs depends on the geometry and the material of the super- 
conductor. The geometry which produces the simplest behavior 
is that of a very long cylinder with field applied parallel to its 
axis. Two distinct types of behavior may then occur, depending 
on the type of superconductor—type I or type II. 

Below a critical field H, which increases as the temperature 
decreases below T,, the magnetic flux is excluded from a type I 
superconductor, which is said to be perfectly diamagnetic. For a 
type II superconductor, there are two critical fields, the lower crit- 
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ical field H,, and the upper critical field H,2. In applied fields less 
than H,,, the superconductor completely excludes the field, just 
as a type I superconductor does below H,. At fields just above 
H1, however, flux begins to penetrate the superconductor, not in 
a uniform way, but as individual, isolated microscopic filaments 
called fluxoids or vortices. Each fluxoid consists of a normal core 
in which the magnetic field is large, surrounded by a supercon- 
ducting region in which flows a vortex of persistent supercurrent 
which maintains the field in the core. See DIAMAGNETISM. 

Thermal properties. The appearance of the superconduct- 
ing state is accompanied by rather drastic changes in both the 
thermodynamic equilibrium and thermal transport properties of 
a superconductor. 

The heat capacity of a superconducting material is quite dif- 
ferent in the normal and superconducting states. In the nor- 
mal state (produced at temperatures below the transition tem- 
perature by applying a magnetic field greater than the critical 
field), the heat capacity is determined primarily by the normal 
electrons (with a small contribution from the thermal vibrations 
of the crystal lattice) and is nearly proportional to the tempera- 
ture. In zero applied magnetic field, there appears a discontinuity 
in the heat capacity at the transition temperature. At tempera- 
tures just below the transition temperature, the heat capacity is 
larger than in the normal state. It decreases more rapidly with de- 
creasing temperature, however, and at temperatures well below 
the transition temperature varies exponentially as e~“/"", where 
A is a constant and k is Boltzmann’s constant. Such an expo- 
nential temperature dependence is a hallmark of a system with 
a gap A in the spectrum of allowed energy states. Heat capacity 
measurements provided the first indications of such a gap in su- 
perconductors, and one of the key features of the macroscopic 
BCS theory is its prediction of just such a gap. See SPECIFIC HEAT 
OF SOLIDS. 

Ordinarily a large electrical conductivity is accompanied by 
a large thermal conductivity, as in the case of copper, used in 
electrical wiring and cooking pans. However, the thermal con- 
ductivity of a pure superconductor is less in the superconducting 
state than in the normal state, and at very low temperatures 
approaches zero. Crudely speaking, the explanation for the as- 
sociation of infinite electrical conductivity with vanishing thermal 
conductivity is that the transport of heat requires the transport of 
disorder (entropy). The superconducting state is one of perfect 
order (zero entropy), and so there is no disorder to transport 
and therefore no thermal conductivity. See ENTROPY; THERMAL 
CONDUCTION IN SOLIDS. 

Two-fluid model. C. J. Gorter and H. B. G. Casimir intro- 
duced in 1934 a phenomenological theory of superconductivity 
based on the assumption that in the superconducting state there 
are two components of the conduction electron “fluid” (hence 
the name given this theory, the two-fluid model). One, called 
the superfluid component, is an ordered condensed state with 
zero entropy; hence it is incapable of transporting heat. It does 
not interact with the background crystal lattice, its imperfections, 
or the other conduction electron component and exhibits no re- 
sistance to flow. The other component, the normal component, 
is composed of electrons which behave exactly as they do in 
the normal state. It is further assumed that the superconduct- 
ing transition is a reversible thermodynamic phase transition be- 
tween two thermodynamically stable phases, the normal state 
and the superconducting state, similar to the transition between 
the liquid and vapor phases of any substance. The validity of 
this assumption is strongly supported by the existence of the 
Meissner-Ochsenfeld effect and by other experimental evidence. 
This assumption permits the application of all the powerful and 
general machinery of the theory of equilibrium thermodynamics. 
The results tie together the observed thermodynamic properties 
of superconductors in a very satisfying way. 

Microscopic (BCS) theory. The key to the basic interaction 
between electrons which gives rise to superconductivity was pro- 
vided by the isotope effect. It is an interaction mediated by the 


background crystal lattice and can crudely be pictured as fol- 
lows: An electron tends to create a slight distortion of the elastic 
lattice as it moves, because of the Coulomb attraction between 
the negatively charged electron and the positively charged lat- 
tice. If the distortion persists for a brief time (the lattice may ring 
like a struck bell), a second passing electron will see the distor- 
tion and be affected by it. Under certain circumstances, this can 
give rise to a weak indirect attractive interaction between the 
two electrons which may more than compensate their Coulomb 
repulsion. 

The first forward step was taken by Cooper in 1956, when he 
showed that two electrons with an attractive interaction can bind 
together to form a “bound pair” (often called a Cooper pair) if 
they are in the presence of a high-density fluid of other elec- 
trons, no matter how weak the interaction is. The two partners 
of a Cooper pair have opposite momenta and spin angular mo- 
menta. Then, in 1957, Bardeen, Cooper, and Schrieffer showed 
how to construct a wave function in which all of the electrons (at 
least, all of the important ones) are paired. Once this wave func- 
tion is adjusted to minimize the free energy, it can be used as the 
basis for a complete microscopic theory of superconductivity. 

The successes of the BCS theory and its subsequent elabo- 
rations are manifold. One of its key features is the prediction 
of an energy gap. Excitations called quasiparticles (which are 
something like normal electrons) can be created out of the su- 
perconducting ground state by breaking up pairs, but only at the 
expense of a minimum energy of A per excitation; A is called 
the gap parameter. The original BCS theory predicted that A 
is related to T, by A = 1.76kT, at T = O for all superconduc- 
tors. This turns out to be nearly true, and where deviations occur 
they are understood in terms of modifications of the BCS the- 
ory. The manifestations of the energy gap in the low-temperature 
heat capacity and in electromagnetic absorption provide strong 
confirmation of the theory. [D.N.L.; R.J.So.; M.O.] 


Supercontinent The six major continents today are 
Africa, Antarctica, Australia, Eurasia, North America, and South 
America. Prior to the formation of the Atlantic, Indian, and 
Southern ocean basins over the past 180 million years by 
the process known as sea-floor spreading, the continents were 
assembled in one supercontinent called Pangea (literally “all 
Earth”). Pangea came together by the collision, about 300 mil- 
lion years ago (Ma), of two smaller masses of continental rock, 
Laurasia and Gondwanaland. Laurasia comprised the com- 
bined continents of ancient North America (known as Lauren- 
tia), Europe, and Asia. Africa, Antarctica, Australia, India, and 
South America made up Gondwanaland (this name comes from 
a region in southern India). The term “supercontinent” is also 
applied to Laurasia and Gondwanaland; hence it is used in refer- 
ring to a continental mass significantly bigger than any of today’s 
continents. A supercontinent may therefore incorporate almost 
all of the Earth’s continental rocks, as did Pangea, but that is not 
implied by the word. See CONTINENT; CONTINENTS, EVOLUTION OF. 
Laurasia, Gondwanaland, and Pangea are the earliest super- 
continental entities whose former existence can be proven. Evi- 
dence of older rifted continental margins, for example surround- 
ing Laurentia and on the Pacific margins of South America, 
Antarctica, and Australia, point to the existence of older super- 
continents. The hypothetical Rodinia (literally “the mother of 
all continents”) may have existed 800-1000 Ma, and Pannotia 
(meaning “the all-southern supercontinent”) fleetingly around 
550 Ma. Both are believed to have included most of the Earth’s 
continental material. There may have been still earlier supercon- 
tinents, because large-scale continents, at least the size of south- 
ern Africa or Western Australia, existed as early as 2500 Ma at 
the end of Archean times. See ARCHEAN; CONTINENTAL MARGIN. 
The amalgamation and fragmentation of supercontinents are 
the largest-scale manifestation of tectonic forces within the Earth. 
The cause of such events is highly controversial. See PLATE TEC- 
TONICS. [I.W.0.D.] 
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Supercritical fields Static fields that are strong enough 
to cause the normal vacuum, which is devoid of real particles, to 
break down into a new vacuum in which real particles exist. This 
phenomenon has not yet been observed for electric fields, but it 
is predicted for these fields as well as others such as gravitational 
fields and the gluon field of quantum chromodynamics. 

Vacuum decay in quantum electrodynamics. The orig- 
inal motivation for developing the new concept of a charged 
vacuum arose in the late 1960s in connection with attempts to 
understand the atomic structure of superheavy nuclei expected 
to be produced by heavy-ion linear accelerators. See PARTICLE 
ACCELERATOR. 

The best starting point for discussing this concept is to consider 
the binding energy of atomic electrons as the charge Z of a heavy 
nucleus is increased. If the nucleus is assumed to be a point 
charge, the total energy E of the 1s1/2 level drops to 0 when Z = 
137. This so-called Z = 137 catastrophe had been well known, 
but it was argued loosely that it disappears when the finite size of 
the nucleus is taken into account. However, in 1969 it was shown 
that the problem is not removed but merely postponed, and 
reappears around Z = 173. Any level E(nj) can be traced down 
to a binding energy of twice the electronic rest mass if the nuclear 
charge is further increased. At the corresponding charge number, 
called Z,,, the state dives into the negative-energy continuum of 
the Dirac equation (the so-called Dirac sea). The overcritical 
state acquires a width and is spread over the continuum. See 
ANTIMATTER; RELATIVISTIC QUANTUM THEORY. 

When Z exceeds Z,, a K-shell electron is bound by more than 
twice its rest mass, so that it becomes energetically favorable to 
create an electron-positron pair. The electron becomes bound 
in the 1s, orbital and the positron escapes. The overcritical 
vacuum state is therefore said to be charged. See POSITRON. 

Clearly, the charged vacuum is a new ground state of space 
and matter. The normal, undercritical, electrically neutral vac- 
uum is no longer stable in overcritical fields: it decays spon- 
taneously into the new stable but charged vacuum. Thus the 
standard definition of the vacuum, as a region of space without 
real particles, is no longer valid in very strong external fields. 
The vacuum is better defined as the energetically deepest and 
most stable state that a region of space can have while being 
penetrated by certain fields. 

Superheavy quasimolecules. Inasmuch as the formation 
of a superheavy atom of Z > 173 is very unlikely, a new idea 
is necessary to test these predictions experimentally. That idea, 
based on the concept of nuclear molecules, was put forward in 
1969: a superheavy quasimolecule forms temporarily during the 
slow collision of two heavy ions. It is sufficient to form the quasi- 
molecule for a very short instant of time, comparable to the time 
scale for atomic processes to evolve in a heavy atom, which is 
typically of the order 10718 to 107%. Suppose a uranium ion 
is shot at another uranium ion at an energy corresponding to 
their Coulomb barrier, and the two, moving slowly (compared 
to the K-shell electron velocity) on Rutherford hyperbolic trajec- 
tories, are close to each other (compared to the K-shell electron 
orbit radius). Then the atomic electrons move in the combined 
Coulomb potential of the two nuclei, thereby experiencing a field 
corresponding to their combined charge of 184. This happens 
because the ionic velocity (of the order of c/10) is much smaller 
than the orbital electron velocity (of the order of c), so that there 
is time for the electronic molecular orbits to be established, that 
is, to adjust to the varying distance between the charge centers, 
while the two ions are in the vicinity of each other. See QuASI- 
ATOM. 

Giant nuclear systems. The energy spectrum for positrons 
created in, for example, a uranium-curium collision consists of 
three components: the induced, the direct, and the spontaneous, 
which add up to a smooth spectrum. The presence of the spon- 
taneous component leads only to 5-10% deviations for normal 
nuclear collisions along Rutherford trajectories. This situation 
raises the question as to whether there is any way to get a clear 
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qualitative signature for spontaneous positron production. Sup- 
pose that the two colliding ions, when they come close to each 
other, stick together for a certain time At before separating again. 
The longer the sticking, the better is the static approximation. For 
At very long, a very sharp line should be observed in the positron 
spectrum with a width corresponding to the natural lifetime of 
the resonant positron-emitting state. The observation of such a 
sharp line will indicate not only the spontaneous decay of the 
vacuum but also the formation of giant nuclear systems (Z > 
180). See LINEWIDTH; NUCLEAR MOLECULE. 

Search for spontaneous positron emission. The search 
for spontaneous positron emission in heavy-ion collisions began 
in 1976. Of special interest are peak structures in the positron 
energy distribution. However, the issue of spontaneous positron 
production in strong fields remains open. If line structures have 
been observed at all, they are most likely due to nuclear conver- 
sion processes. The observation of vacuum decay very much de- 
pends on the existence of sufficiently long-lived (at least 10~?° s) 
giant nuclear molecular systems. Therefore, the investigation of 
nuclear properties of heavy nuclei encountering heavy nuclei at 
the Coulomb barrier is a primary task. 

Other field theories. The idea of overcriticality also has ap- 
plications in other field theories, such as those of pion fields, 
gluon fields (quantum chromodynamics), and gravitational fields 
(general relativity). 

A heavy meson may be modeled as an ellipsoidal bag, with 
a heavy quark Q and antiquark Q located at the foci of the el- 
lipsoid. The color-electric or glue-electric field lines do not pene- 
trate the bag surface. The Dirac equation may be solved for light 
quarks q and antiquarks gq in this field of force. In the spherical 
case the potential is zero, and the solutions with different charges 
degenerate. As the source charges Q and Q are pulled apart, 
the wave functions start to localize. At a critical deformation of 
the bag, positive and negative energy states cross; that is, overr- 
criticality is reached and the color field is strong enough that the 
so-called perturbative vacuum inside the bag rearranges so that 
the wave functions are pulled to opposite sides and the color 
charges of the heavy quarks are completely shielded. Hence two 
new mesons of types Qq and Q appear; the original meson fis- 
sions. See GLUONS; QUANTUM CHROMODYNAMICS; QUARKS. [W.Gr.] 
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Supercritical fluid Any fluid at a temperature and a 
pressure above its critical point; also, a fluid above its critical 
temperature regardless of pressure. Below the critical point the 
fluid can coexist in both gas and liquid phases, but above the crit- 
ical point there can be only one phase. Supercritical fluids are of 
interest because their properties are intermediate between those 
of gases and liquids, and are readily adjustable. See CRITICAL 
PHENOMENA; PHASE EQUILIBRIUM. 

In a given supercritical fluid the thermodynamic and transport 
properties are a function of density, which depends strongly on 
the fluid’s pressure and temperature. The density may be ad- 
justed from a gaslike value of 0.1 g/ml to a liquidlike value as 
high as 1.2 g/ml. Furthermore, as conditions approach the critical 
point, the effect of temperature and pressure on density becomes 
much more significant. Increasing the density of supercritical car- 
bon dioxide from 0.2 to 0.5 g/ml, for example, requires raising 
the pressure from 85 to 140 atm (8.6 to 14.2 megapascals) at 
158°F (70°C), but at 95°F (35°C) the required change is only 
from 65 to 80 atm (6.6 to 8.1 MPa). 

For a given fluid, the logarithm of the solubility of a solute is ap- 
proximately proportional to the solvent density at constant tem- 
perature. Therefore, a small increase in pressure, which causes a 
large increase in the density, can raise the solubility a few orders 
of magnitude. While almost all of a supercritical fluid’s proper- 
ties vary with density, some of these properties are more like 
those of a liquid while others are more like those of a gas. See 
SUPERCRITICAL-FLUID CHROMATOGRAPHY. 


In most supercritical-fluid applications the fluid’s critical tem- 
perature is less than 392°F (200°C) and its critical pressure is less 
than 80 atm (8.1 MPa). High critical temperatures require oper- 
ating temperatures that can damage the desired product, while 
high critical pressures result in excessive compression costs. In 
addition to these pure fluids, mixed solvents can be used to im- 
prove the solvent strength. [K.Jo.; R.Len.] 


Spercritical-fluid chromatography Any separa- 
tion technique in which a supercritical fluid is used as the mobile 
phase. For any fluid, a phase diagram can be constructed to 
show the regions of temperature and pressure at which gases 
and liquids, gases and solids, and liquids and solids can coexist. 
For the gas-liquid equilibrium, there is a certain temperature and 
pressure, known as the critical temperature and pressure, below 
which a gas and a liquid can coexist but above which only a 
single phase (known as a supercritical fluid) can form. 

For example, the density of supercritical fluids is usually be- 
tween 0.25 and 1.2 g/ml and is strongly pressure-dependent. 
Their solvent strength increases with density, so that molecules 
that are retained on the column can often be eluted simply by 
increasing the pressure under which the fluid is compressed. 

Diffusion coefficients of solutes in supercritical fluids are ten- 
fold greater than the corresponding values in liquid solvents 
(although about three orders of magnitude less than the cor- 
responding values in gases). The high diffusivity of solutes in 
supercritical fluids decreases their resistance to mass transfer in 
a chromatographic column and hence allows separations to be 
made either very quickly or at high resolution. Molecules can 
be separated in a few minutes or less by using packed columns 
of the type used for high-performance liquid chromatography 
(HPLC). See DIFFUSION IN GASES AND LIQUIDS. 

Despite the relatively large number of separations made 
using packed-column supercritical-fluid chromatography that 
have been described since 1962, supercritical-fluid chromatog- 
raphy became popular only in the late 1980s with the com- 
mercial introduction of capillary (open-tubular) supercritical- 
fluid chromatographs. While supercritical-fluid chromatography 
may never be as widely used as gas chromatography or high- 
performance liquid chromatography, it provides a useful com- 
plement to these chromatographies. See CHEMICAL SEPARATION 
TECHNIQUES; CHROMATOGRAPHY; FLUIDS; GAS CHROMATOGRAPHY; 
LIQUID CHROMATOGRAPHY. [PR.G.] 


Supercritical wing A wing with special streamwise sec- 
tions, or airfoils, which provide substantial delays in the onset 
of the adverse aerodynamic effects which usually occur at high 
subsonic flight speeds. 

When the speed of an aircraft approaches the speed of sound, 
the local airflow about the airplane, particularly above the up- 
per surface of the wing, may exceed the speed of sound. Such 
a condition is called supercritical flow. On previous aircraft, this 
supercritical flow resulted in the onset of a strong local shock 
wave above the upper surface of the wing (illus. a). This local 
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Comparison of airflow about airfoils. (a) Conventional air- 
foils, Mach number = 0.7. (b) Supercritical airfoils. Mach 
number = 0.8. 
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wave caused an abrupt increase in the pressure on the surface 
of the wing, which may cause the surface boundary-layer flow 
to separate from the surface, with a resulting severe increase in 
the turbulence of the flow. The increased turbulence leads to a 
severe increase in drag and loss in lift, with a resulting decrease 
in flight efficiency. The severe turbulence also caused buffet or 
shaking of the aircraft and substantially changed its stability or fly- 
ing qualities. See AERODYNAMIC FORCE; AERODYNAMIC WAVE DRAG; 
LOW-PRESSURE GAS FLOW; SHOCK WAVE; TRANSONIC FLIGHT. 
Supercritical airfoils are shaped to substantially reduce the 
strength of the shock wave and to delay the associated boundary- 
layer separation (illus. b). Since the airfoil shape allows efficient 
flight at supercritical flight speeds, a wing of such design is called 
a supercritical wing. See AIRPLANE; WING. [R.T.Wh.] 


Superfluidity The frictionless flow of liquid helium at low 
temperature; also, the flow of electric current without resistance 
in certain solids at low temperature (superconductivity). 

Both helium isotopes have a superfluid transition, but the de- 
tailed properties of their superfluid states differ considerably be- 
cause they obey different statistics. “He, with an intrinsic spin 
of 0, is subject to Bose-Einstein statistics, and “He, with a spin 
of 1/5, to Fermi-Dirac statistics. There are two distinct superfluid 
states in °He called A and B. 

The term “superfluidity” usually implies He II or the A and B 
phases of ?He, but the basic similarity between these and the 
“fluid” consisting of pairs of electrons in superconductors is suf- 
ficiently strong to designate the latter as a charged superfluid. 
Besides flow without resistance, superfluid helium and super- 
conducting electrons display quantized circulating flow patterns 
in the form of microscopic vortices. See BOSE-EINSTEIN STATISTICS; 
LIQUID HELIUM; QUANTIZED VORTICES; SECOND SOUND; SUPERCON- 
DUCTIVITY. [LJ.C.] 


Supergiant star A member of a class of evolved stars 
that occupy the top of the Hertzsprung-Russell diagram to the 
right of the main sequence. The absolute visual magnitudes (My) 
of supergiants range approximately between —4 and —10, and 
they are the largest and brightest stars. They are recognized by 
their spectroscopic characteristics. For example, those in class 
A have narrow hydrogen lines. Supergiants are subdivided into 
classes Ib (My about —5) and Ia (My about —7). A “hypergiant 
class zero” was later added near My = —10; the use of transition 
class Ia-0 at absolute visual magnitudes near —9 is now common. 
Red supergiants are the largest of all stars, and at maximum can 
reach diameters approaching that of the orbit of Saturn. 

In an evolutionary sense, supergiants are stars above about 10 
solar masses and absolute visual magnitude —6 that began main- 
sequence life hotter than class B1 and cannot evolve to become 
white dwarfs. As progeny of O stars, supergiants are exceedingly 
rare. Supergiant masses allow further burning of carbon and 
oxygen to oxygen, neon, and magnesium; thence to silicon and 
sulfur; and thence to iron. The iron cores collapse to produce type 
II supernovae, the condensed remnants at the centers becoming 
either neutron stars or, from the most massive stars, black holes. 
See ASTRONOMICAL SPECTROSCOPY; BLACK HOLE; HERTZSPRUNG- 
RUSSELL DIAGRAM; NEUTRON STAR; SPECTRAL TYPE; STAR; STELLAR 
EVOLUTION; SUPERNOVA. [J.B.Ka.] 


Supergranulation A system of convective cells, with 
typical diameters of 20,000 km (12,000 mi), that cover the 
Sun’s surface. Solar convection cells are invisible in ordinary 
photographs. 

High-resolution photographs of the Sun’s visible surface re- 
veal the granulation, a closely packed cellular grid having bright 
(hot) centers surrounded by dark (cool) lanes. Granules have 
lifetimes of 10-30 minutes and average diameters of 1000 km 
(620 mi). In the 1930s, L. Biermann suggested that, in the outer 
30% of the Sun, heat from the Sun’s interior is transported to 
the surface by convection, just like hot rising bubbles in a pot 
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of boiling soup heated from below. The granules are the boil- 
ing bubbles (plumes) of this convective process. Theorists in the 
1950s proposed that the bubble sizes are approximately equal to 
the local scale height H (the distance in which density or pressure 
changes by a factor e ~ 2.7). Since H varies from about 1000 km 
(620 mi) at the Sun’s surface to 100,000 km (62,000 mi) at the 
base of the convection zone, a very large range of plume sizes 
was hypothesized. See CONVECTION (HEAT). 

In 1959, R. Leighton modified the spectroheliograph to image 
the Sun using the Doppler and Zeeman effects. Images of line-of- 
sight (approaching and receding) gas motions (Dopplergrams) 
and of magnetic fields (magnetograms) could be obtained. The 
Dopplergrams showed a new cell structure with area 400 times 
that of the granulation, having a mainly horizontal flow pattern. 
Lifetimes of most supergranules are 1-2 days, but observations 
in 1998 with the Michelson Doppler Imager (MDI) on the Solar 
and Heliospheric Observatory (SOHO) spacecraft showed that 
some live longer than 4 days. The cells form an irregular polyg- 
onal structure, with most of the down-flow occurring at the poly- 
gon vertices. The supergranules fit neatly within an essentially 
identical network (grid) structure seen in magnetograms. The ki- 
netic energy of the supergranular motions at the Sun’s surface 
exceeds the magnetic field’s ability to resist such motions, and 
thus the magnetic field is dragged forcibly to the supergranule 
boundaries until the two patterns coincide. This magnetic field, 
in turn, causes local heating of the upper solar atmosphere (chro- 
mosphere), producing a similar chromospheric network pattern 
seen in high-temperature spectral lines. See DOPPLER EFFECT; 
FRAUNHOFER LINES; SOLAR MAGNETIC FIELD; SPECTROGRAPH; SPEC- 
TROHELIOGRAPH; SUN; ZEEMAN EFFECT. [G.WSi.] 


Supergravity 


Supergravity A theory that attempts to unify gravitation 
with the other fundamental interactions. The first, and only, com- 
pletely successful unified theory was constructed by James Clerk 
Maxwell, in which the up-to-then unrelated electric and magnetic 
phenomena were unified in his electrodynamics. See FUNDAMEN- 
TAL INTERACTIONS; MAXWELL’S EQUATIONS. 

Electroweak theory. The second stage of unification con- 
cerns the unification of electromagnetic and weak interactions, 
using Maxwell’s theory as a guide. This was accomplished mak- 
ing use of the nonabelian gauge theories invented by C. N. 
Yang and R. L. Mills, and of spontaneous symmetry break- 
ing. The symmetry of Maxwell’s theory is very similar to spa- 
tial rotations about an axis, rotating the vector potentials while 
leaving the electric and magnetic fields unchanged. It is a lo- 
cal invariance because the rotations about a fixed axis can 
be made by different amounts at different points in space- 
time. Thus, Maxwell’s theory is invariant under a one-parameter 
group of transformations U(1). In Yang-Mills theory this lo- 
cal invariance was generalized to theories with larger sym- 
metry groups such as the three-dimensional rotation group 
SO(3) ~ SU(2) which has three parameters. The number of 
parameters of the local symmetry (gauge) group is also equal 
to the number of 4-vector potentials in the gauge theory based 
on that group. A detailed analysis of weak and electromagnetic 
forces shows that their description requires four 4-vector po- 
tentials (gauge fields), so that the gauge group must be a four- 
parameter group. In fact, it is the product SU(2) - U(1). See ELEc- 
TROWEAK INTERACTION; GAUGE THEORY; GROUP THEORY; SYMMETRY 
BREAKING. 

Grand unified theories. In the third stage of unification, 
electroweak and strong forces are regarded as different com- 
ponents of a more general force which mediates the interac- 
tions of particles in a grand unified model. Strong forces are re- 
sponsible for the interactions of hadrons and for keeping quarks 
confined inside hadrons. They are described by eight massless 
4-vector potentials (gluons), the corresponding eight-parameter 
group being SU(3). This local symmetry is called color, and 
the corresponding theory quantum chromodynamics (QCD). 
The combination SU(3) - SU(2) - SU(1) has strong experimental 
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support, and has come to be known as the standard model. Thus 
the gauge group of any grand unified model must include the 
standard model as a subsymmetry. The most dramatic prediction 
of these theories is the decay of protons. See GLUONS; GRAND UNI- 
FICATION THEORIES; PROTON; QUANTUM CHROMODYNAMICS; QUARKS; 
STANDARD MODEL. 

Supersymmetry and supergravity theories. A still higher 
and more ambitious stage of unification deals with the possibility 
of combining grand unified and gravity theories into a superuni- 
fied theory, also known as supergravity. To achieve this, use is 
made of the dual role played by local internal symmetry groups. 
On the one hand, they describe the behavior of forces. On the 
other hand, they classify the elementary particles (fields) of the 
theory into multiplets: spin-zero fields in one multiplet, spin-1/2 
fields in another multiplet, and so forth, but never fermions and 
bosons in a single irreducible multiplet of internal symmetry. 
This last restriction used to be a major obstacle on the way to 
superunification. This is because, of all the elementary particles, 
only the quanta of the gravitational field (gravitons) have spin 
2, so that a multiplet of elementary particles including the gravi- 
ton must of necessity involve particles of different spin. But then 
by an internal symmetry transformation, which is by definition 
distinct from space-time (Lorentz) transformations, it is possi- 
ble to “rotate” particles of different spin into one another, thus 
altering their space-time transformation properties. This appar- 
ent paradox can be circumvented if both the internal symmetry 
and Lorentz transformations are part of a larger (supersymmet- 
ric) transformation group which also includes the spin-changing 
transformations. The irreducible multiplets of such supergroups 
naturally contain both fermions and bosons. This is how super- 
symmetry makes its appearance in supergravity theories. See 
GRAVITON; LORENTZ TRANSFORMATIONS; RELATIVITY; SUPERSYMME- 
TRY; SYMMETRY LAWS (PHYSICS). 

Effective theory. If supergravity models are regarded not as 
fundamental theories but as effective theories describing the low- 
energy behavior of superstring theories, it is possible to make a 
strong case for their usefulness. In that case, since supergravity 
is no longer a fundamental theory, it is no longer crucial that 
supergravity satisfy very stringent physical requirements such as 
renormalizability. In its role as an effective theory, supergrav- 
ity has been used in a number of problems in particle physics. 
See ELEMENTARY PARTICLE; QUANTUM FIELD THEORY; SUPERSTRING 
THEORY. [EM.] 


Superheater 


Superheater A component of a steam-generating unit in 
which steam, after it has left the boiler drum, is heated above its 
saturation temperature. The amount of superheat added to the 
steam is influenced by the location, arrangement, and amount 
of superheater surface installed, as well as the rating of the boiler. 
The superheater may consist of one or more stages of tube banks 
arranged to effectively transfer heat from the products of com- 
bustion. See STEAM-GENERATING UNIT. [G.W.K.] 


Superluminal motion Proper motion of an astronomi- 
cal object apparently exceeding the velocity of light, c. This phe- 
nomenon is relatively common in the nuclei of quasars, many 
of which exhibit systematic changes in images of their radio- 
frequency emission over periods of months to years. In some 
cases, features in the image appear to separate at a speed in- 
ferred to be more than 10 times the speed of light, given the 
great distance of the quasars from Earth. 

Superluminal motion was one of the most exciting discoveries 
to emerge from a technique in radio astronomy first developed 
in the late 1960s and called very long baseline interferometry 
(VLBI). This method involves the tape recording of radio signals 
from large antennas at up to 10-15 locations across the Earth, 
and the combination of these signals in a computer to form 
a radio image of the quasar at extremely high resolution (less 
than 0.001 arcsecond). See QUASAR; RADIO ASTRONOMY; RADIO 
TELESCOPE. 


Superluminal motion is seen mostly in quasars but also in 
some other active galactic nuclei. This rapid motion is confined to 
within a few tens of parsecs of the nucleus, whose power source 
is believed to be a massive black hole. At least 30 examples 
of superluminal motion are now known. Most show apparent 
speeds less than 10c, but examples of speeds above 20c have 
been found. A very few objects in the Milky Way Galaxy also 
show superluminal motion. An example is GRS 1915 + 105, a 
relativistic jet source, which emits strongly in the x-ray as well as 
in the radio spectrum. Because the object is within the Milky Way 
Galaxy, its apparent speed of 1.25c is detectable in less than a 
day. See GALAXY, EXTERNAL; X-RAY ASTRONOMY. 

Announcement of the discovery of superluminal motion in 
1972 caused widespread concern because of the apparent vio- 
lation of Albert Einstein’sspecial theory of relativity, even though 
the basic explanation still favored now was in fact predicted some 
years before the announcement. Many explanations were pro- 
posed (besides Einstein’s theory being incorrect), but only the 
relativistic jet model has stood the test of time. Superluminal 
motion is explained in this model as primarily a geometric ef- 
fect. A feature (perhaps a cloud of relativistic plasma) moves 
away from the nucleus of the quasar at high (relativistic) speed 
(but less than c) at a small angle to the line of sight to the Earth. 
Radio waves from the moving feature arrive only slightly later 
than waves from the nucleus, whereas the feature took a much 
longer time to reach its current position. The motion appears 
superluminal because the speed is calculated using this much 
shorter time interval. As the speed approaches c and the angle 
to the line of sight decreases, the apparent speed can be arbi- 
trarily large. In this explanation, no material speeds faster than 
c are required, so there is no conflict with special relativity. See 
RELATIVITY. [S.C.U.] 


Supermassive stars Hypothetical objects with masses 
exceeding 60 solar masses, the mass of the largest known ordi- 
nary stars (1 solar mass equals 4.4 x 10°° Ibm or 2 x 10° kg). 
The term is most often used in connection with objects larger than 
10* solar masses that might be the energy source in quasars and 
active galaxies. These objects probably do not exist. However, 
their nonexistence is one of the major assumptions that makes 
the case for giant black holes, rather than supermassive stars, 
being the central engines of quasars. See BLACK HOLE; QUASAR; 
STAR. [H.L.Sh.] 


Supermultiplet A generalization of the concept of a mul- 
tiplet. A multiplet is a set of quantum-mechanical states, each of 
which has the same value of some fundamental quantum num- 
ber and differs from the other members of the set by another 
quantum number which takes values from a range of numbers 
dictated by the fundamental quantum number. The number of 
states in the set is called the multiplicity or dimension of the mul- 
tiplet. The concept was originally introduced to describe the set 
of states in a nonrelativistic quantum-mechanical system with 
the same value of the orbital angular momentum, L, and dif- 
ferent values of the projection of the angular momentum on an 
axis, M. The values that M can take are the integers between 
—L and L, 2L + 1 in all. This is the dimension of the multiplet. 
If the hamiltonian operator describing the system is rotationally 
invariant, all states of the multiplet have the same energy. A su- 
permultiplet is a generalization of the concept of multiplet to the 
case when there are several quantum numbers that describe the 
quantum-mechanical states. See ANGULAR MOMENTUM; SYMMETRY 
LAWS (PHYSICS). 

Both concepts, multiplet and supermultiplet, acquire a precise 
mathematical meaning by the use of the theory of group transfor- 
mations. A multiplet is an irreducible representation of a group, 
G. The quantum number called fundamental in the paragraph 
above labels the representation of the group. The other quan- 
tum number labels the representation of a subgroup G’ of G. For 
angular momentum, the group G is the rotation group, called 


special orthogonal group in three dimensions, SO(3), and its 
subgroup G’ is the group of special orthogonal transformations 
in two dimensions, SO(2). A supermultiplet is a generalization to 
the case in which the group G is not a group of rank one but has 
larger rank. A group of rank one has only one quantum number 
to label its representations. The concept of a multiplet or su- 
permultiplet is particularly useful in the classification of states of 
physical systems. See GROUP THEORY; NONRELATIVISTIC QUANTUM 
THEORY; QUANTUM MECHANICS; QUANTUM NUMBERS. 

The term supermultiplet was first used by E. P Wigner in 1932 
in order to classify the quantum-mechanical states of light atomic 
nuclei. The constituents of these are protons, p, and neutrons, n. 
Each proton and neutron has an intrinsic spin, S, of 1/2 in units 
of h, which is Planck’s constant divided by 27. The projection 
of the intrinsic spin on an axis, S,, is then S, = 1/2 or —1/o (spin 
up or down). In addition to having the same spin, the proton 
and neutron have essentially the same mass but differ in that 
the proton is charged whereas the neutron is not. They can thus 
be regarded as different charge states of the same particle, a 
nucleon. The distinction can be made formal by introducing a 
quantum number called isotopic spin, T, which has the value 
1/2. The two charge orientations, T,, are taken to be 1/2 for the 
proton and —}/p for the neutron. There are thus four constituents 
of nuclei, protons and neutrons with spin up and down, that 
is, p t, p 4, n t, and n J. The set of transformations among 
these constituents forms a group called SU(4), the special unitary 
group in four dimensions. This is the group G for Wigner’ stheory. 
The representations of SU(4), that is, Wigner supermultiplets, are 
characterized by three quantum numbers (Aj, A2, A3) with Ay > 
Ag = A3. See I-SPIN; NUCLEAR STRUCTURE. (FI.] 


Supernova The catastrophic, explosive death of a star, ac- 
companied by the sudden, transient brightening of the star to an 
optical luminosity comparable to that of an entire galaxy. 

A supernova shines typically for several weeks to several 
months with a luminosity between 2 x 10° and 5 x 10? times 
that of the Sun, then gradually fades away. Each explosion ejects 
from one to several tens of solar masses at speeds ranging from 
thousands to tens of thousands of kilometers per second. The 
total kinetic energy, 10% joules (2.5 x 1078 megatons of high 
explosive), is about 100 times the total light output, making su- 
pernovae some of the highest-energy explosions in the universe. 
Unlike its fainter relative, the nova, a supernova does not recur 
for the same object. See Nova. 

Supernovas may be grouped according to either their obser- 
vational characteristics or their explosion mechanism. Basically, 
type I supernovae have no hydrogen in their spectrum; type II 
supernovae do. Two mechanisms are involved: thermonuclear 
explosion in white dwarfs and gravitational collapse in massive 
stars. Type I supernovae of different subclasses can occur by ei- 
ther mechanism, but it is thought that most type II supernovae 
are powered by gravitational collapse. 

During the last thousand years, there have been approxi- 
mately seven supernovae visible to the unaided eye, in 1006, 
1054, 1181, 1408, 1572, 1604, and 1987. SN 1006 may have 
been as bright as the quarter moon. The first six of these oc- 
curred in the Earth’s vicinity of the Milky Way Galaxy. But the 
last, and only, naked-eye supernova since the invention of mod- 
ern instrumentation occurred in the Large Magellanic Cloud, a 
small satellite galaxy of the Milky Way about 160,000 light-years 
away. Supernovae are discovered in other galaxies at a rate of 
about 150 per year. Most supernovae in the Milky Way Galaxy 
are obscured by dust, but various arguments suggest that about 
two type II supernovae per century and one type Ia every other 
century occur in the Milky Way Galaxy. See MAGELLANIC CLOUDS; 
MILKy Way GALAXY. 

Type la supernovae. Type Ia supernovae may be regarded 
as nature’s largest thermonuclear bombs. They occur when an 
accreting white dwarf, composed of carbon and oxygen, grows 
to amass 1.38 times that of the Sun, almost the critical mass that 
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can be supported by electron degeneracy pressure, and ignites 
carbon fusion near its center. Ignition occurs when carbon fu- 
sion at the center releases energy faster than neutrinos can carry 
it away. Because the pressure is insensitive to the temperature, 
a nuclear runaway occurs. Fusion releases energy, which raises 
the temperature, which makes fusion go faster, but the gas can- 
not expand and cool. The nuclear runaway spreads in about 
1 second through the star. The energy released by this nuclear 
burning is more than enough to completely blow the white dwarf 
apart with high velocity. Nothing remains—no neutron star, no 
black hole, and no burst of neutrino emission. See BINARY STAR; 
THERMONUCLEAR REACTION. 

Type Il supernovae. A typical type II supernova results from 
a star somewhat over 8 solar masses, on the main sequence, that 
spends its last years as a red supergiant burning progressively 
heavier fuels in its center. The radius of the star, after hydrogen 
has burned and the star is part way through helium burning, 
is roughly 500 solar radii, and its luminosity is already about 
100,000 times that of the Sun. Each burning stage is shorter 
than the previous one. The last stage turns silicon and sulfur into 
a ball of roughly 1.4 solar masses of iron. Once iron has been 
produced, no more nuclear energy is available. See SUPERGIANT 
STAR. 

A combination of instabilities now leads to the implosion of 
the iron core to a neutron star. When the density at the center 
reaches several times that of the atomic nucleus, the collapse 
halts and briefly springs back owing to the short-range repulsive 
component of the nuclear force. But the energy of this bounce 
is soon dissipated, and a hot young neutron star remains which, 
over the next few seconds, radiates away its heat and binding 
energy as neutrinos. See NEUTRINO; STRONG NUCLEAR INTERAC- 
TIONS. 

The energy output in these neutrinos is enormous, about 3 x 
10*° joules or 15% of the rest mass of the Sun converted to 
energy; rivaling the luminosity of the rest of the observable uni- 
verse in light. A small fraction of these neutrinos, about 0.3%, are 
absorbed in reactions with neutrons and protons in the regions 
just outside the neutron star and deposit their energy. Even this 
small amount of energy is much greater than the gravitational 
binding of the remaining part of the star external to the newly 
formed neutron star. A bubble of radiation is inflated by the neu- 
trino energy deposition, the outer boundary of which expands 
supersonically, driving a shock wave through the rest of the star 
and ejecting it with high velocity. The main energy of the explo- 
sion, though, is carried away as neutrinos. This general picture 
was confirmed when a neutrino burst of the predicted energy 
and duration was detected February 23, 1987, from the Large 
Magellanic Cloud in conjunction with SN 1987A. See NEUTRINO 
ASTRONOMY; SHOCK WAVE. 

Nucleosynthesis. Supernovae are major element factories, 
responsible for producing most of the elements in nature heavier 
than nitrogen. The largest yields are of the more abundant ele- 
ments, including oxygen, silicon, magnesium, neon, iron, anda 
portion of carbon, but dozens of other elements are also made. 
See NUCLEOSYNTHESIS. 

Type la cosmological applications. Because of their 
brightness and the regularity of their light curves, type Ia su- 
pernovae have long been used as standard candles to survey 
cosmological distances. More recently it has been realized that 
the relatively small variation that occurs in the peak brilliance of 
such supernovae may be correlated with their decline rates. Use 
of this so-called Phillips relation allows even greater precision in 
distance determination. Using type Ia supernovae in this fashion 
reveals a surprising result. Two independent analyses show that 
the expansion rate of the universe is not slowing as might be ex- 
pected long after the big bang, but is actually accelerating. The 
pull of gravity can only cause deceleration, so the acceleration is 
attributed to an invisible form of dark energy that enters into the 
cosmological equations as a repulsive term. See ASTROPHYSICS, 
HIGH-ENERGY; COSMOLOGY; STAR; STELLAR EVOLUTION; UNIVERSE; 
VARIABLE STAR. [S.E.Wo.] 


Supernova 
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Superoxide chemistry A branch of chemistry that 
deals with the reactivity of the superoxide ion (Og), a one- 
electron (e7) adduct of molecular oxygen (dioxygen; Oz) formed 
by the combination of Oz and e~. Because 1-15% of the O; that 
is respired by mammals goes through the O, oxidation state, 
the biochemistry and reaction chemistry of the species are im- 
portant to those concerned with oxygen toxicity, carcinogenesis, 
and aging. Although the name superoxide has prompted many 
to assume an exceptional degree of reactivity for O.”, the use 
of the prefix in fact was chosen to indicate stoichiometry. Su- 
peroxide was the name given in 1934 to the newly synthesized 
potassium salt (KO2) to differentiate its two-oxygens-per-metal 
stoichiometry from that of most other metal-oxygen compounds 
(NA2O, NagOz, NaOH, Fe203). 

Ionic salts of superoxide (yellow-to-orange solids), which form 
from the reaction of dioxygen with metals such as potassium, ru- 
bidium, or cesium, are paramagnetic, with one unpaired electron 
per two oxygen atoms. 

In 1969, by means of electron spin resonance (ESR) spec- 
troscopy, superoxide ion was detected as a respiratory interme- 
diate, and metalloproteins were discovered that catalyze the dis- 
proportionation of superoxide, that is, superoxide dismutases 
(SODs), as shown in the reaction below. 


Superoxide chemistry 


2027 +2H+ =, O; + H202 

The biological function of superoxide dismutases is believed 
to be the protection of living cells against the toxic effects of su- 
peroxide. The possibility that superoxide might be an important 
intermediate in aerobic life provided an impetus to the study of 
superoxide reactivity. 

The most general and universal property of O27 is its tendency 
to act as a strong Br@nsted base. Its strong proton affinity mani- 
fests itself in any media. Another characteristic of Og" is its ability 
to act as a moderate one-electron reducing agent. See ACID AND 
BASE; OXIDATION-REDUCTION. 

In general, superoxide ion chemistry does not appear to be 
sufficiently robust to make superoxide ion a toxin. However, it 
can interact with protons, halogenated carbons, and carbonyl 
compounds to yield peroxy radicals that are toxic. See OXYGEN 
TOXICITY; REACTIVE INTERMEDIATES. 

Superoxide does not appear to have exceptional reactiv- 
ity. Nevertheless, superoxide will continue to be an interesting 
species for study because of the multiplicity of its chemical reac- 
tions and because of its importance as an intermediate in reac- 
tions that involve dioxygen and hydrogen peroxide. See BIOINOR- 
GANIC CHEMISTRY; OXYGEN. [D.TS.] 


Superplastic forming A process for shaping super- 
plastic materials, a unique class of crystalline materials that ex- 
hibit exceptionally high tensile ductility. Superplastic materials 
may be stretched in tension to elongations typically in excess of 
200% and more commonly in the range of 400-2000%. There 
are rare reports of higher tensile elongations reaching as much as 
8000%. The high ductility is obtained only for superplastic mate- 
rials and requires both the temperature and rate of deformation 
(strain rate) to be within a limited range. The temperature and 
strain rate required depend on the specific material. A variety of 
forming processes can be used to shape these materials; most of 
the processes involve the use of gas pressure to induce the de- 
formation under isothermal conditions at the suitable elevated 
temperature. The tools and dies used, as well as the superplas- 
tic material, are usually heated to the forming temperature. The 
forming capability and complexity of configurations producible 
by the processing methods of superplastic forming greatly ex- 
ceed those possible with conventional sheet forming methods, 
in which the materials typically exhibit 10-50% tensile elonga- 
tion. See SUPERPLASTICITY. 

There are a number of commercial applications of super- 
plastic forming and combined superplastic forming and diffusion 


bonding, including aerospace, architectural, and ground trans- 
portation uses. Examples are wing access panels in the Airbus 
A310 and A320, bathroom sinks in the Boeing 737, turbo-fan- 
engine cooling-duct components, external window frames in the 
space shuttle, front covers of slot machines, and architectural sid- 
ing for buildings. See METAL FORMING. [C.H.Ha.] 


Superplasticity The unusual ability of some metals and 
alloys to elongate uniformly thousands of percent at elevated 
temperatures, much like hot polymers or glasses. Under normal 
creep conditions, conventional alloys do not stretch uniformly, 
but form a necked-down region and then fracture after elonga- 
tions of only 100% or less. The most important requirements 
for obtaining superplastic behavior include a very small metal 
grain size, a well-rounded (equiaxed) grain shape, a deforma- 
tion temperature greater than one-half the melting point, and 
a slow deformation rate. See ALLOY; CREEP (MATERIALS); EUTEC- 
TICS. 

Superplasticity is important to technology primarily because 
large amounts of deformation can be produced under low 
loads. Thus, conventional metal-shaping processes (for example, 
rolling, forging, and extrusion) can be conducted with smaller, 
and cheaper equipment. Nonconventional forming methods can 
also be used; for instance, vacuum-forming techniques, bor- 
rowed from the plastics industry, have been applied to sheet 
metal to form car panels, refrigerator door linings, and TV chas- 
sis parts. See METAL FORMING; PLASTICITY. [E.E.U.] 


Superposition principle The principle, obeyed by 
many equations describing physical phenomena, that a linear 
combination of the solutions of the equation is also a solution. 

An effect is proportional to a cause in a variety of phenomena 
encountered at the level of fundamental physical laws as well 
as in practical applications. When this is true, equations which 
describe such a phenomenon are known as linear, and their 
solutions obey the superposition principle. Thus, when f, g, h, - --, 
solve the linear equation, then s (s = af + Bg+ yh+---,where 
a, B, y,--+, are coefficients) also satisfies the same equation. See 
LINEAR ALGEBRA; LINEARITY. 

For example, an electric field is proportional to the charge 
that generates it. Consequently, an electric force caused by a 
collection of charges is given by a superposition—a vector sum— 
of the forces caused by the individual charges. The same is true 
for the magnetic field and its cause—electric currents. Each of 
these facts is connected with the linearity of Maxwell’s equations, 
which describe electricity and magnetism. See ELECTRIC FIELD; 
MAXWELL’S EQUATIONS. 

The superposition principle is important both because it sim- 
plifies finding solutions to complicated linear problems (they can 
be decomposed into sums of solutions of simpler problems) and 
because many of the fundamental laws of physics are linear. 
Quantum mechanics is an especially important example of a 
fundamental theory in which the superposition principle is valid 
and of profound significance. This property has proved most 
useful in studying implications of quantum theory, but it is also a 
source of the key conundrum associated with its interpretation. 

Its effects are best illustrated in the double-slit superposition 
experiment, in which the wave function representing a quan- 
tum object such as a photon or electron can propagate toward 
a detector plate through two openings (slits). As a consequence 
of the superposition principle, the wave will be a sum of two 
wave functions, each radiating from its respective slit. These two 
waves interfere with each other, creating a pattern of peaks and 
troughs, as would the waves propagating on the surface of wa- 
ter in an analogous experimental setting. However, while this 
pattern can be easily understood for the normal (for example, 
water or sound) waves resulting from the collective motion of vast 
numbers of atoms, it is harder to understand its origin in quan- 
tum mechanics, where the wave describes an individual quan- 
tum, which can be detected, as a single particle, in just one spot 


along the detector (for example, photographic) plate. The inter- 
ference pattern will eventually emerge as a result of many such 
individual quanta, each of which apparently exhibits both wave 
(interference-pattern) and particle (one-by-one detection) char- 
acteristics. This ambivalent nature of quantum phenomena is 
known as the wave-particle duality. See INTERFERENCE OF WAVES; 
QUANTUM MECHANICS. [W.H.Z.] 


Superposition theorem (electric networks) 
Essentially, that it is permissible, if there are two or more sources 
of electromotive force in a linear electrical network, to compute 
at any element of the network the response of voltage or of cur- 
rent that results from one source alone, and then the response 
resulting from another source alone, and so on for all sources, 
and finally to compute the total response to all sources acting 
together by adding these individual responses. 

Thus, if a load of constant resistance is supplied with electrical 
energy from a linear network containing two batteries, two gen- 
erators, or one battery and one generator, it would be correct 
to find the current that would be supplied to the load by one 
source (the other being reduced to zero), then to find the current 
that would be supplied to the load by the second source (the 
first source now being reduced to zero), and finally to add the 
two currents so computed to find the total current that would be 
produced in the load by the two sources acting simultaneously. 

By means of the principle of superposition, effects are added 
instead of causes. This principle seems so intuitively valid that 
there is far greater danger of applying superposition where it is 
incorrect than of failing to apply it where it is correct. It must 
be recognized that for superposition to be correct the relation 
between cause and effect must be linear. [H.H.Sk.] 


Supersaturation A solution is at the saturation point 
when dissolved solute in its crystallizes from it at the same rate 
at which it dissolves. Under prescribed experimental conditions 
of temperature and pressure, a solution can contain at satura- 
tion only one fixed amount of dissolved solute. However, it is 
possible to prepare relatively stable solutions which contain a 
quantity of a dissolved solute greater than that of the saturation 
value provided solute phase is absent. Such solutions are said 
to be supersaturated. They can be prepared by changing the 
experimental conditions of a system so that greater solubility is 
obtained, perhaps by heating the solution, and then carefully 
returning the system to or near its original state. The addition of 
solute phase will immediately relieve supersaturation. Solutions 
in which there is no spontaneous formation of solute phase for 
extended periods of time are said to be metastable. There is 
no sharp line of demarcation between an unstable and 
metastable solution. The process whereby initial aggregates 
within a supersaturated solution develop spontaneously into par- 
ticles of new stable phase is known as nucleation. The greater the 
degree of supersaturation, the greater will be the number of nu- 
clei formed. See NUCLEATION; PHASE EQUILIBRIUM; PRECIPITATION 
(CHEMISTRY). [L.Go/R.W.Mu.] 


Supersonic diffuser A passive compressor (or shaped 
duct) in which gas enters at a velocity greater than the speed of 
sound, is decelerated in a contracting section, and reaches sonic 
speed at a throat. 

Supersonic compression systems can be categorized to three 
basic types (see illustration). External-compression inlets have 
the supersonic diffusion taking place at or ahead of the cowl lip 
(or throat station) and generally employ one or more oblique 
waves ahead of the normal shock. Internal-compression inlets 
accomplish supersonic diffusion internally downstream of the 
cowl lip. Deceleration of the flow is produced by a number of 
weak reflecting waves in a gradually convergent channel. The 
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Basic supersonic compression systems. (a) External com- 
pression. (b) Internal compression. (c) Combination of ex- 
ternal and internal compression. 


third system is a combination of external and internal com- 
pression and appears to represent an effective compromise. 
[J.EC.; L.J.O.] 


Supersonic flight Relative motion of a solid body and 
a gas at a velocity greater than that of sound propagation under 
the same conditions. The general characteristics of supersonic 
flight can be understood by considering the laws of propagation 
of a disturbance or pressure impulse, in a compressible fluid. 

If the fluid is at rest, the pressure impulse propagates uniformly 
with the velocity of sound in all directions, the effect always 
acting along an ever-increasing spherical surface. If, however, 
the source of the impulse is placed in a uniform stream, the 
impulse will be carried by the stream simultaneously with its 
propagation at sonic velocity relative to the stream. Hence the 
resulting propagation is faster in the direction of the stream and 
slower against the stream. If the velocity of the stream past the 
source of disturbance is supersonic, the effect of the impulse is 
restricted to a cone whose vertex is the source of the impulse and 
whose vertex angle decreases from 90° (corresponding to Mach 
number equal to 1) to smaller and smaller values as the Mach 
number of the stream increases (see illustration). If the source of 
the pressure impulse travels through the air at rest, the conditions 
are analogous. See MACH NUMBER. 

Consider the supersonic motion of a wing moving into air at 
rest. Because signals cannot propagate ahead of the wing, the 
presence of the wing has no effect on the undisturbed air until the 
wing passes through it. Hence there must be an abrupt change 
in the properties of the undisturbed air as it begins to flow over 
the wing. This abrupt change takes place in a shock wave which 
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Generation of Mach wave by body at supersonic velocity; 
zones of action and silence are separated. 
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is attached to the leading edge of the wing, provided that the 
leading edge is sharp and the flight Mach number is sufficiently 
large. As the air passes through the shock wave, its pressure, 
temperature, and density are markedly increased. 

Further aft of the leading edge, the pressure of the air is de- 
creased as the air expands over the surface of the wing. Hence 
the pressure acting on the front part of the wing is higher than 
the ambient pressure, and the pressure acting on the rear part 
of the wing is lower than the ambient pressure. The pressure dif- 
ference between front and rear parts produces a drag, even in the 
absence of skin friction and flow separation. The wing produces 
a system of compression and expansion waves which move with 
it. This phenomenon is similar to that of a speedboat moving with 
a velocity greater than the velocity of the surface waves. Because 
of this analogy, supersonic drag is called wave drag. It is peculiar 
to supersonic flight, and it may represent the major portion of the 
total drag of a body. See HYPERSONIC FLIGHT; SUBSONIC FLIGHT; 
TRANSONIC FLIGHT. (J.E.Sc.] 


Supersonic flow 


Supersonic flow Fluid motion in which the Mach num- 
ber M, defined as the speed of the fluid relative to the sonic speed 
in the same medium, is more than unity. It is, however, common 
to call the flow transonic when 0.8 < M < 1.4, and hypersonic 
when M > 5. See MACH NUMBER. 

Mach waves. A particle moving in a compressible medium, 
such as air, emits acoustic disturbances in the form of spherical 
waves. These waves propagate at the speed of sound (M = 1). 
If the particle moves at a supersonic speed, the generated waves 
cannot propagate upstream of the particle. The spherical waves 
are enveloped in a circular cone called the Mach cone. The gen- 
erators of the Mach cone are called Mach lines or Mach waves. 

Shock waves. When a fluid at a supersonic speed ap- 
proaches an airfoil (or a high-pressure region), no information 
is communicated ahead of the airfoil, and the flow adjusts to 
the downstream conditions through a shock wave. Shock waves 
propagate faster than Mach waves, and the flow speed changes 
abruptly from supersonic to less supersonic or subsonic across 
the wave. Similarly, other properties change discontinuously 
across the wave. A Mach wave is a shock wave of minimum 
strength. A normal shock is a plane shock normal to the direc- 
tion of flow, and an oblique shock is inclined at an angle to the 
direction of flow. The velocity upstream of a shock wave is always 


Mach wave 
N 





Typical normal shock, oblique shock, and Mach wave pattern 
in supersonic flow past a blunt body. M is the Mach number 
and Vis the particle speed. The curved line parallel to normal 
and oblique shock waves indicates the end of the velocity 
vectors. 


supersonic. Downstream of an oblique shock, the velocity may 
be subsonic resulting in a strong shock, or supersonic resulting 
in a weak shock. The downstream velocity component normal 
to any shock wave is always subsonic. There is no change in the 
tangential velocity component across the shock. 

In a two-dimensional supersonic flow around a blunt body 
(see illustration), a normal shock is formed directly in front of the 
body, and extends around the body as a curved oblique shock. 
At a sufficient distance away, the flow field is unaffected by the 
presence of the body, and no discontinuity in velocity occurs. The 
shock then reduces to a Mach wave. See COMPRESSIBLE FLOW; 
FLUID FLOW; SUPERSONIC FLIGHT. [M.A.S.] 


Superstring theory A proposal for a unified theory of 
all interactions, including gravity. At present, the strong, weak, 
and electromagnetic interactions are accounted for within the 
framework of the standard model. This model correctly describes 
experiments up to the highest energies performed so far, and 
gives a complete description of the elementary particles and their 
interactions down to distances of the order of 10718 m. Never- 
theless, it has serious limitations, and attempts to overcome them 
and to unify the forces of nature have been only partly successful. 
Moreover, these attempts have left standing fundamental diffi- 
culties in reconciling gravitation and the laws of quantum me- 
chanics. Superstring theory represents an ambitious program to 
unify all of the interactions observed in nature, including grav- 
itation, in a theory with no unexplained parameters. In other 
words, this theory, if successful, should be able to account for 
all of the particles observed in nature and their interactions. See 
ELEMENTARY PARTICLE; FUNDAMENTAL INTERACTIONS. 

String concept. In string theory, the fundamental objects are 
not point particles, as in standard theories of elementary parti- 
cles, but one-dimensional extended objects, the open and closed 
strings. In such a theory, what are usually called the elemen- 
tary particles are simply particular quantum states of the string. 
In superstring theories, space-time is ten-dimensional (space is 
nine-dimensional). If such theories are to describe nature, six 
dimensions must be “curled up” or “compact.” The main con- 
sequence of such extra dimensions is the existence of certain 
very massive particles. See SPACE-TIME. 

The essential features of string theories can be understood by 
analogy with the strings of a musical instrument. Such strings 
vibrate at a characteristic frequency, as well as any integer multi- 
ple of that frequency. Each of these modes of vibration (so-called 
normal modes) can be excited by plucking or striking the string. 
In classical physics, the amplitudes of vibration of each mode can 
take on a continuum of values. If there were a string of atomic 
dimensions, subject to the laws of quantum mechanics, the en- 
ergies of this quantum string could take on only discrete values, 
corresponding to particular quantum states. See QUANTUM ME- 
CHANICS; VIBRATION. 

The strings of superstring theory are quite similar. The main 
difference is that they obey Einstein’s principles of special rel- 
ativity. As a result, since each quantum state has a particular 
energy, it has a definite mass. Thus, each state of the string be- 
haves as a particle of definite mass. Because it is possible, in 
principle, to pump an arbitrarily large amount of energy into the 
string, the theory contains an infinity of different types of parti- 
cles of arbitrarily large mass. The interactions of these particles 
are governed by the ways in which the strings themselves inter- 
act. To be consistent with the principles of relativity, a string can 
interact only by splitting into two strings or by joining together 
with another string to form a third string. As a result, the interac- 
tions of strings are nearly unique. This geometric picture of string 
interactions translates into a precise set of rules for calculating 
the interaction of individual string states, that is, particles. See 
RELATIVITY. 

Classical solutions. Obtaining a description of superstring 
theory analogous to quantum field theory is an active topic of 
research. However, even though the equations that describe this 


field theory are not completely known at present, it is known how 
to find classical solutions of these equations, and by various tech- 
niques, an enormous number of such solutions have been found. 
These include states in which space-time has any dimension be- 
tween one and ten, and states with many bizarre symmetries and 
spectra. Each of these solutions then corresponds to a possible 
ground state of the system. The theories built around some of 
these states look very much like the real world. Not only are four 
dimensions flat while six are compact, but they possess gauge 
symmetries close to that of the standard model. Some have three 
or four generations of quarks and leptons, as well as light Higgs 
particles, which are of crucial importance in the standard model. 
Many of these solutions possess space-time supersymmetry. See 
GAUGE THEORY; HIGGS BOSON; LEPTON; QUARKS. 

However, if the theory does describe nature, it must have some 
mechanism that chooses one of the possible ground states. Be- 
cause the masses and couplings of the elementary particles de- 
pend only on the choice of ground state, determining this true 
ground state will yield a set of predictions for these quantities. 
If string theory is a correct theory, these predictions must agree 
with the experimental values. [M.Di.] 


Supersymmetry A conjectured enhanced symmetry of 
the laws of nature that would relate two fundamental observed 
classes of particles, bosons and fermions. 

All particles can be classified as fermions, such as the electron 
and quarks, or bosons, such as the photon and graviton. A fun- 
damental characteristic distinguishing these two classes is that 
they carry different quantum-mechanical spin. If the amount 
of spin of an elementary particle is measured in terms of the 
fundamental quantum unit of angular momentum—A, equal to 
Planck’s constant divided by 2x —1then bosons always have in- 
teger amounts of spin (that is, 0, 1, 2 ...), while fermions have 
odd half-integer amounts of spin (that is, 1/2, 3/2, 5/2, ...). See 
SPIN (QUANTUM MECHANICS). 

There is seemingly a fundamental distinction between parti- 
cles with differing amounts of spin. For example, bosons like to 
act collectively (Bose-Einstein statistics), producing such distinc- 
tive behavior as the laser, while, conversely, fermions obey the 
Pauli exclusion principle (and the Pauli-Dirac statistics), which 
disallows two identical fermions to be in the same state, and ex- 
plains the stability of matter. Moreover, all the symmetries that are 
observed in the world relate different particles of the same spin. 
See BOSE-EINSTEIN STATISTICS; FERMI-DIRAC STATISTICS; QUANTUM 
STATISTICS; SYMMETRY LAWS (PHYSICS). 

In contrast, supersymmetry would relate bosons and fermions. 
This would be a remarkable step forward in understanding the 
physical world. However, if supersymmetry were realized as an 
exact symmetry, the particles so related should have almost all 
their characteristics, such as mass and charge, preserved. Explic- 
itly, any fermion of spin 1/2 should have a boson superpartner of 
spin 0, while any gauge boson of spin 1 should have a fermion 
superpartner of spin 1/2. This is apparently a disaster for the 
idea of supersymmetry since it predicts, for instance, that there 
should exist a spin-0 boson partner of the electron, the selectron, 
with electric charge and mass equal to that of the electron. Such 
a particle would be easy to detect and is certainly ruled out by 
very many experiments. 

The crucial caveat to this negative result is the condition that 
supersymmetry be realized as an exact symmetry. A fundamental 
concept of modern physics is spontaneously broken symmetry. 
Physics displays many examples of symmetries that are exact 
symmetries of the fundamental equations describing a system, 
but not of their solutions. In particle physics the spontaneous 
breaking of a symmetry usually results in a difference in the 
masses of the particles related by the symmetry; the amount 
of breaking can be quantified by this mass difference. See SYM- 
METRY BREAKING. 

If supersymmetry is broken by a large amount, then all the 
superpartners have masses much greater than the particles that 
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are currently observed, and there is little hope of seeing evidence 
for supersymmetry. However, evidence that supersymmetry is 
broken by only a moderate amount comes from examination of 
the properties of the fundamental forces at high energy. 

Of the four fundamental forces, the three excluding gravity are 
very similar in their basic formulation; they are all described by 
gauge theories, generalizations of the quantum theory of electro- 
magnetism, and quantum electrodynamics (QED). The strength 
of electrical interaction between two electrons can be quantified 
in terms of a number, the coupling constant œ. However, the 
quantity a, is actually not a constant, but depends on the en- 
ergies at which the interaction strength is measured. In fact, the 
interaction strengths, 01, a2, and a3, of the three forces (exclud- 
ing gravity) all depend on energy, w. The couplings a 93 satisfy 
differential equations—renormalization group equations—that 
depend on the types of elementary particles that exist with mass 
at or below the energy scale jx and that are charged with respect 
to each of the three interactions. If the fundamental particles in- 
clude not only the observed particles but also their superpartners, 
taken to have masses not greater than 1000 GeV heavier than 
their (observed) partners, then from the renormalization group 
equations, the couplings a; are predicted to meet (unify) at a huge 
energy of 2 x 10'° GeV. In contrast, if either supersymmetry is 
not an underlying symmetry of the world, or it is very badly bro- 
ken so that the superpartners are very massive, the couplings fail 
to unify at a single point. See FUNDAMENTAL INTERACTIONS; GAUGE 
THEORY; QUANTUM ELECTRODYNAMICS; RENORMALIZATION. 

Although the unification of couplings is the most significant 
indication that supersymmetry is a new law of nature, there are 
a number of other hints in this same direction. By observing 
the large-scale motions of the galaxies, the average density of 
large volumes of the universe can be deduced, resulting in a 
value that is substantially greater than that directly observed in 
luminous matter such as stars and hot gas. Therefore, a substan- 
tial fraction of the mass of the universe must be composed of 
some form of nonluminous or dark matter. Remarkably, many 
attractive models of supersymmetry predict that the lightest of all 
the superpartners is a weakly interacting massive particle with 
just the right characteristics to be this dark matter. See COSMOL- 
OGY; UNIVERSE; WEAKLY INTERACTING MASSIVE PARTICLE (WIMP). 

[J.M.R.] 


Suppression (electricity) 


Suppression (electricity) The process or technique 
of reducing electrical interference to acceptable levels or to sit- 
uations having no adverse effect. Suppression techniques may 
be applied to the interference source, the intervening path, the 
victim or receptor, or any combination. Normal strategy for in- 
terference control is to first suppress the source, if possible, since 
it may disturb many victims. 

For intentional transmitters, suppressing interference may in- 
clude reducing or eliminating harmonic radiations, restricting the 
bandwidth, or restricting levels of unnecessary or excessive mod- 
ulation sidebands. These are usually accomplished by radio fre- 
quency filters. See ELECTRIC FILTER. 

For many devices involving incidental radiators, such as brush- 
type motors and fluorescent lights, interference suppression may 
require both filtering and shielding. Electrical filtering may take 
the form of transient surge suppressors, feed-through capacitors, 
electromagnetic interference (EMI) filters, ferrite absorbers, iso- 
lation transformers, or Faraday shielded isolation transformers. 
See ELECTRIC PROTECTIVE DEVICES. 

Shielding to control radiation involves using metal boxes, 
cases, cabinet housings, or metalized plastic versions thereof. 
Since the interconnecting cables between equipment offer the 
greatest threat as an “antenna farm,” the dominant suppres- 
sion technique is to shield the cables, wires, or harnesses. For 
the best protection from electromagnetic interference, the cable 
shield should be designed as an extension of the box or equip- 
ment shield. Other forms of interference hardening of cables 
include twisting parallel wire pairs, multiple-layer shields, and 
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absorbing jackets. Where possible, best electromagnetic com- 
patibility performance is obtained by replacing the signal cables 
with fiber-optic links. See ELECTRICAL SHIELDING. 

From a system point of view, the radiated emission propensity 
or pickup susceptibility of interconnected equipment is propor- 
tional to the maximum length or dimension and frequency until 
this length corresponds to an electrical length of about one-half 
wavelength. Thus, to help suppress electromagnetic interference, 
the corresponding frequency should not fall within the passband 
of victims or receptors. See ELECTRICAL INTERFERENCE; ELECTRO- 
MAGNETIC COMPATIBILITY. [D.R.J.W] 


Supramolecular chemistry 


Supramolecular chemistry A highly _interdisci- 
plinary field covering the chemical, physical, and biological fea- 
tures of complex chemical species held together and organized 
by means of intermolecular (noncovalent) bonding interactions. 
See CHEMICAL BONDING; INTERMOLECULAR FORCES. 

When a substrate binds to an enzyme or a drug to its tar- 
get, and when signals propagate between cells, highly selective 
interactions occur between the partners that control the pro- 
cesses. Supramolecular chemistry is concerned with the study 
of the basic features of these interactions and with their imple- 
mentation in biological systems as well as in specially designed 
nonnatural ones. In addition to biochemistry, its roots extend 
into organic chemistry and the synthetic procedures for receptor 
construction, into coordination chemistry and metal ion-ligand 
complexes, and into physical chemistry and the experimental 
and theoretical studies of interactions. See BIOINORGANIC CHEM- 
ISTRY; ENZYME; LIGAND; PHYSICAL ORGANIC CHEMISTRY; PROTEIN. 

The field started with the selective binding of alkali metal 
cations by natural as well as synthetic macrocyclic and macrop- 
olycyclic ligands, the crown ethers and cryptands. This led to 
the emergence of molecular recognition as a new domain of 
chemical research that, by encompassing all types of molecular 
components and interactions as well as both oligo and poly- 
molecular entities, became supramolecular chemistry. It under- 
went rapid growth with the development of synthetic receptor 
molecules of numerous types for the strong and selective bind- 
ing of cationic, anionic, or neutral complementary substrates 
of organic, inorganic, or biological nature by means of vari- 
ous interactions (electrostatic, hydrogen bonding, van der Waals, 
and donor-acceptor). Molecular recognition implies the (molec- 
ular) storage and the (supramolecular) retrieval and processing 
of molecular structural (geometrical and interactional) informa- 
tion. See HYDROGEN BOND; MACROCYCLIC COMPOUND; MOLECULAR 
RECOGNITION. 

Many types of receptor molecules have been explored (crown 
ethers, cryptands, cyclodextrins, calixarenes, cavitands, cyclo- 
phanes, cryptophanes, and so on), and many others may be 
imagined for the binding of complementary substrates of chemi- 
cal or biological significance. They allow, for instance, the devel- 
opment of substrate-specific sensors or the recognition of struc- 
tural features in biomolecules (for example, nucleic acid probes, 
affinity cleavage reagents, and enzyme inhibitors). See BIOPOLY- 
MER; CYCLOPHANE; ENZYME INHIBITION. 

A major step in the development of supramolecular chemistry 
over the last 20 years involved the design of systems capable of 
spontaneously generating well-defined, supramolecular entities 
by self-assembly under a given set of conditions. 

The information necessary for supramolecular self-assembly 
to take place is stored in the components, and the program that 
it follows operates via specific interactional algorithms based on 
binding patterns and molecular recognition events. Thus, rather 
than being preorganized, constructed entities, these systems may 
be considered as self-organizing, programmed supramolecular 
systems. 

Self-assembly and self-organization have recently been imple- 
mented in numerous types of organic and inorganic systems. By 
clever use of metal coordination, hydrogen bonding, and donor- 
acceptor interactions, researchers have achieved the sponta- 


neous formation of a variety of novel and intriguing species 
such as inorganic double and triple helices termed helicates, 
catenates, threaded entities (rotaxanes), cage compounds, grids 
of metal ions, and so on. 

Another major development concerns the design of molecular 
species displaying the ability to perform self-replication, based on 
components containing suitable recognition groups and reactive 
functions. Self-recognition processes involve the spontaneous 
selection of the correct partner(s) in a self-assembly event—for 
instance, the correct ligand strand in helicate formation. 

A major area of interest is the design of supramolecular de- 
vices built on photoactive, electroactive, or ionoactive compo- 
nents, operating respectively with photons, electrons, or ions. 
Thus, a variety of photonic devices based on photoinduced en- 
ergy and electron transfer may be imagined. Molecular wires, ion 
carriers, and channels facilitate the flow of electrons and ions 
through membranes. Such functional entities represent entries 
into molecular photonics, electronics, and ionics, which deal with 
the storage, processing, and transfer of materials, signals, and in- 
formation at the molecular and supramolecular levels. Dynamic 
and mechanical devices exploit the control of motion within 
molecular and supramolecular entities. See INORGANIC PHOTO- 
CHEMISTRY; ION TRANSPORT; PHOTOCHEMISTRY. 

The design of systems that are controlled, programmed, 
and functionally self-organized by means of molecular infor- 
mation contained in their components represents new hori- 
zons in supramolecular chemistry and provides an original ap- 
proach to nanoscience and nanotechnology. In particular, the 
spontaneous but controlled generation of well-defined, func- 
tional supramolecular architectures of nanometric size through 
self-organization—supramolecular nanochemistry—represents 
a means of performing programmed engineering and process- 
ing of nanomaterials. It offers a powerful alternative to the de- 
manding procedures of nanofabrication and nanomanipulation, 
bypassing the need for external intervention. A rich variety of 
architectures, properties, and processes should result from this 
blending of supramolecular chemistry with materials science. 
See NANOCHEMISTRY; NANOTECHNOLOGY. [J.M.Le.] 


Surface (geometry) A two-dimensional geometric fig- 
ure (a collection of points) in three-dimensional space. The sim- 
plest example is a plane—a flat surface. Some other common 
surfaces are spheres, cylinders, and cones, the names of which 
are also used to describe the three-dimensional geometric fig- 
ures that are enclosed (or partially enclosed) by those surfaces. 
In a similar way, cubes, parallelepipeds, and other polyhedra are 
surfaces. See CUBE; POLYHEDRON; SOLID (GEOMETRY). 

Any bounded plane region has a measure called the area. If 
a surface is approximated by polygonal regions joined at their 
edges, an approximation to the area of the surface is obtained 
by summing the areas of these regions. The area of a surface is 
the limit of this sum if the number of polygons increases while 
their areas all approach zero. See AREA; CALCULUS; INTEGRATION; 
PLANE GEOMETRY; POLYGON. 

Methods of description. The shape of a surface can be de- 
scribed by several methods. The simplest is to use the commonly 
accepted name of the surface, such as sphere or cube. In math- 
ematical discussions, surfaces are normally defined by one or 
more equations, each of which gives information about a rela- 
tionship that exists between coordinates of points of the surface, 
using some suitable coordinate system. See COORDINATE SYS- 
TEMS. 

Some surfaces are conveniently described by explaining how 
they might be formed. If a curve, called the generator in three- 
dimensional space, is allowed to move in some manner, then 
each position the generator occupies during this motion is a 
collection of points, and the set of all such points constitutes a 
surface that can be said to be swept out by the generator. In 
particular, if the generator is a straight line, a ruled surface is 
formed. If the generator is a straight line and the motion is such 


